MATHEMATICS 


A Foundation for Electronic, Electrical, 
Communications and Systems Engineers 


FIFTH EDITION 


Anthony Croft - Robert Davison “a 
Martin Hargreaves + James Flint <= 


a 


Engineering Mathematics 


Anthony Croft 


Loughborough University 
Robert Davison 


Martin Hargreaves 
Chartered Physicist 


James Flint 
Loughborough University 


@ Pearson 


Harlow, England * London « New York « Boston « San Francisco * Toronto * Sydney 
Dubai * Singapore * Hong Kong *« Tokyo *« Seoul + Taipei * New Delhi 
Cape Town * Sado Paulo * Mexico City * Madrid * Amsterdam * Munich ¢ Paris * Milan 


PEARSON EDUCATION LIMITED 
Edinburgh Gate 

Harlow CM20 2JE 

United Kingdom 

Tel: +44 (0)1279 623623 

Web: www.pearson.com/uk 


First edition published under the Addison-Wesley imprint 1992 (print) 
Second edition published under the Addison-Wesley imprint 1996 (print) 
Third edition published under the Prentice Hall imprint 2001 (print) 
Fourth edition published 2013 (print and electronic) 

Fifth edition published 2017 (print and electronic) 


© Addison-Wesley Publishers Limited 1992, 1996 (print) 
© Pearson Education Limited 2001 (print) 
© Pearson Education Limited 2013, 2017 (print and electronic) 


The rights of Anthony Croft, Robert Davison, Martin Hargreaves and James Flint 
to be identified as authors of this work have been asserted by them in 
accordance with the Copyright, Designs and Patents Act 1988. 


The print publication is protected by copyright. Prior to any prohibited reproduction, 
storage in a retrieval system, distribution or transmission in any form or by any means, 
electronic, mechanical, recording or otherwise, permission should be obtained from 
the publisher or, where applicable, a licence permitting restricted copying in the 
United Kingdom should be obtained from the Copyright Licensing Agency Ltd, 
Barnard’s Inn, 86 Fetter Lane, London EC4A 1EN. 


The ePublication is protected by copyright and must not be copied, reproduced, transferred, 
distributed, leased, licensed or publicly performed or used in any way except as specifically 
permitted in writing by the publishers, as allowed under the terms and conditions under which 
it was purchased, or as strictly permitted by applicable copyright law. Any unauthorised 
distribution or use of this text may be a direct infringement of the authors’ and the publisher's 
rights and those responsible may be liable in law accordingly. 


Pearson Education is not responsible for the content of third-party internet sites. 


ISBN: 978-1-292-14665-2 (print) 
978-1-292-14667-6 (PDF) 
978-1-292-14666-9 (ePub) 


British Library Cataloguing-in-Publication Data 


A catalogue record for the print edition is available from the British Library 


Library of Congress Cataloging-in-Publication Data 

Names: Croft, Tony, 1957- author. 

Title: Engineering mathematics : a foundation for electronic, electrical, 
communications and systems engineers / Anthony Croft, Loughborough 
University, Robert Davison, De Montfort University, Martin Hargreaves, 

De Montfort University, James Flint, Loughborough University. 

Description: Fifth edition. | Harlow, England ; New York : Pearson, 2017. || 
Revised edition of: Engineering mathematics : a foundation for electronic, 
electrical, communications, and systems engineers / Anthony Croft, Robert 
Davison, Martin Hargreaves. 3rd editon. 2001. | Includes index. 

Identifiers: LCCN 2017011081| ISBN 9781292146652 (Print) | ISBN 9781292146676 
(PDF) | ISBN 9781292146669 (ePub) 

Subjects: LCSH: Engineering mathematics. | Electrical 
engineering—Mathematics. | Electronics-Mathematics. 

Classification: LCC TA330 .C76 2017 | DDC 510-dc23 

LC record available at https://lccn.loc.gov/2017011081 

A catalog record for the print edition is available from the Library of Congress 


10987654321 
21 20 19 18 17 


Print edition typeset in 10/12 Times Roman by iEnerziger Aptara , Ltd. 
Printed in Slovakia by Neografia 


NOTE THAT ANY PAGE CROSS REFERENCES REFER TO THE PRINT EDITION 


To Kate, Tom and Harvey - A.C. 
To Kathy - R.D. 
To my father and mother - M.H. 


To Suzanne, Alexandra and Dominic - J.F. 


This page intentionally left blank 


Contents 


Preface xvii 
Acknowledgements xix 
Review of algebraic techniques 1 
1.1. Introduction 1 
1.2 Laws of indices 2 
1.3 Number bases 11 
1.4 Polynomial equations 20 
1.5 Algebraic fractions 26 
1.6 Solution of inequalities 33 
1.7. Partial fractions 39 
1.8 Summation notation 46 

Review exercises 1 50 
Engineering functions 54 
2.1 Introduction 54 
2.2 Numbers and intervals 55 
2.3. Basic concepts of functions 56 
2.4 Review of some common engineering functions and techniques 70 

Review exercises 2 113 
The trigonometric functions 115 
3.1. Introduction 115 
3.2 Degrees and radians 116 
3.3. The trigonometric ratios 116 
3.4 The sine, cosine and tangent functions 120 
3.5 The sinc x function 123 
3.6 Trigonometric identities 125 
3.7. Modelling waves using sin¢ and cost 131 
3.8 Trigonometric equations 144 


Review exercises 3 150 


viii 


Contents 


Chapter 4 


Chapter 5 


Chapter 6 


Chapter 7 


Chapter 8 


Coordinate systems 


4.1 
4.2 
4.3 
4.4 
4.5 
4.6 
4.7 


Introduction 

Cartesian coordinate system — two dimensions 
Cartesian coordinate system — three dimensions 
Polar coordinates 

Some simple polar curves 

Cylindrical polar coordinates 

Spherical polar coordinates 

Review exercises 4 


Discrete mathematics 


5.1 
5.2 
5.3 
5.4 


Introduction 

Set theory 

Logic 

Boolean algebra 
Review exercises 5 


Sequences and series 


6.1 — Introduction 

6.2 Sequences 

6.3 Series 

6.4 The binomial theorem 

6.5 Power series 

6.6 Sequences arising from the iterative solution 
of non-linear equations 
Review exercises 6 

Vectors 

7.1. Introduction 

7.2. Vectors and scalars: basic concepts 

7.3. Cartesian components 

7.4 Scalar fields and vector fields 

7.5 The scalar product 

7.6 The vector product 

7.7 Vectors of nm dimensions 


Review exercises 7 


Matrix algebra 


8.1 
8.2 


Introduction 
Basic definitions 


154 


154 
154 
157 
159 
163 
166 
170 
173 


175 


175 
175 
183 
185 
197 


Contents ix 


8.3. Addition, subtraction and multiplication 259 
8.4 Using matrices in the translation and rotation of vectors 267 
8.5 Some special matrices 271 
8.6 The inverse of a2 x 2 matrix 274 
8.7. Determinants 278 
8.8 The inverse of a3 x 3 matrix 281 
8.9 Application to the solution of simultaneous equations 283 
8.10 Gaussian elimination 286 
8.11 Eigenvalues and eigenvectors 294 
8.12 Analysis of electrical networks 307 
8.13 Iterative techniques for the solution of simultaneous equations 312 
8.14 Computer solutions of matrix problems 319 
Review exercises 8 321 
Chapter9 Complex numbers 324 
9.1 Introduction 324 
9.2 Complex numbers 325 
9.3 Operations with complex numbers 328 
9.4 Graphical representation of complex numbers 332 
9.5 Polar form of a complex number 333 
9.6 Vectors and complex numbers 336 
9.7 The exponential form of a complex number 337 
9.8 Phasors 340 
9.9 De Moivre’s theorem 344 
9.10 Loci and regions of the complex plane 351 
Review exercises 9 354 
Chapter10 Differentiation 356 
10.1 Introduction 356 
10.2 Graphical approach to differentiation 357 
10.3 Limits and continuity 358 
10.4 Rate of change at a specific point 362 
10.5 Rate of change at a general point 364 
10.6 Existence of derivatives 370 
10.7 Common derivatives 372 
10.8 Differentiation as a linear operator 375 
Review exercises 10 385 
Chapter11 Techniques of differentiation 386 


11.1. Introduction 386 


x 


Contents 


Chapter 12 


Chapter 13 


Chapter 14 


Chapter 15 


Chapter 16 


11.2 Rules of differentiation 


11.3 Parametric, implicit and logarithmic differentiation 


11.4 Higher derivatives 
Review exercises 11 


Applications of differentiation 
12.1 Introduction 

12.2 Maximum points and minimum points 
12.3 Points of inflexion 


12.4 The Newton-Raphson method for solving equations 


12.5 Differentiation of vectors 


Review exercises 12 


Integration 


13.1 Introduction 

13.2 Elementary integration 

13.3 Definite and indefinite integrals 
Review exercises 13 


Techniques of integration 


14.1 Introduction 

14.2 Integration by parts 

14.3 Integration by substitution 

14.4 Integration using partial fractions 
Review exercises 14 


Applications of integration 

15.1 Introduction 

15.2 Average value of a function 

15.3 Root mean square value of a function 
Review exercises 15 


Further topics in integration 


16.1 Introduction 
16.2 Orthogonal functions 
16.3 Improper integrals 
16.4 Integral properties of the delta function 
16.5 Integration of piecewise continuous functions 
16.6 Integration of vectors 
Review exercises 16 


Chapter 17 


Numerical integration 


17.1 
17.2 
17.3 


Introduction 
Trapezium rule 
Simpson's rule 
Review exercises 17 


Contents 


Chapter18 Taylor polynomials, Taylor series and Maclaurin series 


Chapter 19 


Chapter 20 


Chapter 21 


18.1 
18.2 
18.3 
18.4 
18.5 
18.6 


Introduction 

Linearization using first-order Taylor polynomials 
Second-order Taylor polynomials 

Taylor polynomials of the nth order 

Taylor’s formula and the remainder term 

Taylor and Maclaurin series 

Review exercises 18 


Ordinary differential equations | 


19.1 
19.2 
19.3 


19.4 
19.5 
19.6 
19.7 
19.8 


Introduction 
Basic definitions 


First-order equations: simple equations and separation 
of variables 


First-order linear equations: use of an integrating factor 
Second-order linear equations 

Constant coefficient equations 

Series solution of differential equations 

Bessel's equation and Bessel functions 

Review exercises 19 


Ordinary differential equations Il 


20.1 
20.2 
20.3 
20.4 
20.5 
20.6 
20.7 
20.8 


Introduction 

Analogue simulation 

Higher order equations 
State-space models 

Numerical methods 

Euler’s method 

Improved Euler method 
Runge-Kutta method of order 4 
Review exercises 20 


The Laplace transform 


21.1 
21.2 


Introduction 


Definition of the Laplace transform 


xi 


507 


513 
517 
521 
524 


540 
547 


xii 


Contents 


Chapter 22 


Chapter 23 


21.3 
21.4 
21.5 
21.6 
21.7 
21.8 
21.9 


Laplace transforms of some common functions 

Properties of the Laplace transform 

Laplace transform of derivatives and integrals 

Inverse Laplace transforms 

Using partial fractions to find the inverse Laplace transform 
Finding the inverse Laplace transform using complex numbers 
The convolution theorem 


21.10 Solving linear constant coefficient differential 


equations using the Laplace transform 


21.11 Transfer functions 


21.12 Poles, zeros and the s plane 


21.13 Laplace transforms of some special functions 


Review exercises 21 


Difference equations and the z transform 


22.1 
22.2 
22.3 
22.4 
22.5 
22.6 
22.7 
22.8 
22.9 


Introduction 

Basic definitions 

Rewriting difference equations 

Block diagram representation of difference equations 
Design of a discrete-time controller 

Numerical solution of difference equations 

Definition of the z transform 

Sampling a continuous signal 


The relationship between the z transform and the 
Laplace transform 


22.10 Properties of the z transform 


22.11 Inversion of z transforms 


22.12 The z transform and difference equations 


Review exercises 22 


Fourier series 


23.1 
23.2 
23.3 
23.4 
23.5 
23.6 
23.7 
23.8 
23.9 


Introduction 

Periodic waveforms 

Odd and even functions 

Orthogonality relations and other useful identities 
Fourier series 

Half-range series 

Parseval’s theorem 

Complex notation 

Frequency response of a linear system 


Review exercises 23 


Chapter 24 


Chapter 25 


Chapter 26 


Contents 


The Fourier transform 


24.1 
24.2 
24.3 
24.4 
24.5 
24.6 
24.7 


24.8 
24.9 


Introduction 

The Fourier transform — definitions 

Some properties of the Fourier transform 
Spectra 

The t—w duality principle 

Fourier transforms of some special functions 


The relationship between the Fourier transform 
and the Laplace transform 


Convolution and correlation 
The discrete Fourier transform 


24.10 Derivation of the d.f.t. 
24.11 Using the d.f.t. to estimate a Fourier transform 


24.12 Matrix representation of the d.ft. 


24.13 Some properties of the d.f.t. 


24.14 The discrete cosine transform 


24.15 Discrete convolution and correlation 


Review exercises 24 


Functions of several variables 


25.1 
25.2 
25.3 
25.4 
25.5 
25.6 
25.7 


Introduction 

Functions of more than one variable 

Partial derivatives 

Higher order derivatives 

Partial differential equations 

Taylor polynomials and Taylor series in two variables 
Maximum and minimum points of a function of two variables 


Review exercises 25 


Vector calculus 


26.1 
26.2 
26.3 
26.4 
26.5 
26.6 
26.7 


Introduction 

Partial differentiation of vectors 

The gradient of a scalar field 

The divergence of a vector field 

The curl of a vector field 

Combining the operators grad, div and curl 
Vector calculus and electromagnetism 
Review exercises 26 


xiii 


774 


xiv 


Contents 


Chapter 27 


Chapter 28 


Chapter 29 


Line integrals and multiple integrals 


27.1 Introduction 
27.2 Line integrals 
27.3 Evaluation of line integrals in two dimensions 
27.4 Evaluation of line integrals in three dimensions 
27.5 Conservative fields and potential functions 
27.6 Double and triple integrals 
27.7. Some simple volume and surface integrals 
27.8 The divergence theorem and Stokes’ theorem 
27.9 Maxwell's equations in integral form 

Review exercises 27 
Probability 
28.1 Introduction 
28.2 Introducing probability 
28.3 Mutually exclusive events: the addition law of probability 
28.4 Complementary events 
28.5 Concepts from communication theory 
28.6 Conditional probability: the multiplication law 
28.7 Independent events 


Review exercises 28 


Statistics and probability distributions 


29.1 
29.2 
29.3 
29.4 
29.5 
29.6 
29.7 
29.8 
29.9 


Introduction 

Random variables 

Probability distributions — discrete variable 
Probability density functions — continuous variable 
Mean value 

Standard deviation 

Expected value of a random variable 

Standard deviation of a random variable 
Permutations and combinations 


29.10 The binomial distribution 
29.11 The Poisson distribution 
29.12 The uniform distribution 
29.13 The exponential distribution 
29.14 The normal distribution 
29.15 Reliability engineering 


Review exercises 29 


Appendix | 


Appendix II 
Appendix III 
Appendix IV 


Index 


Contents 


Representing a continuous function and a sequence 
as a sum of weighted impulses 


The Greek alphabet 
SI units and prefixes 


The binomial expansion of (=) 


nN 


ON THE 
WEBSITE 


* 


XV 


This page intentionally left blank 


Preface 


Audience 


This book has been written to serve the mathematical needs of students engaged in a 
first course in engineering at degree level. It is primarily aimed at students of electronic, 
electrical, communications and systems engineering. Systems engineering typically en- 
compasses areas such as manufacturing, control and production engineering. The text- 
book will also be useful for engineers who wish to engage in self-study and continuing 
education. 


Motivation 


Engineers are called upon to analyse a variety of engineering systems, which can be 
anything from a few electronic components connected together through to a complete 
factory. The analysis of these systems benefits from the intelligent application of mathe- 
matics. Indeed, many cannot be analysed without the use of mathematics. Mathematics 
is the language of engineering. It is essential to understand how mathematics works in 
order to master the complex relationships present in modern engineering systems and 
products. 


Aims 


There are two main aims of the book. Firstly, we wish to provide an accessible, readable 
introduction to engineering mathematics at degree level. The second aim is to encourage 
the integration of engineering and mathematics. 


Content 


The first three chapters include a review of some important functions and techniques 
that the reader may have met in previous courses. This material ensures that the book is 
self-contained and provides a convenient reference. 

Traditional topics in algebra, trigonometry and calculus have been covered. Also in- 
cluded are chapters on set theory, sequences and series, Boolean algebra, logic, differ- 
ence equations and the z transform. The importance of signal processing techniques is 
reflected by a thorough treatment of integral transform methods. Thus the Laplace, z and 
Fourier transforms have been given extensive coverage. 

In the light of feedback from readers, new topics and new examples have been added 
in the fifth edition. Recognizing that motivation comes from seeing the applicability 
of mathematics we have focused mainly on the enhancement of the range of applied 
examples. These include topics on the discrete cosine transform, image processing, ap- 
plications in music technology, communications engineering and frequency modulation. 


xviii 


Preface 


Style 


The style of the book is to develop and illustrate mathematical concepts through ex- 
amples. We have tried throughout to adopt an informal approach and to describe math- 
ematical processes using everyday language. Mathematical ideas are often developed 
by examples rather than by using abstract proof, which has been kept to a minimum. 
This reflects the authors’ experience that engineering students learn better from prac- 
tical examples, rather than from formal abstract development. We have included many 
engineering examples and have tried to make them as free-standing as possible to keep 
the necessary engineering prerequisites to a minimum. The engineering examples, which 
have been carefully selected to be relevant, informative and modern, range from short il- 
lustrative examples through to complete sections which can be regarded as case studies. 
A further benefit is the development of the link between mathematics and the physical 
world. An appreciation of this link is essential if engineers are to take full advantage of 
engineering mathematics. The engineering examples make the book more colourful and, 
more importantly, they help develop the ability to see an engineering problem and trans- 
late it into a mathematical form so that a solution can be obtained. This is one of the most 
difficult skills that an engineer needs to acquire. The ability to manipulate mathemati- 
cal equations is by itself insufficient. It is sometimes necessary to derive the equations 
corresponding to an engineering problem. Interpretation of mathematical solutions in 
terms of the physical variables is also essential. Engineers cannot afford to get lost in 
mathematical symbolism. 


Format 


Important results are highlighted for easy reference. Exercises and solutions are provided 
at the end of most sections; it is essential to attempt these as the only way to develop 
competence and understanding is through practice. A further set of review exercises is 
provided at the end of each chapter. In addition some sections include exercises that are 
intended to be carried out on a computer using a technical computing language such as 
MATLAB®, GNU Octave, Mathematica or Python®. The MATLAB® command syntax 
is supported in several software packages as well as MATLAB? itself and will be used 
throughout the book. 


Supplements 


A comprehensive Solutions Manual is obtainable free of charge to lecturers using this 
textbook. It is also available for download via the web at www.pearsoned.co.uk/croft. 

Finally we hope you will come to share our enthusiasm for engineering mathematics 
and enjoy the book. 


Anthony Croft 
Robert Davison 
Martin Hargreaves 


James Flint 
March 2017 


Acknowledgements 


We are grateful to the following for permission to reproduce copyright material: 


Tables 


Table 29.7 from Biometrika Tables for Statisticians, Vol. 1, New York: Holt, Rinehart & 
Winston (Hays, W.L. and Winkler, R.L. 1970) Table 1, © Cambridge University Press. 


Text 


General Displayed Text on page xviii from https://www.mathworks.com/products/ 
matlab.html, MATLAB® isa registered trademark of The MathWorks, Inc.; General Dis- 
played Text xviii from Mathematica, https://www.wolfram.com/mathematica/, © Wol- 
fram; General Displayed Text xviii from https://www.python.org/, Python® and the 
Python logos are trademarks or registered trademarks of the Python Software Foun- 
dation, used by Pearson Education Ltd with permission from the Foundation; General 
Displayed Text on page 291 from http://www.blu-raydisc.com/en/, Blu-ray Disc™ is 
a trademark owned by Blu-ray Disc Association (BDA); General Displayed Text on 
page 291 from http://wimaxforum.org/home, WiMAX©® is a registered trademarks of 
the WiMAX Forum. This work is produced by Pearson Education and is not endorsed 
by any trademark owner referenced in this publication. 


This page intentionally left blank 


a pre Ce 


Brel \" 


Contents 


Review of algebraic 
techniques 


1.1 Introduction 1 
1.2 Laws of indices 2 
1.3 Number bases 11 
1.4 Polynomial equations 20 
1.5 Algebraic fractions 26 
1.6 Solution of inequalities 33 
1.7 Partial fractions 39 
1.8 Summation notation 46 
Review exercises 1 50 


INTRODUCTION 


This chapter introduces some algebraic techniques which are commonly used in engi- 
neering mathematics. For some readers this may be revision. Section 1.2 examines the 
laws of indices. These laws are used throughout engineering mathematics. Section 1.3 
looks at number bases. Section 1.4 looks at methods of solving polynomial equations. 
Section 1.5 examines algebraic fractions, while Section 1.6 examines the solution of 
inequalities. Section 1.7 looks at partial fractions. The chapter closes with a study of 
summation notation. 

Computers are used extensively in all engineering disciplines to perform calcula- 
tions. Some of the examples provided in this book make use of the technical comput- 
ing language MATLAB®, which is commonly used in both an academic and industrial 
setting. 

Because MATLAB® and many other similar languages are designed to compute not 
just with single numbers but with entire sequences of numbers at the same time, data 
is entered in the form of arrays. These are multi-dimensional objects. Two particular 
types of array are vectors and matrices which are studied in detail in Chapters 7 and 8. 


2 Chapter 1 Review of algebraic techniques 


Example 1.1 


Solution 


Example 1.2 


Solution 


1.2.1 


Apart from being able to perform basic mathematical operations with vectors and 
matrices, MATLAB® has, in addition, a vast range of built-in computational functions 
which are straightforward to use but nevertheless are very powerful. Many of these high- 
level functions are accessible by passing data to them in the form of vectors and matrices. 
A small number of these special functions are used and explained in this text. How- 
ever, to get the most out of a technical computing language it is necessary to develop 
a good understanding of what the software can do and to make regular reference to the 
manual. 


LAWS OF INDICES 


Consider the product 6 x 6 x 6 x 6 x 6. This may be written more compactly as 6°. We 
call 5 the index or power. The base is 6. Similarly, y x y x y x y may be written as y*. 
Here the base is y and the index is 4. 


Write the following using index notation: 


(a) (—-2)(-2)(-2)  ) 4.4455 (©) 2 cede a 


XXXX @) bb(—b) 


(a) (—2)(—2)(—2) may be written as (—2)°. 


(b) 4.4.4.5.5 may be written as 4°57. 


3 
(c) eae may be written as i 
XXXX x 
(d) Note that (—a)(—a) = aa since the product of two negative quantities is positive. 
So aa(—a)(—a) = aaaa = a‘. Also bb(—b) = —bbb = —b*. Hence 


aa(—a)(—a)__ a a’ 


bb(—b) ss — BB b3 


Evaluate 


(a7? tb) (3)? fey 273)" 


(a) 3 =7.7.7 = 343 
(b) (—3)3 = (—3)(—3)(—3) = —27 
(c) 23(-3)* = 8(81) = 648 


Most scientific calculators have an x” button to enable easy calculation of expressions 
of a similar form to those in Example 1.2. 


Multiplying expressions involving indices 
Consider the product (6?) (6°). We may write this as 


(67)(6°) = (6.6)(6.6.6) =@ 


Example 1.3 


Solution 


1.2 Laws ofindices 3 


So 
66° = 6 


This illustrates the first law of indices which is 


When expressions with the same base are multiplied, the indices are added. 


Simplify each of the following expressions: 


(a) 3°39) (by) 43444 (ec) 'x® dd) yy’? 


(a) 39310 = 39+10 — 319 
(b) 434446 = 43+4+6 = 4i3 
3+6 9 


(c) x3x® = x3+6 = x 


(d) y*y’y = y 


Engineering application 1.1 


Power dissipation in a resistor 


44243 _ 19 
= 


The resistor is one of the three fundamental electronic components. The other two 
are the capacitor and the inductor, which we will meet later. The role of the resistor 
is to reduce the current flow within the branch of a circuit for a given voltage. As 
current flows through the resistor, electrical energy is converted into heat. Because 
the energy is lost from the circuit and is effectively wasted, it is termed dissipated 
energy. The rate of energy dissipation is known as the power, P, and is given by 


P= irik (1.1) 


where / is the current flowing through the resistor and R is the resistance value. Note 
that the current is raised to the power 2. Note that power, P, is measured in watts; 
current, J, is measured in amps; and resistance, R, is measured in ohms. 

There is an alternative formula for power dissipation in a resistor that uses the volt- 
age, V, across the resistor. To obtain this alternative formula we need to use Ohm’s 
law, which states that the voltage across a resistor, V, and the current passing through 
it, are related by the formula 


Vis (1.2) 
From Equation (1.2) we see that 
V 
— 13} 
5 (1.3) 


Combining Equations (1.1) and (1.3) gives 


4 Chapter 1 Review of algebraic techniques 


Example 1.4 


Solution 


1.2.2 


Example 1.5 


Solution 


Note that in this formula for P, the voltage is raised to the power 2. Note an im- 
portant consequence of this formula is that doubling the voltage, while keeping the 
resistance fixed, results in the power dissipation increasing by a factor of 4, that is 
2. Also trebling the voltage, for a fixed value of resistance, results in the power dis- 
sipation increasing by a factor of 9, that is 37. 

Similar considerations can be applied to Equation 1.1. For a fixed value of resis- 
tance, doubling the current results in the power dissipation increasing by a factor of 
4, and trebling the current results in the power dissipation increasing by a factor of 9. 


Consider the product 3(3°). Now 
337) ]=30.33)=3° 


Also, using the first law of indices we see that 3!37 = 3%. This suggests that 3 is the 
same as 3!. This illustrates the general rule: 


a=a 


Raising a number to the power | leaves the number unchanged. 


Simplify (a) 5°5 


(b) x°xx? 


(a) 565 = 5orl = 5’ 


(b) x8 xx2 = aotl+2 = x6 


Dividing expressions involving indices 
5 
Consider the expression 2 


4 4.4.4.4.4 

eB  444— 
= 4.4 by cancelling 4s 
= 4? 


This serves to illustrate the second law of indices which is 


When expressions with the same base are divided, the indices are subtracted. 


Simplify 

(a) = (b) a (c) (@) " 
(a) >. a eo 

yy) P= 2) = 2 


1.2 Laws ofindices 5 


Ke} 


(c) a =P? =x4 
x3 
6 
by _ 
i as 
3 
Consider the expression Be Using the second law of indices we may write 
2? — 33-3 — 79 
2 7 


3 
But, clearly, B = 1, and so 2° = 1. This illustrates the general rule: 


a&=1 
Any expression raised to the power 0 is 1. 


1.2.3 Negative indices 
4B 
Consider the expression rs We can write this as 
43 4.4.4 1 1 
4 44444 44° @ 
Alternatively, using the second law of indices we have 


- a 43-5 _ 4-2 
45 a 
So we see that 
— b 
42 


Thus we are able to interpret negative indices. The sign of an index changes when the 
expression is inverted. In general we can state 


Example 1.6 Evaluate the following: 


* 2 6-3 
(a) 3° (b) 73 fe)" (d) (-3)* (e) 62 
: = 1 1 
Solution (a) 3? = = 5 
(b) a = 2(43) = 2(64) = 128 
() 34=4 = 
~ 31 3 
(@ (-3)2 = = 
(-3)? 9 
OP 2 pee got 
73-8 =6 si 6 


6 
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Example 1.7 Write the following expressions using only positive indices: 


Solution 


=) 


@r &art OS Oy 
Ce ee # 

(b) =5 . 

() = ayed 

(d) 3xy3 = a 


Engineering application 1.2 


Power density of a signal transmitted by a radio antenna 


A radio antenna is a device that is used to convert electrical energy into electromag- 
netic radiation, which is then transmitted to distant points. 

An ideal theoretical point source radio antenna which radiates the same power in 
all directions is termed an isotropic antenna. When it transmits a radio wave, the wave 
spreads out equally in all directions, providing there are no obstacles to block the 
expansion of the wave. The power generated by the antenna is uniformly distributed 
on the surface of an expanding sphere of area, A, given by 


A= 4nr 


where r is the distance from the generating antenna to the wave front. 

The power density, S, provides an indication of how much of the signal can po- 
tentially be received by another antenna placed at a distance r. The actual power 
received depends on the effective area or aperture of the antenna, which is usually 
expressed in units of m7’. 

Electromagnetic field exposure limits for humans are sometimes specified in terms 
of a power density. The closer a person is to the transmitter, the higher the power 
density will be. So a safe distance needs to be determined. 

The power density is the ratio of the power transmitted, P., to the area over which 
it is spread 


ce power transmitted 7 IP. a Ip. Wm 
area Arr © 4i 


Note that r in this equation has a negative index. This type of relationship is 
known as an inverse square law and is found commonly in science and engineering. 

Note that if the distance, r, is doubled, then the area, A, increases by a factor of 
4 (i.e. 27). If the distance is trebled, the area increases by a factor of 9 (i.e. 37) and 
so on. This means that as the distance from the antenna doubles, the power density, 
S, decreases to a quarter of its previous value; if the distance trebles then the power 
density is only a ninth of its previous value. 
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1.2.4 Multiple indices 


Consider the expression (4*)*. This may be written as 


(43)? = 4.43 -_ 43+3 = 46 


This illustrates the third law of indices which is 
(Gian — Ghee 


Note that the indices m and n have been multiplied. 


Example 1.8 Write the following expressions using a single index: 


ager G0") ©@” @e)- 


Solution (a) (37)* =3?%4 = 38 
(b) (7 3=7 2x3 _ 7 6 
(c) (x7)-3 = 2% (-3) = x6 


(d) (x77) = x 2x3 = x6 


Consider the expression (2*5”)?. We see that 
(245)3 = (2457) (2457) (2457) 
= 22S 5 
= 91256 


This illustrates a generalization of the third law of indices which is 


(ab")* = al™ pr* 


Example 1.9 Remove the brackets from 
(a) (2x°)° (b) (—3y*)? (c) @y)? 
Solution (a) (2x2)? = (2!x2)3 = 23x° = 8x6 
(b) (—3y*)? = (—3)*y* = 9y* 
(c) @ yy? =x Fy? 


Engineering application 1.3 


Radar scattering 


It has already been shown in Engineering application 1.2 that the power density of 
an isotropic transmitter of radio waves is 


8 


Chapter 1 Review of algebraic techniques 


It is possible to use radio waves to detect distant objects. The technique involves 
transmitting a radio signal, which is then reflected back when it strikes a target. This 
weak reflected signal is then picked up by a receiving antenna, thus allowing a number 
of properties of the target to be deduced, such as its angular position and distance from 
the transmitter. This system is known as radar, which was originally an acronym 
standing for RAdio Detection And Ranging. 

When the wave hits the target it produces a quantity of reflected power. The 
power depends upon the object’s radar cross-section (RCS), normally denoted by 
the Greek lower case letter sigma, o, and having units of m?. The power reflected at 
the object, P., is given by 


Poo 
L=eS<o= W 
Ant 


Some military aircraft use special techniques to minimize the RCS in order to reduce 
the amount of power they reflect and hence minimize the chance of being detected. 
If the reflected power at the target is assumed to spread spherically, when it 
returns to the transmitter position it will have the power density, S,, given by 
= power reflected at target = P. a 
area 4n 


Substituting for the reflected power, P., gives 


Rae 
s power reflected at target 4n 5) Po 
= — ie ae 
4 area 4n 4n x 4 


[Gera =e 


~ (am 


Note that the product of the two r~” terms has been calculated using the third law of 
indices. 

This example illustrates one of the main challenges with radar design which is that 
the power density returned by a distant object is very much smaller than the transmit- 
ted power, even for targets with a large RCS. For theoretical isotropic antennas, the 
received power density depends upon the factor r~*. This factor diminishes rapidly 
for large values of r, that is, as the object being detected gets further away. 

In practice, the transmit antennas used are not isotropic but directive and often 
scan the area of interest. They also make use of receive antennas with a large effective 
area which can produce a viable signal from the small reflected power densities. 


1.2.5 Fractional indices 


The third law of indices states that (a’”)” = a". If we take a = 2, m= 5 and n = 2 we 
obtain 


(2'/?)? = 2! =2 


So when 2!/? is squared, the result is 2. Thus, 2'/? is a square root of 2. Each positive 
number has two square roots and so 


ON? a25/) = E1AIAD. .. 


Example 1.10 


Solution 


Example 1.11 


Solution 


1.2 Laws of indices 
Similarly 
(23)3 = 2! 22 
so that 2'/? is a cube root of 2: 
2/3 — /2 = 1.2599... 


In general 2!/” is an nth root of 2. The general law states 


x!/" is an nth root of x 


Write the following using a single positive index: 


gery’ Oo)28rr gw? Wve 


1 

1 

@) BAA = 3h = 3 = 
(b) 42/345/3 = 42/3+5/3 = x@/3 

(c) yy 2 — yly = yl = yh 


(d) J kB = (k3) 1/2 a k3xd = [3/2 


Evaluate 


(a) 83 (b) 873) 8B) 8-23) 843 
We note that 8 may be written as 2°. 

(a) gis = (218 = 2! =9 

(b) 82/3 = (81/3)? = 9? —4 


1 1 

-1/3 _ a 

(c) 8 ~ 913 9 
1 1 

-2/3 _ a 

(@) 8 ~ 92/3 4 


(e) 94/3 — (81/34 = a4 = 16 


Engineering application 1.4 


Skin depth in a radial conductor 


When an alternating current signal travels along a conductor, such as a copper wire, 
most of the current is found near the surface of the conductor. Nearer to the centre 
of the conductor, the current diminishes. The depth of penetration of the signal, 
termed the skin depth, into the conductor depends on the frequency of the signal. 
Skin depth, illustrated in Figure 1.1, is defined as the depth at which the current 
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density has decayed to approximately 37% of that at the edge. Skin depth is important 


because it affects the resistance of wires and other conductors: the smaller the skin 


depth, the higher the effective resistance and the greater the loss due to heating. 

At low frequencies, such as those found in the domestic mains supply, the skin 
depth is so large that often it can be neglected; however, in very large-diameter con- 
ductors and smaller conductors at microwave frequencies it becomes important and 


has to be taken into account. 
The skin depth, 4, is given by 


4 \'R 
Sar) 
OULO 


where ju is a material constant known as the permeability of the conductor, w is the 
angular frequency of the signal and o is the conductivity of the conductor. 


Figure 1.1 
Cross-section of a radial conductor 
illustrating a skin depth 6. 


EXERCISES 1.2 
1 Evaluate 
13 
(a) 23 (b) 3? ©) a 
iY 
@ pa) (f) (4)? 


(7) 412 hy) NP i) 322 
(j) V0.01 (k) 8139/4 
2 Use ascientific calculator to evaluate 


(a) 101.2 (b) 6-0-7 (c) 625 


Express each of the following expressions using a 
single positive index: 
(a) xAx7 (b) x°(—x) 


x2 


2 

x 
(c) — (d) = iy 

x x 
(rey Oia? 
Simplify as much as possible 


xh/2 
@ a5 (b) (16x4)°?° 


a7\\° Qxy? 
me (3 © Gop 


(ce) Varb®ct —(f)_ (6417)° 
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Solutions 
1 (a) 8 (b) 9 (c) 5 (d) 361 3 (a) xl! (b) —x3 (c) x 
1 1 1 1 
(e) 4 (f) T6 (g) 3 (h) 3 (d) 7 (e) 3 (f) x2 
i) 8 () 0.1 (k) 27 3 
4 (a) x'/6 — (b) 2x (c) = 
y 
2 (a) 15.8489 — (b) 0.2853 7 
(c) 88.1816  (d) 3.8159 5. Gare ie 


1.3.1 


1.3.2 


NUMBER BASES 


The decimal system of numbers in common use is based on the 10 digits 0, 1, 2, 3, 4, 
5, 6, 7, 8 and 9. However, other number systems have important applications in com- 
puter science and electronic engineering. In this section we remind the reader of what is 
meant by a number in the decimal system, and show how we can use powers or indices 
with bases of 2 and 16 to represent numbers in the binary and hexadecimal systems 
respectively. We follow this by an explanation of an alternative binary representation of 
a number known as binary coded decimal. 


The decimal system 


The numbers that we commonly use in everyday life are based on 10. For example, 253 
can be written as 
253 = 200+ 50+ 3 
= 2(100) + 5(10) + 3(1) 
= 2(10*) + 5(10') + 3(10°) 
In this form it is clear why we refer to this as a ‘base 10’ number. When we use 10 as a 
base we say we are writing in the decimal system. Note that in the decimal system there 
are 10 digits: 0, 1,2, 3, 4,5, 6, 7, 8, and 9. You may recall the phrase ‘hundreds, tens and 
units’ and as we have seen these are simply powers of 10. To avoid possible confusion 
with numbers using other bases, we denote numbers in base 10 with a small subscript, 
for example, 5192,,: 
51921, = 5000 + 100 + 90 + 2 
= 5(1000) + 1(100) + 9(10) + 2(1) 
= 5(10°) + 1(107) + 9(10') + 2(10°) 


Note that, in the previous line, as we move from right to left, the powers of 10 increase. 


The binary system 


A binary system uses the number 2 for its base. A binary system has only two digits, 0 
and 1, and these are called binary digits or simply bits. Binary numbers are based on 
powers of 2. In a computer, binary numbers are usually stored in groups of 8 bits which 
we call a byte. 


12 
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Example 1.12 


Solution 


Example 1.13 


Solution 


Converting from binary to decimal 
Consider the binary number 110101,. As the base is 2 this means that as we move from 
right to left the position of each digit represents an increasing power of 2 as follows: 
110101, = 1(2°) + 1(2*) + 0(23) + 1(27) + 0(2') + 102°) 
= 132) + 1(16) + 0(8) + 1(4) + 0@) + 10) 
= 32+16+4+1 
= 53,6 


Hence 110101, and 53,, are equivalent. 


Convert the following to decimal: (a) 1111, (b) 101010, 


(a) 1111, = 1(23) +127) +1(2') + 12°) 
= 1(8) + 1(4) +12) + 101) 
=§+44+2+1 
_ 1549 


(b) 101010, = 1(2°) + 0(2*) + 1(2?) + 0(27) + 1(2') + 0(2°) 
= 132) +0+1(8)+0+1(2)+0 
= 324+8+2 
= 4219 


Converting decimal to binary 

We now look at some examples of converting numbers in base 10 to numbers in base 2, 
that is from decimal to binary. We make use of Table 1.1, which shows various powers 
of 2, when converting from decimal to binary. Table 1.1 may be extended as necessary. 


Table 1.1 
Powers of 2. 

20 1 28 256 
2! 2 2° 512 
22 4 210 1024 
23 8 a 2048 


Convert 83), to a binary number. 


We need to express 83,, as the sum of a set of numbers, each of which is a power of 2. 
From Table 1.1 we see that 64 is the highest number in the table that does not exceed 
the given number of 83. We write 


83 = 64+ 19 


We now focus on the 19. From Table 1.1, 16 is the highest number that does not exceed 
19. So we write 


19 = 164+3 


Example 1.14 


Solution 


Example 1.15 


Solution 
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giving 
83 = 64+ 1643 
We now focus on the 3 and again using Table 1.1 we may write 
83 = 644+ 164+2+1 
— 96 4.94 4.91 + 2° 
= 12°) +02?) + 12% + 02°) + 0(27) + 12') + 1°) 
= 1010011, 


Express 200,, as a binary number. 


From Table 1.1 we note that 128 is the highest number that does not exceed 200 so we 
write 


200 = 128 + 72 
Using Table 1.1 repeatedly we may write 
200 = 128+ 72 
= 128+ 64+ 8 
= 27 ae 9 = 3 
= 1(2’) + 12°) + 0(2°) + 0(2*) + 1(2°) + 0(2”) + 0(2') + 0(2”) 
= 11001000, 


Another way to convert decimal numbers to binary numbers is to divide by 2 repeatedly 
and note the remainder. We rework the previous two examples using this method. 


Convert the following decimal numbers to binary: (a) 83 (b) 200 


(a) We divide by 2 repeatedly and note the remainder. 


Remainder 
83+2=41rl 1 
41+2=20rl 1 
20+2=10r0 0 
10+2= 5r0 0 
5$+2= 2rl 1 
2+2= 1r0 0 
1+2= Orl 1 


To obtain the binary number we write out the remainder, working from the bottom 
one to the top one. This gives 


83,) = 1010011, 


as before. 
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(b) We repeat the process by repeatedly dividing 200 by 2 and noting the remainder. 


Remainder 
200 + 2 = 100 r0 0 
100 +2= 50r0 0 
50+2= 25r0 0 
25+2= 12rl1 1 
12+2= 6r0 0 
6+2= 3r0 0) 
3+2= Irl 1 
1+2= Orl 1 


Reading the remainder column from the bottom to the top gives the required binary 
number: 


200,) = 11001000, 


1.3.3 Hexadecimal system 


We now consider the number system which uses 16 as a base. This system is termed 
hexadecimal (or simply hex). There are 16 digits in the hexadecimal system: 0, 1, 2, 
3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, and F. Notice that conventional decimal digits are 
insufficient to represent hexadecimal numbers and so additional ‘digits’, A, B, C, D, E, 
and F, are included. Table 1.2 shows the equivalence between decimal and hexadecimal 
digits. Hexadecimal numbers are based on powers of 16. 


Table 1.2 

Hexadecimal numbers. 

Decimal Hexadecimal Decimal Hexadecimal 
0 0 8 8 
1 1 9 9 
2 2 10 A 
3 3 11 B 
4 4 12 C 
5 5 13 D 
6 6 14 E 
7 7 15 F 


Converting from hexadecimal to decimal 

The following example illustrates how to convert from hexadecimal to decimal. We 
use the fact that as we move from right to left, the position of each digit represents an 
increasing power of 16. 


Example 1.16 


Solution 


Example 1.17 


Solution 
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Convert the following hexadecimal numbers to decimal numbers: (a) 93A (b) F9B3 


(a) Noting that hexadecimal numbers use base 16 we have 


93A,6 = 9(167) + 3(16') + A(16°) 
= 9(256) + 3(16) + 10(1) 
= 2362,, 


(b) F9B3,, = F(16*) + 9(167) + B(16') + 3(16°) 
= 15(4096) + 9(256) + 11(16) + 3(1) 
= 63923, 


Converting from decimal to hexadecimal 


Table 1.3 provides powers of 16 which help in the conversion from decimal to hexa- 
decimal. 


Table 1.3 

16° 1 
16! 16 
162 256 
16° 4096 


164 65 536 
The following example illustrates how to convert from decimal to hexadecimal. 


Convert 14 397 to a hexadecimal number. 


We need to express 14397 as the sum of multiples of powers of 16. From Table 1.3 we 
see that the highest number that does not exceed 14397 is 4096. We express 14397 as 
a multiple of 4096 with an appropriate remainder. Dividing 14397 by 4096 we obtain 3 
with a remainder of 2109. So we may write 


14397 = 3(4096) + 2109 


We now focus on 2109 and apply the same process as above. From Table 1.3 the highest 
number that does not exceed 2109 is 256: 


2109 = 8(256) + 61 
Finally, 61 = 3(16) + 13. So we have 
14397 = 3(4096) + 8(256) + 3(16) + 13 
= 3(16°) + 8(16°) + 3(16') + 13(16") 
From Table 1.2 we see that 13,, is D in hexadecimal, so we have 


14397) = 383D 4, 


As with base 2 we can convert decimal numbers by repeated division and noting the 
remainder. The previous example is reworked to illustrate this. 
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Example 1.18 Convert 14397 to hexadecimal. 


Solution 


Example 1.19 


Solution 


We divide repeatedly by 16, noting the remainder. 


Remainder 
14397 + 16 = 899 r 13 13 
899 +16= 56r3 3 
56+16= 3r8 8 
3+16= Or3 3 


Recall that 13 in hexadecimal is D. Reading up the Remainder column we have 
14397.) = 383D¢ 


as before. 


Electronic engineers need to be familiar with the decimal, binary and hexadecimal sys- 
tems and be able to convert between them. The equivalent representations of the decimal 
numbers 0-15 are provided in Table 1.4. 


Table 1.4 

Decimal Binary Hex Decimal Binary Hex 
0 0000 0 8 1000 8 
1 0001 1 9 1001 9 
2 0010 2 10 1010 A 
3 0011 3 11 1011 B 
+ 0100 + 12 1100 Cc 
5 0101 5 13 1101 D 
6 0110 6 14 1110 E 
7 0111 7 15 1111 F 


Converting from binary to hexadecimal 

There is a straightforward way of converting a binary number into a hexadecimal num- 
ber. The digits of the binary number are grouped into fours, or quartets, (from the right- 
hand side) and each quartet is converted to its hex equivalent using Table 1.4. 


Convert 1101011100111, into hexadecimal. 


Working from the right, the binary number is grouped into fours, with additional zeros 
being added as necessary to the final grouping. 


0001 1010 1110 0111 
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Table 1.4 is used to express each group of four as its hex equivalent. For example, 0111 = 
7,6, and continuing in this way we obtain 


1AE7 


Thus 110101110 0111, = 1AE7\¢. 


Binary coded decimal 


We have seen in Section 1.3.2 that decimal numbers can be expressed in an equivalent 
binary form where the position of each binary digit, moving from the right to the left, 
represents an increasing power of 2. There is an alternative way of expressing numbers 
using the binary digits 1 and 0 that is often used in electronic engineering because for 
some applications it is more straightforward to build the necessary hardware. This sys- 
tem is called binary coded decimal (b.c.d.). 

First of all, recall how the decimal digits 0, 1, 2,...,9 are expressed in their usual 
binary form. Note that the largest decimal digit 9 is 1001 in binary, and so we need 
at most four digits to store the binary representations of 0, 1,...,9. Expressing each 
decimal digit as a four-digit binary number we obtain Table 1.5. 


Table 1.5 
Decimal digits and their four-digit 
binary representations. 


0 0000 5 0101 
1 0001 6 0110 
2 0010 7 0111 
3 0011 8 1000 
4 0100 9 1001 


A four-digit binary number is referred to as a nibble. To express a multi-digit decimal 
number, such as 347, in b.c.d. each decimal digit in turn is converted into its binary 
representation as shown. Note that a nibble is used for each decimal digit. 


3 4 7 


Y Y 1 
0011 0100 Ol11 


Recall from Section 1.3.2 that a byte is a group of 8 bits (binary digits). Computers 
usually store numbers in 8-bit bytes so there are two common ways of encoding b.c.d. 
The first is to use a whole byte for each nibble, with the first 4 bits always set to 0. So, 
for example, 347,, can be stored as 


00000011 00000100 00000111 


Alternatively, each byte can be used to store two nibbles, in which case 347,) would be 
stored as 


00000011 01000111 


Rules have been developed for performing calculations in b.c.d. but these are beyond the 
scope of this book. 
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Engineering application 1.5 


Seven-segment displays 


The number displays found on music systems, video and other electronic equip- 
ment commonly employ one or more seven-segment indicators. A single seven- 
segment indicator is shown in Figure 1.2(a). The individual segments are typically 
illuminated with a light-emitting diode (LED) or similar optical device and are either 
on or off. The segments are illuminated according to the table shown in Figure 1.2(b), 
where | indicates that the segment is turned on and 0 indicates that it is turned off. 


b.c.d. 
number a b c d e f g 
0000 1 1 1 1 1 1 0 
0001 0 1 1 0 0 0 0 
0010 1 1 0 1 1 0 1 
0011 1 il 1 1 0 0 1 
a 
= 0100 0 1 1 0 0 1 1 
f ih 0101 1 0 1 1 0 1 1 
g 0110 1 0 1 1 1 1 il 
Ss] 
0111 1 1 1 0 0 0 0 
: co 1000 1 1 1 1 1 1 1 
d 
| 1001 1 il 1 1 0 1 1 
(a) (b) 
Figure 1.2 


(a) Seven-segment LED display. (b) Seven-segment coding. 


The numbers in the microprocessor system driving the display are typically 
stored in binary format, known as, binary coded decimal (b.c.d.). As an example 
we consider displaying binary number 11101010, as a decimal number on seven- 
segment displays. This represents the decimal number 234, which requires three 
seven-segment displays. 

The microprocessor first divides the input number by 100 and in this case obtains 
the result 2 with a remainder of 34. This can be done directly on the binary number 
itself via a series of operations within the assembly language of the microprocessor 
without first converting to a decimal number. The result 2 = 0010, is then decoded 
using Figure 1.2(b), giving the bit pattern 1101101 which is passed to the ‘hundreds’ 
display. 

The remainder of 34 is then divided by 10 giving 3 with a final remainder of 4. The 
number 3 = 0011, and so this can be outputted to the ‘tens’ display as the pattern 
1111001. Finally, 4 = 0100,, which is passed to the display as the pattern 0110011. 


1 
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The display shows 
a a 
(I ae =| 
f b f b f b 
g g 
SSS Sa) Sa 
€ Cc € Cc € Cc 
d d 
=a =a [S|] 


display. 


Notice that prior to decoding for display, by successive division by 100 and 10 
the number has been converted into separate b.c.d. digits. Integrated circuits are 
available which convert b.c.d. directly into the bit patterns for display. Hence the 
output bit pattern of the microprocessor may be chosen to be b.c.d. In this case it 
has the advantage that fewer pins are required on the microprocessor to operate the 


EXERCISES 1.3 


Convert the following decimal numbers to binary 
numbers: (a) 19 (b)36 (c) 100 (d) 796 
(e) 5000 


Convert the following binary numbers to decimal 
numbers: (a) 111 (b) 10101 (c) 111001 
(d) 1110001 (e) 11111111 


What is the highest decimal number that can be 
written in binary form using a maximum of (a) 2 
binary digits (b) 3 binary digits (c) 4 binary digits 
(d) 5 binary digits? Can you spot a pattern? (e) Write 
a formula for the highest decimal number that can be 
written using N binary digits. 


4 Write the decimal number 0.5 in binary. 


Solutions 


(a) 19,9 =10011, (b) 100100 (c) 1100100 
(d) 1100011100 (e) 1001110001000 


(a) 111, =7 (b)21 (c)57 (@)II3. (e) 255 
(a)3 (b)7 (c) 15 (d)31. (e) 2% -1 


The binary system is based on powers of 2. The 

examples in the text can be extended to the case of 
negative powers of 2 just as in the decimal system 
numbers after the decimal place represent negative 


Convert the following hexadecimal numbers to 
decimal numbers: (a) 91 (b) 6C (c) AIB (d) F9D4 
(e) ABCD 


Convert the following decimal numbers to 
hexadecimal numbers: (a) 160 (b)396 = (c) 5010 
(d) 25000 (e) 1000000 


Calculate the highest decimal number that can be 
represented by a hexadecimal number with (a) | digit 
(b) 2 digits (c)3 digits (d)4digits (e) N digits 


Express the decimal number 375 as both a pure binary 
number and a number in b.c.d. 


Convert (a) 1111111, (b) 101010111, into 
hexadecimal. 


powers of 10. So, for example, the binary number 
11.101, is converted to decimal as follows: 


11.101, =1x2' +1x2° 41x27! 


+0277 +1273 


~2414+4! 
7 2 8 


8 
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In the same way the binary equivalent of the decimal 7 (a)15 (b)255 (c) 4095 (d) 65535 (e) 16% —1 
number 0.5 is 0.1. 

5) (a) 9144=145,9 (b) 6C=108 (c) 2587 (d) 63956 8 (a) 101110111, (b)001101110101,., 
(e) 43 981 


9 (a)7F (b) 157 


6 (a) 160;,=AO (b)18C (c) 1392 (d) 61A8 


(e) F4240 


POLYNOMIAL EQUATIONS 


1.4.1 


A polynomial equation has the form 


P(x) = a,x" +4, x" | ta, 9X"? +++ Fax +a,x+a)=0 (1.4) 
where 7 is a positive whole number, a,, a,_,,...,@) are constants and x is a 
variable. The constants a,, d,_,,-.., 45, @,, dy are called the coefficients of the 


polynomial. 


The roots of an equation are those values of x which satisfy P(x) = 0. So ifx = x, isa 
root then P(x,) = 0. 
Examples of polynomial equations are 


Te? +4x—-1=0 (1.5) 
2x -3=0 (1.6) 
x —20=0 (17) 


The degree of an equation is the value of the highest power. Equation (1.5) has degree 2, 
Equation (1.6) has degree 1 and Equation (1.7) has degree 3. A polynomial equation of 
degree n has n roots. 

There are some special names for polynomial equations of low degree (see Table 1.6). 


Table 1.6 

Equation Degree Name 
ax+b=0 1 Linear 
ax* + bx +c =0 2 Quadratic 
ax + bx* +ext+d=0 3 Cubic 
ax* + bx +x? +dx+e=0 4 Quartic 


Quadratic equations 


We now focus attention on quadratic equations. The standard form of a quadratic equa- 
tion is ax? + bx + c = 0. We look at three methods of solving quadratic equations: 


(1) factorization, 
(2) use of a formula, 


(3) completing the square. 


Example 1.20 illustrates solution by factorization. 


Example 1.20 


Solution 


Example 1.21 


Solution 


Solve 


6° +11x-10=0 


The left-hand side (1.h.s.) is factorized: 
(3x — 2)(2x+ 5) =0 

So either 
3x -—2=0 or 2x+5=0 


Hence 
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When roots cannot be found by factorization we can make use of a formula. 


The formula for finding the roots of ax” + bx +c = 0 is 


7 —b+/b* —4ac 
= 2a 


Use the quadratic formula to solve 


3x7 —x-6=0 


Comparing 3x” — x — 6 with ax” + bx + c we see that a = 3, b = —1 andc = —6. So 


—(-1) + /(-1? — 4(3)(-6) 


2(3) 


14/73 


6 
= —1.2573, 1.5907 


Engineering application 1.6 


Current used by an electric vehicle 


Personal transport systems that make use of electrical power are becoming increas- 
ingly common. One of the factors behind this change is that their use can reduce road- 
side pollution in an urban environment. Electrical vehicles have also become the base 
for self-driving cars when combined with electrical control and navigation systems. 

The motor in an operational electric vehicle has to do work to overcome wind, 
inertia, friction, road resistance and in order to climb inclines. The energy supply 
in the form of electrical power comes from the on-board battery pack. Due to its 
internal construction the battery pack has a total internal resistance, R, which serves 
to reduce the power available to the motor. 
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A simplified circuit diagram of a vehicle is shown in Figure 1.3. 


Internal 
resistance 


Figure 1.3 
Electric vehicle wiring diagram. 
The total power delivered by the battery pack is 
power = voltage x current = V/ 


This is shared between loss due to the internal resistance and the power, P, to the 
motor. The power loss due to the internal resistance is /7R (see Engineering appli- 
cation 1.1). So the equation for the power in the circuit is 


VI=PR+P 
This can be rewritten into the form of a quadratic equation 
RP —VI+P=0 


which can be solved to calculate the current in the wire for a particular power deliv- 
ered to the motor. It is important to know the current in order to specify the size of 
the fuses, the motor controller and the wire diameters used in the vehicle. 

Consider the case where the power output is 2 kW. If the circuit parameters are 
V = 150 volts, R = 1.6 Q, we have 


1.67 — 1507 + 2000 = 0 
The solutions to the quadratic equation are 
1a bt JB — 4ac Bee = V(—150)? — (4 x 1.6 x 2000) 
2a Dee 
=77.7A, 16.1A 


The relevant solution depends on the electrical characteristics of the motor used in the 
circuit. In practice, the larger of the two currents would correspond to a substantial 
loss in the internal resistance and would be avoided by the correct choice of motor. 

The technical computing language MATLAB® has the function roots which 
finds the solutions of a polynomial equation. In this example we would type 
roots ([1.6 -150 2000]) at the command line to obtain the results calculated 
above. 


Example 1.22 


Solution 
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We now introduce the method of completing the square. The idea behind completing 
the square is to absorb both the x” and the x term into a single squared term. Note that 
this is possible since 


x 4+ 2kx +h = (x +h)? 

and so 
+ 2kx = (x tk)? —K? 

and finally 
P+2kx+A=(x+kh?+A-K 


The x* and the x terms are both contained in the (x + k)* term. The coefficient of x on 
the l.h.s. is 2k. The squared term on the right-hand side (r.h.s.) has the form (x + k)?, 


. coefficient of x 
that is {x + ==— 


) . The following example illustrates the idea. 
Solve the following quadratic equations by completing the square: 
(a) x°+8x+2=0 
(b) 2x7 -4x+1=0 
(a) By comparing x? + 8x + 2 with x° + 2kx + A we see k = 4. Thus the squared term 
must be (x + 4)*. Now 
(«+ 4) =x + 8x+4+ 16 
and so 
x + 8x = («4+ 4)? — 16 
Therefore 
8x42 = O44 = 1642 
= (x +4)’ - 14 


At this stage we have completed the square. Finally, solving x* + 8x +2 = 0 we 
have 


r+ 8x+2=0 
(x +4)? -—14=0 


(x+4) = 14 
x+4= +/14 
x= —44 J/14 = —7.7417, —0.2583 
(b) 2x? —4x+ 1 = 0 may be expressed as x” — 2x + 0.5 = 0. Comparing x? — 2x +0.5 
with x? + 2kx + A we see that k = —1. Thus the required squared term must be 
(x — 1)?. Now 


(x— 1)? =x? —2x+1 
and so 


x —2x=(x-1)7?-1 
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1.4.2 


Example 1.23 


Solution 


Example 1.24 


Solution 


and 


x —2x4+0.5 = (x— 1)? —140.5 
= (x—1)?-0.5 


Finally, solving x? — 2x + 0.5 = 0 we have 
@=—1" +05 =0 


(x— 1)? =0.5 
x-1l=+V0.5 
x =1+£0.5 = 0.2929, 1.7071 


Polynomial equations of higher degree 


Verify that x = 1 and x = 2 are roots of 


P(x) =x' -—2x —x+2=0 


P(x) = xt —2x7 —x+2 

Pd) =1-2-1+2=0 

P(2) = 24 —2(23) -24+2=16-16-2+2=0 
Since P(1) = 0 and P(2) = 0, then x = | and x = 2 are roots of the given polynomial 
equation and are sometimes referred to as real roots. Further knowledge is required to 


find the two remaining roots, which are known as complex roots. This topic is covered 
in Chapter 9. 


Solve the equation 

P(x) =x° + 2x? — 37x 4+52=0 
As seen in Example 1.21 a formula can be used to solve quadratic equations. For higher 
degree polynomial equations such simple formulae do not always exist. However, if 
one of the roots can be found by inspection we can proceed as follows. By inspection 


P(4) = 4 + 2(4)? — 37(4) + 52 = 0 so that x = 4 is a root. Hence x — 4 is a factor of 
P(x). Therefore P(x) can be written as 


P(x) =x +. 2x — 37x +52 = (x-4)( +0x4+ B) 
where a and f must now be found. Expanding the r.h.s. gives 

P(x) =x +.ax° + Bx — 4° — 4dax — 46 
Hence 

x 42x? — 37x +52 =x + (a —4)x + (B — 4a)x — 48 
By comparing constant terms on the I.h.s. and r.h.s. we see that 


52 = —46 


so that 


p=-13 


1.4 Polynomial equations 


By comparing coefficients of x we see that 


2=a-4 
Therefore, 


a=6 
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Hence, P(x) = (x — 4) (x? + 6x — 13). The quadratic equation x° + 6x — 13 = Ocan be 


solved using the formula 


_ 6+ /36— 4-13) 
= 2 

_ 6 + 88 

~ 2 

= 1.690, —7.690 


We conclude that P(x) = 0 has roots at x = 4, x = 1.690 and x = —7.690. 


EXERCISES 1.4 


1. Calculate the roots of the following linear equations: 


(a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 
(h) 
(i) 
G) 
(k) 
(1) 


4x—12 =0 

5t +20 =-0 

t+10=2r 

ae ee 

2 

0.5t -6 =0 

2x+3=5x-6 

3% _ 7 =0 

2 

XX. x 

Se eal 

2° 3 

waa * 49 
2 

Ay +1) =6 


3(Q2y — 1) = 2(y +2) 
d 3 =" 1 
get haa eS ) 


2 Solve the following quadratic equations by 
factorization: 
(a) 7? —5t+6=0 
(b) x7 +x—-12=0 
(c) 1? = 10r — 25 
(d) x7+4,-21=0 


(e) x7 -9x+18=0 


(f) x27 =1 
(g) y* -10y+9=0 
(h) 22*7-z-1=0 


(i) 2x? 4+3x-2=0 
G) 3t7+4¢+1=0 
(k) 4y*+12y+5=0 
() 4r? -—9r+2=0 
(m) 6d? —d—-2=0 
(n) 6x? — 13x+2=0 


Complete the square for the following quadratic 
equations and hence find their roots: 


(a) x7+2x—-8=0 


(b) x7 —6x -5=0 
(c) x7+4x-6=0 
(d) x2 —14x-10=0 


(e) x7 +5x—-49=0 


Solve the following quadratic equations using the 
quadratic formula: 


(a) x?+x—-1=0 
(b) 72 —3r -2=0 
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(c) W?+5h+1=0 
(d) 0.5x7 +3x-2=0 
(e) 2k? -k—-3=0 
(f) -y?+3y+1=0 
(g) 3° =7r+2 

(h) x7 -70=0 

(i) 4s?-2=s 

(Gj) 2+5t+2=0 
(k) 3x* = 50 

Calculate the roots of the following polynomial 
equations: 


(a) x? — 6x? + 11x — 6 = Ogivenx = 1 is a root 


Solutions 
1 (a) 3 (bt) -4 () 10 @s8 
34 6 
(e) 12 (f) 3 (g) 3 (h) 5 
(i) 2 j) 2 (k) : 00) = 
” 4 i 
2 (a) 2,3 (b) —4,3 (c) 5 
(d) —7,3 (e) 3,6 (f) -1,1 
(g) 1,9 (h) —0.5, 1 (i) —2, 0.5 
qj) —1, ; (k) —2.5,-0.5 (1) 0.25,2 
(m) ae (n) : 2 
oe Me 
3 (a2) @4+1)?-9=0,x=-4,2 


(b) (x—3)2 — 14 =0,x = —0.7417, 6.7417 
(c) (x +2)? — 10 =0,x = —5.1623, 1.1623 
(d) («—7)? —59 =0,x = —0.6811, 14.6811 


p18 | ALGEBRAIC FRACTIONS 


An algebraic fraction has the form 


(b) i — 21? — 5t +6 = 0 givent = 3 isa root 
(c) v> — v2 — 30v + 72 = 0 given v = 4 is a root 
(d) 2y3+3y? — 1ly +3 =0 given y = 1.5 is a root 


> 
(e) 2x? + 3x2 —7x-5=0 given x = = is a root. 
Check that the given values are roots of the following 
polynomial equations: 


(a) +x-2=0 x=-2,1 


(b) 2 —3727-3f+2=0 t=-1,0.5 
(©) ~y+y+y+1=0 y=-1 
(d) vt} +4027 +602 +3v=0 v=-1,0 


(e) ( + =), = 0 9.9330, 4.9330 
e) (x+ =) ———=0,x=-9. : 
2 4 ; ; 


(a) —1.6180, 0.6180 
(b) —0.5616, 3.5616 
(c) —4.7913, —0.2087 
(d) —6.6056, 0.6056 
(e) —1, 1.5 

(f) —0.3028, 3.3028 
(g) —0.2573, 2.5907 
(h) —8.3666, 8.3666 
(i) —0.5931, 0.8431 


Gg —%=05 

(k) —4.0825, 4.0825 

(a) 1,2,3 1 =2:1,3 

(c) —6,3,4  (d) —3.3028, 0.3028, 1.5 


(e) —2.5, —0.6180, 1.6180 


. . numerator polynomial expression 
algebraic fraction = : = 5 : 
denominator polynomial expression 
For example, 
3t+1 x y+ 
>——., and > 
?e+r+4 r+) y?+2y+3 


are all algebraic fractions. 


1.5.1 


Example 1.25 


Solution 


1.5.2 


Example 1.26 


1.5 Algebraic fractions 27 


Proper and improper fractions 


When presented with a fraction, we can note the degree of the numerator, say n, and the 
degree of the denominator, say d. 

A fraction is proper if d > n, that is the degree of the denominator is greater than 
the degree of the numerator. If d < n then the fraction is improper. 


Classify the following fractions as either proper or improper. In each case, state the 
degree of both numerator and denominator. 


r+9x—6 
in WS 
3x3 + x* + 100 
Pp+r+9t-—6 
(b) ———_ 
P+9 
(v + 1)(v — 6) 
i) aaa 
v2 +3v+6 
(z+ 2)? 
(d) 


522 + 10z + 16 


(a) The degree of the numerator, n, is 2. The degree of the denominator, d, is 3. Since 
d > nthe fraction is proper. 

(b) Here n = 3 andd = 5. The fraction is proper since d > n. 

(c) Here n = 2 and d = 2, sod = n and the fraction is improper. 


(d) Here n = 3 andd = 2, sod < n and the fraction is improper. 


Equivalent fractions 
1 2 
Consider the numerical fractions 5 and rs These fractions have the same value. Sim- 


2 6 20 ; . x 2K xt 
=, ~ and — all have the same value. The algebraic fractions -, — and — all 
3° 9 30 y 2y yt 
have the same value. Fractions with the same value are called equivalent fractions. 

The value of a fraction remains unchanged if both numerator and denominator are 
multiplied or divided by the same quantity. This fact can be used to write a fraction in 
many equivalent forms. Consider for example the fractions 
2(x+ 1) 2xt 

(c) >= 


x(x + 1) xt 


ilarly, 


2 
(a) = — (b) 
X 


These are all equivalent fractions. Fraction (b) can be obtained by multiplying both nu- 
merator and denominator of fraction (a) by (x + 1), so they are equivalent. Fraction (a) 
can be obtained by dividing numerator and denominator of fraction (c) by xt and so they 
are also equivalent. 


Show that 


x+1 VP+4x4+3 
an ——_______ 
x+7 x2 +10x+ 21 


are equivalent. 
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Solution 


1.5.3 


Example 1.27 


Solution 


Example 1.28 


Solution 


We factorize the numerator and denominator of the second fraction: 


eae 3 Gt I)@+3) 
x2+10x+21 (x +7)(x+3) 


1 
7 So the two given 


Dividing both numerator and denominator by (x + 3) results in a 
x 


fractions are equivalent. 
Dividing both numerator and denominator by x + 3 is often referred to as ‘cancelling 
x+3". 


Expressing a fraction in its simplest form 


Consider the numerical fraction —. To simplify this we factorize both numerator and 
denominator and then cancel any common factors. Thus 


6 2x3 _ 3 


10 2x5 5 


3 3 6 
The fractions — and = have identical values but 5 is in a simpler form than To" It is im- 


portant to stress that only factors which are common to both numerator and denominator 
can be cancelled. 


Simplify 
6x 
18x2 
12x37 
Ax? yz 


(a) 


(b) 


Note that 18 can be factorized to 6 x 3 and so 6 is a factor common to both numerator 
and denominator. Also x” is x x x and so x is also a common factor. Cancelling the 
common factors, 6 and x, produces 


(a 


w~ 


6x _ 6x _ 1 
18x2(6)(3)(x)(x) 3x 


(b) The common factors are 4, x” and y. Cancelling these factors gives 


12x3y? _ 3xy 
Aye z 
Simplify (a) 4 (b) 6? + 317 + Ot 
implify (a) ——— ee 
Pe eee 3 + 3 


(a) Factorizing both numerator and denominator and cancelling common factors yields 


4 (2)(2) 2 


6x +4 23x+2) 3x42 


Example 1.29 


Solution 


1.5.4 


Example 1.30 


Solution 
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(b) Factorizing and cancelling common factors yields 


64+307+4+6¢ 3420? +1+2) 2? +t4+2 
9243 ~~ 31) 41 


Note that the common factor, 3f, has been cancelled. 


4t+8 (b) 2y--y-1 
+ 3r+2 y? —2y+1 
The numerator and denominator are factorized and common factors are cancelled. 
4t+8 4(t + 2) 4 
@ 43427 G+DE4D 741 
The common factor, t + 2, has been cancelled. 
2~-y-1  Qy+IQ-1)_ 2y+1 
y—2y+1 = Gy 1? y= 


The common factor, y — 1, has been cancelled. 


Simplify (a) 


(b) 


Multiplication and division of algebraic fractions 


To multiply two algebraic fractions together, we multiply their numerators together, and 
multiply their denominators together, that is 


a c axc ac 


x = = 
b d bxd_ bd 


Division is performed by inverting the second fraction and then multiplying, that is 


a. .c oa_d_ad 

bi’ d bc be 
Before multiplying or dividing fractions it is advisable to express each fraction in its 
simplest form. 


Simplify 


Vr +5x+6 - vr —x 
2x — 2 x2 +3x+2 


Factorizing numerators and denominators produces 


we +5x+6 | vr—x _ VIDED » x(x — 1) 
2x —2 x2 4+ 3x+2 2(x—1) (x + 1)(x+ 2) 
7 (x + 2)(x + 3)x(x — 1) 
~ 26e-1DO+1I)@4+2) 
Common factors (x + 2) and (x — 1) can be cancelled from numerator and denominator 
to give 
(X+2)@4+3)x(Qa-—1)  @+3)x 
2(x—1)@+1@+2) 241) 
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Example 1.31 


Solution 


1.5.5 


Example 1.32 


Solution 


Hence 


r+ 5x+6 rx x(x + 3) 


a ee ee eee 


Simplify 


48x47 | x+7 
P-—6x 9 84x 


The second fraction is inverted to give 


P+8x4+7 24x 
x 
x2 — 6x x+7 


Factorizing numerators and denominators yields 


(x+ 1)(«+7) - x(x +1) _ @+0DG@+ 7)x? (x + 1) 
x(x — 6) (x+7) x(x —6)(x +7) 
Common factors of x and (x + 7) are cancelled leaving 
(x + 1)x(x+ 1) 
x—6 


which may be written as 


x(x + 1)? 
x-6 


Addition and subtraction of algebraic fractions 


The method of adding and subtracting algebraic fractions is identical to that for numer- 
ical fractions. 

Each fraction is written in its simplest form. The denominators of the fractions are 
then examined and the lowest common denominator (1.c.d.) is found. This is the sim- 
plest expression that has the given denominators as factors. All fractions are then writ- 
ten in an equivalent form with the l.c.d. as denominator. Finally the numerators are 
added/subtracted and placed over the I.c.d. Consider the following examples. 


Express as a single fraction 


2 ze 4 
x+1 <x+2 


Both fractions are already in their simplest form. The l.c.d. of the denominators, (x + 1) 
and (x + 2), is found. This is (x + 1)(x + 2). Note that this is the simplest expression 
that has both x + 1 and x + 2 as factors. 

Each fraction is written in an equivalent form with the l.c.d. as denominator. So 


Wak 2(x + 2) 
1s written as ———_—___—__ 
x+1 (x+ 1)@+2) 
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and 


F . 4(x+ 1) 
— 1s written as —————__—__ 
+2 (x + 1) + 2) 


Finally the numerators are added. Hence we have 


2 4  &x+2) A(x + 1) 
x+1 x42 @+DG+2  @+DE+2) 

_ 24642) +4041) 

ae DGS) 

_ 6x+8 

~ @+1I)@+42) 

6x +8 
~ 2 43x42 


Example 1.33 Express as a single fraction 


V?+3x4+2 2, 
x—1 2x +6 


Solution Each fraction is written in its simplest form: 


w+ 3x+4+2 (x+1)(*+2) «+2 
*®-1  («—-D@t+)h x-1 
(a 
Ax+6 22%x+3) x+3 


The l.c.d. is (x — 1)(x + 3). Each fraction is written in an equivalent form with l.c.d. as 
denominator: 


+2 (*+2)@+4+3) 1 _ x-1 
x=1° G=—DG+3)' #43 k= 1G +3) 


So 


xv +3x+2 2 x+2 1 

©@-1 wt6 x-1 243 
_ @+2)@43) (x — 1) 
~ @=DG+3) &-D@+3) 
_ @+2)@4+3)-@-1) 
7 (x — 1) +3) 

?4+5x+6—-x+1 
(x — 1)@ +3) 

w+ 447 
~ («= D(e+3) 
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EXERCISES 1.5 


Engineering application 1.7 


Resistors in parallel 


When carrying out circuit analysis it is often helpful to reduce the complexity of a 
circuit by calculating an equivalent single resistance for several resistors connected 
together in parallel. This simplified version of the original circuit then becomes much 
easier to understand. Figure 1.4 shows the simplest case of two resistors connected 
together in parallel. 


Figure 1.4 
Two resistors in parallel. 


The equivalent resistance, R,, of this simple network is found from the formula 


1 1 1 
RRR, 
By combining the fractions on the r.h.s. we see 
lease, 
Ry a RR, 
and hence 
RR, 
POR +R, 


Consider the case when R, and R, are equal and have value R. The equivalent resis- 
tance then becomes 


- RR Re R 
ERR OR ee 


So 
2 i 


Therefore the effect of putting two equal resistors in parallel is to produce an overall 
equivalent resistance which is half that of a single resistor. 


1 Classify each fraction as either proper or improper. 2 Classify each of the following algebraic fractions as 
proper or improper. 
2 2 2 
Qc 5 OSG 2 ey PHL gy Wr tdv-6 (6-440 
= * P—1 3u2+u—1 6 +1 
2 2 2 
242 x +2 41 r+] 100f7 + 1 («+ 1)(x4+ 2) 
d +— d 
Oe ear Ye Oper pay Yeas 


(y+ DO+2)0+3) 


(g) +43 

(h) (z+ 1)!° ; (q+ 1)! 
(2z + 1)10 (q?7 + 1)6 
3k? + 2k —1 

i) = 


RB+k2—4k+1 


3. Express each fraction in its simplest form. 


es ale ) 5x7 +5 
2y-—y 10x — 10 

i ?+7t+12 @ x1 
e45t+4 x3 — 2x2 +x 
42x41 

(©) x2 —2x+1 


4 Simplify the following: 
x+1l x43 
x+3 x+2 
4 x+1 
x 
x2 —] 6 


(a) 


(b) 


Solutions 


x2 + 3x 
(c) 


1.6 Solution of inequalities 


e+4x4+4 


4xt + 4t 
xt? — 12 


(d) 


x 
x3 42x? 


4x 
4x2 —4 


Sf 
8x +8 


x24+2x-15 x2 43x-4 
x 
x2 +4x—5 x2 —4x+3 


(e) 


Express as a single fraction 


3 ‘a 2 
x+6 x+l1 

4 2 
x+2  (x+2)? 

2x +1 4 
eee hil 
xtx+1l <x«x-1 


(a) 


(b) 


(c) 


4+3x—-18 2x7 4+7x-4 
x27 +7x +6 x2 + 9x + 20 
3(x + 1) 2(x — 1) 

wetayt 4" 32-4 


(d) 


(e) 


1 (a) proper 
(d) improper 

2 (a) proper 
(d) improper 
(g) improper 


(b) proper (c) improper 
(e) improper (f) improper 
(b) improper (c) improper 
(e) proper (f) proper 
(h) improper (i) proper 


x+1 


(a) ae, 


(x + 2)(x + 3) 2(x4+ 1) x+4 


2 


®) 3(x— 1) 


(c) 4x2 


5x +15 
(«+ 1)(« + 6) @+2) 


(a) 


(d) (e) 


t x— 


4x+6 


(b) 


6x2 + 3x+3 


(c) 


(«—- 10? 4x41) 
—4+x—14 


5x2 —x— 10 


(j) proper 
2 2, 
yr +2 24] ras 
b ea 
ee PE On Par, a CS 
x+1 x2 +2x+1 
a) 2 ace a 
Oo 26=0 “gait l 


OQ) a «COC 


@+1)@+4+5) (x + 2)2(x — 2) 


GER] = sovution oF inequatities 


An inequality is any expression involving one of the symbols >, >, <, <. 


a > bmeans ais greater than b 
a < bmeans ais less than b 


a > bmeans ais greater than or equal to b 
a < bmeans ais less than or equal to b 


33 


Just as with an equation, when we add or subtract the same quantity to both sides of an 
inequality the inequality still remains. Mathematically we have 
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Example 1.34 


Solution 


Ifa > b then 
at+k>b+k adding k to both sides 
a—k>b—-k subtracting k from both sides 


We can make similar statements for a > b,a < banda < b. 

When multiplying or dividing both sides of an inequality extra care must be taken. 
Suppose we wish to multiply or divide an inequality by a quantity k. If k is positive the 
inequality remains the same; if k is negative then the inequality is reversed. 

Ifa > b then 


ka > kb ka <kb 
a ob if k is positive a b if k is negative 
kk kk 


Note that when k is negative the inequality changes from > to <. Similar statements can 
be made for a > b, a < banda < b. When asked to solve an inequality we need to state 
all the values of the variable for which the inequality is true. 


Solve the following inequalities: 


(a) 3t+1>t+7 (b) 2—3z<6+2z 


(a) 3t+1>t+7 


2t+1>7 subtracting ¢ from both sides 
2t > 6 subtracting | from both sides 
t>3 dividing both sides by 2 


Hence all values of t greater than 3 satisfy the inequality. 


(b) 2—3z < 6+2z 
—3z < 44+2z subtracting 2 from both sides 
—-4z, < 4 subtracting z from both sides 
z2-l dividing both sides by —4, remembering to reverse 


the inequality 


Hence all values of z greater than or equal to —1 satisfy the inequality. 


We often have inequalities of the form 3 > 0, ; < 0,a6 > Oand af < 0 to solve. It 


is useful to note that if 


> 0 then either a > 0 and B > Oora < Oands <0 


< 0 then either a > O and 6 <Oora <Oands > 0 


W/L WIR 


aB > 0 then eithera > Oand 6 > Oora < Oand 6B <0 


aB <0 then eithera > Oand 6 < Oora < Oand 6B > 0 


The following examples illustrate this. 


Example 1.35 


Solution 


Case (i) 


Case (ii) 


Case (i) 


Case (ii) 
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Solve the following inequalities: 


(a) 


(a 


wm 


(b) 


1 2t +3 
ee eyo i 
2x — 6 t+2 
po I ah ae 
Consider the fraction a For the fraction to be positive requires either of the 


following: 
Gi) x+1>0Oand2x-—6>0. 


Gi) x + 1 < Oand 2x —6 < 0. 
We consider both cases. 


x+1>0Oandsox>-—l. 


2x —6 > Oandsox > 3. 


Both of these inequalities are true only when x > 3. Hence the fraction is positive 
when x > 3. 


x+1<Oandsox < —-l. 


2x —6 < Oandsox < 3. 


Both of these inequalities are true only when x < —1. Hence the fraction is positive 
when x < —1. 


x+1 
In summary, 5 6 > 0Owhenx > 3 0rx < —1. 
i — 


2t+3 

t+2 
2t +3 
t+2 

t+1 & 


t+2~ 


<1 


1<0 


; . ttl 
We now consider the fraction 
either of the following: 


. For the fraction to be negative or zero requires 


Ga) t+1<O0Oandt+2>0. 
Gi) f+1 > 0 andt+2 <0. 
We consider each case in turn. 


t+1<Oandsor< —l. 

t+2>Oandsor > —2. 

Hence the inequality is true when ¢ is greater than —2 and less than or equal to —1. 
We write this as —2 <t< —l. 

t+12>Oandsot> —-1l. 

t+2<Oandsot < —2. 
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It is impossible to satisfy both t > —l and t < —2 and so this case yields no values 
of f. 


2t+3 
In summary, ——— < | when -2 <t< -l. 
t+2 


Example 1.36 Solve the following inequalities: 
(ay ?>4 (b) x <4 


Solution (a) Vr?>4 
x —4>0 
(x —2)(x+ 2) >0 

For the product (x — 2)(x + 2) to be positive requires either 


Gi) x-2>Oandx+2>0 


or 
Gi) x —2 <Oandx+2 <0. 


We examine each case in turn. 


Case (i) x—2>Oandsox >2. 
x+2>O0andsox > —2. 


Both of these are true only when x > 2. 
Case (ii) x—2 <Oandsox <2. 
x+2<Oand sox < —2. 


Both of these are true only when x < —2. 
In summary, x? > 4whenx > 2 orx < —2. 


(b) vr <4 

vr —4<0 

(x — 2)(x+ 2) <0 
For the product (x — 2)(x + 2) to be negative requires either 


G) x-2>Oandx+2 <0 


or 


Gi) x —2 <Oandx+2>0. 


We examine each case in turn. 


Case (i) x—2>Oandsox >2. 
x+2 <Oandsox < —2. 


No values of x are possible. 


Case (ii) 


Example 1.37 


Solution 


Case (i) 


Case (ii) 
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x—2 <Oandsox <2. 


x+2>0Oandsox > —2. 


Here we have x < 2 and x > —2. This is usually written as —2 < x < 2. Thus all 
values of x between —2 and 2 will ensure that x” < 4. 
In summary, x” < 4 when —2 < x < 2. 


The previous example illustrates a general rule. 


Ifx? > kthenx > Jk orx < —Vk. 
Ifx? <kthen —J/k <x < Vk. 


Solve the following inequalities: 


(a) x7 +x-6>0 (b) x7 +8x+1<0 


(a) 


(b 


~ 


vr+x-6>0 
(x —2)(x+3)>0 


For the product (x — 2)(x + 3) to be positive requires either 


Gi) x-2>Oandx+3>0 


or 


(ii) x —-2 <Oandx+3 <0. 


x—2>O0andsox > 2. 


x+3>0andsox > —3. 
Both of these inequalities are satisfied only when x > 2. 


x—2<Oandsox <2. 


x+3 <Oandsox < —3. 


Both of these inequalities are satisfied only when x < —3. 
In summary, x” + x — 6 > 0 when either x > 2 or x < —3. 


The quadratic expression x” + 8x + 1 does not factorize and so the technique of 
completing the square is used. 


x +8x+1=(x+4)?-15 
Hence 


(x +4)? —15 <0 
(x +4)? < 15 
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Using the result after Example 1.36 we may write 


—V15<x+4<vI15 
—V15-4<x<vV15-4 
—7.873 < x < —0.127 


EXERCISES 1.6 


1. Solve the following inequalities: (d) y’ —2y— 2430 
(a) 2x > 6 (b) 7 > 0.6 (c) #2 +6h+9<1 
(c) 3r < 12 (d) z+1>4 @ P+6r+7>20 
(e) 3u-2<4 (f) 6-k> —l (g) x +4x-6 <0 
() G=20 4 (h) m2>2 (h) 417 4+4t+9 < 12 
3 () x+4 1 6) 2=3 2¢ 
i > < 
(i) x2 <9 (j) v+1<10 x—5 t+6 
(k) x27 +10 <6 (l) 2k2-3>1 3v4 12 x2 
(m) 10-20? <6 (n) 5 +4K > 21 Gea 
—2)° < 25 3t+1 16 1 
(0) (v—2) (p) Gt+ 1° > tis — : 26 (a) a <2 
2 Solve the following inequalities: ts y~ 
(0) RF >0 (p) 8 >8 
(a) x —6r4+8>0 ?P+6r+9 
(b) 2+ 6x48 <0 Oa 
(c) 2 +4+31-2<0 (1) @+1)\@—-2)@+3) >0 
Solutions 
1 (a) x>3 (b) y > 2.4 (c) t<4 (d) y>6ory< 4 
(d) z>3 (e) v<2 ff) k<7 (ec) -4<h< 2 
(g) v>5 (h) m> V20rm< —V2 (f) r>V2—-30rr< -V2-3 
G) —3<x<3 (g) -V10-2<x< V10-—2 
a 1 
Q -i<v<3 th) -2 <tr 
(k) no solution ; 2 2 
() k>V2ork < —V2 @G) x>5 
(m) v > V2 orv < —V2 Gj) t< > ort > ~6 
(n) k>2ork < —2 (k) -4<v0 <3 
(0) -3<u<7 
5 qd) x>-I1withx 40 
(p) t> lort <—= 
3 (m) x < 0 (n) -S<y<2 
2 (a) x>4orx <2 (o) k>0 (p) x>2 
(b) -4<x< -2 (q) t<—5 


(c) eee (@”) x>2o0r—3<x<-l 


1.7.1 
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PARTIAL FRACTIONS 


Given a set of fractions, we can add them together to form a single fraction. For example, 
in Example 1.32 we saw 


2 4 %(x+2)4+441) 
eet” #2” ~~ eG 4D) 
6x + 8 
~ 2 43x4+2 


Alternatively, if we are given a single fraction, we can break it down into the sum of easier 
fractions. These simple fractions, which when added together form the given fraction, 
6x + 8 2 4 

are and . 

x? + 3x+2 x+1 x+2 
When expressing a given fraction as a sum of partial fractions it is important to clas- 
sify the fraction as proper or improper. The denominator is then factorized into a product 
of factors which can be linear and/or quadratic. Linear factors are those of the form 
ax + b, for example 2x — 1, = + 6. Repeated linear factors are those of the form 


are called partial fractions. The partial fractions of 


(ax + b)?, (ax + b) and so on, for example (3x — 2)* and (2x + 1)? are repeated linear 
factors. Quadratic factors are those of the form ax? + bx + c, for example 2x” — 6x+1. 


Linear factors 


We can calculate the partial fractions of proper fractions whose denominator can be 
factorized into linear factors. The following steps are used: 


( 


(2) Each factor of the denominator produces a partial fraction. A factor ax + b produces 


Factorize the denominator. 


Ym 


a partial fraction of the form ey where A is an unknown constant. 
ax 


(3 


wm 


Evaluate the unknown constants of the partial fractions. This is done by evaluation 
using a specific value of x or by equating coefficients. 


A ee factor ax + b in the denominator produces a partial fraction of the form 


ax +b 


Example 1.38 Express 


Solution 


6x+ 8 
x2+3x+2 


as its partial fractions. 


The denominator is factorized as 


P+ 3x+2= (+1042) 


40 
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A 
The linear factor, x + 1, produces a partial fraction of the form eat The linear factor, 
x 


B 
x + 2, produces a partial fraction of the form a3" A and B are unknown constants 
whose values have to be found. So we have 
6x +8 6x + 8 A B 
5 = = + (1.8) 
x? +3x+4+2 (x+1)(~+2) x+1 x42 


Multiplying both sides of Equation (1.8) by (x + 1) and (x + 2) we obtain 


6x +8 =A(x+2)+ B(x +1) (1.9) 


We now evaluate A and B. There are two techniques which enable us to do this: evalu- 
ation using a specific value of x and equating coefficients. Each is illustrated in turn. 


Evaluation using a specific value of x 


We examine Equation (1.9). We will substitute a specific value of x into this equation. 
Although any value can be substituted for x we will choose a value which simplifies 
the equation as much as possible. We note that substituting x = —2 will simplify the 
r.h.s. of the equation since the term A(x + 2) will then be zero. Similarly, substituting in 
x = —1 will simplify the r-h.s. because the term B(x + 1) will then be zero. Sox = —1 
and x = —2 are two convenient values to substitute into Equation (1.9). We substitute 
each in turn. 
Evaluating Equation (1.9) with x = —1 gives 


—6+8=A(—-1+2) 
2=A 
Evaluating Equation (1.9) with x = —2 gives 
—4= B(-1) 
B=4 
Substituting A = 2, B = 4 into Equation (1.8) yields 


6x+8 _ 2 i: 4 
e4+3x4+2 x4+1 x42 


2 
Thus the required partial fractions are ——— and ——. 
x+1 x+2 
The constants A and B could have been found by equating coefficients. 


Equating coefficients 


Equation (1.9) may be written as 
6x+8 = (A+B)x+2A+B 

Equating the coefficients of x on both sides gives 
6=A+B 

Equating the constant terms on both sides gives 


8=2A+B 


1.7.2 


Example 1.39 


Solution 


Example 1.40 


Solution 
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Thus we have two simultaneous equations in A and B, which may be solved to give 
A = 2 and B = 4as before. 


Repeated linear factor 


We now examine proper fractions whose denominators factorize into linear factors, 
where one or more of the linear factors is repeated. 
A repeated linear factor, (ax + b)?, produces two partial fractions of the form 
A B 
7 2 
ax+b  (ax+b) 


A repeated linear factor, (ax + b)*, leads to partial fractions 
A B 
+ 
ax+b  (ax+b)? 


Express 
2x+5 
x +2x+1 


as partial fractions. 


The denominator is factorized to give (x + 1)”. Here we have a case of a repeated factor. 


This repeated factor generates partial fractions . Thus 


Pas ie (x+ 1)? 
2x+5 2x+5 A B 


xe+2x+1 (etl)? x+1 (+1) 


Multiplying by (x + 1)? gives 
2x+5=A(x+1)+B=Ax+A+B 
Equating coefficients of x gives A = 2. Evaluation with x = —1 gives B = 3. So 


ae 2 3 
xe+2x4+1 x+1 («+17 


Express 
14x? + 13x 
(4x? + 4x + 1)(x- 1) 


as partial fractions. 


The denominator is factorized to (2x-+1)* (x—1). The repeated factor, (2x+1)*, produces 
partial fractions of the form 


A,B 
+1 (Qx+1) 
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Cc 
The factor, (x — 1), produces a partial fraction of the form ———. So 


14x? + 13x = 14x? + 13x = A ae B 4 C 
(42 +4x+1)@—-—1) (x+1)22—-1) 2x+1  (Qe+1)2 x-1 
Multiplying both sides by (2x + 1)?(x — 1) gives 


14x? + 13x = A(2x + 1)(x— 1) + Box — 1) + C(x +1)? (1.10) 


The unknown constants A, B and C can now be found. 
Evaluating Equation (1.10) with x = | gives 


7=Cey 
from which 
C=3 
Evaluating Equation (1.10) with x = —0.5 gives 
14(—0.5)” + 13(—0.5) = B(—0.5 — 1) 
from which 
B=2 
Finally, comparing the coefficients of x” on both sides of Equation (1.10) we have 
14=2A+4C 
Since we already have C = 3 then 
A=1 
Hence we see that 


14x? + 13x ee. . 2 a 3 
(4+ 4+ 1)@—-1) 0 2e+1  Qxt+1P  x-1 


1.7.3 Quadratic factors 


We now look at proper fractions whose denominator contains a quadratic factor, that is 
a factor of the form ax” + bx +c. 


A quadratic factor, ax* + bx + c, produces a partial fraction of the form 
Ax+B 
ax? + bx +c 


Example 1.41 Noting that x° + 2x7 — 11x — 52 = (x — 4)(x? + 6x4 13), express 


3x7 + llx+ 14 
x3 + 2x2 — 11x — 52 


as partial fractions. 


Solution 


1.7.4 
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The denominator has already been factorized. The linear factor, x — 4, produces a partial 


fraction of the form <a 
The quadratic factor, x? + 6x + 13, will not factorize further into two linear factors. 
B. C 
Thus this factor generates a partial fraction of the form ee Hence 
x? + 6x + 13 
3x7 + llx+ 14 2 A + Bx+C 

(x—4)(02 46x +13) x-4 246x413 
Multiplying by (x — 4) and (x* + 6x + 13) produces 

3x° + lIx+ 14 =A? + 6x + 13) + (Bx +C)(x — 4) (1.11) 


The constants A, B and C can now be found. 
Putting x = 4 into Equation (1.11) gives 


106 = A(53) 
A=2 


Equating the coefficients of x? gives 


3=A+B 
B=1 


Equating the constant term on both sides gives 


14 = A(13) —4C 
C=3 
Hence 


3x7 +11x+14 7 2 x+3 
x3 +2x2-—11x-52 x-4 x2 +6x4+13 


Improper fractions 


The techniques of calculating partial fractions in Sections 1.7.1 to 1.7.3 have all been 
applied to proper fractions. We now look at the calculation of partial fractions of im- 
proper fractions. The techniques described in Sections 1.7.1 to 1.7.3 are all applicable 
to improper fractions. However, when calculating the partial fractions of an improper 
fraction, an extra term needs to be included. The extra term is a polynomial of degree 
n — d, where n is the degree of the numerator and d is the degree of the denomina- 
tor. A polynomial of degree 0 is a constant, a polynomial of degree | has the form 
Ax + B, a polynomial of degree 2 has the form Ax* + Bx + C, and so on. For exam- 
ple, if the numerator has degree 3 and the denominator has degree 2, the partial fractions 
will include a polynomial of degree n — d = 3 — 2 = 1, that is a term of the form 
Ax + B. If the numerator and denominator are of the same degree, the fraction is im- 
proper. The partial fractions will include a polynomial of degree n — d = 0, that is a 
constant term. 
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Let the degree of the numerator be 7 and the degree of the denominator bed. Ifn > d 
then the fraction is improper. Improper fractions have partial fractions in addition to 
those generated by the factors of the denominator. These additional partial fractions 
take the form of a polynomial of degree n — d. 


Example 1.42 Express as partial fractions 


4x3 + 10x +4 
2x2 +x 


Solution The degree of the numerator is 3, that is n = 3. The degree of the denominator is 2, that 
is d = 2. Thus, the fraction is improper. 
Now n — d = | and this is a measure of the extent to which the fraction is improper. 
The partial fractions will include a polynomial of degree 1, that is Ax + B, in addition to 
the partial fractions generated by the factors of the denominator. 
The denominator factorizes to x(2x + 1). These factors generate partial fractions of 


G D 
the form — + ——\. Hence 


x 2x+1 
4°410x+4 4°410x4+4 Cc D 
= = Ax+B+—+>—— 
2x? +x x(2x + 1) x 2x+1 
Multiplying by x and 2x + 1 yields 
Ax? + 10x +4 = (Ax + B)x(2x + 1) + C(2x + 1) + Dx (1.12) 


The constants A, B, C and D can now be evaluated. 
Putting x = 0 into Equation (1.12) gives 


4=C 
Putting x = —0.5 into Equation (1.12) gives 


js 
=o 


D=3 
Equating coefficients of x* gives 


4=2A 
A=2 


Equating coefficients of x gives 


10=B+2C+D 


B=-1 

Hence 
4x3 + 10 4 4 3 
eee aed 


2x2 +x eed 


1 


EXERCISES 1.7 


Calculate the partial fractions of the following 
fractions: 


(a) 6x + 14 () 7—2x 

a). ees ea eee 

x2 +4x+3 x2—x—-2 
3x +6 8-—x 
nadine dad) ———>—_ 

©) 524 3x ” Geaeel 

13x? + llx +2 
(e) 


(x + 1)(2x + 1)(3x+ 1) 


Calculate the partial fractions of the following 
fractions: 
2x+7 4x —5 
paris fake hy) ee 
(a) x2 +6x+9 ) x2—2x4+1 
ie 3x? + 8x +6 
ic 
(x2 + 2x + 1)(x +2) 
3x? — 3x—2 3x7 + 7x +6 
(qd) ~,————~ (e.) —3-—35- 
(x- — 1)(x—- 1) x? + 2x 
Express the following as partial fractions: 
ies x2 +x+2 
G? + I@t+1) 
Solutions 
(a) 2 nq 4 (b) 1 3 
OS” eal a cae es | 
2; 1 3 5 
ie tie Se, Gy. ee 
es Om tei 
oy 2 * 1 1 
e 
xt+1 2x4+1 3x+1 
(a) 2 is 1 ) 4 1 
a eee ee 
x+3  (x+3)2 x=1> @=1) 
Cee ae 
c 
x+1 (x+1)2 x42 
(a) 2 1 1 
x-1 (x—1)2 x4+1 
2 3 1 
nt a eae 
Ce eae 
a alee 
x+1 x24] 
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és) 5x2 4+ 11x +5 
(2x + 3)(x2 + 5x +5) 
42 4.5 
(c) 


(x? + 1)(x? + 2) 
18x? + 7x +44 
(2x — 3) (2x2 + 5x +7) 
2x 
(x2 —x+ 1) (x2? +241) 


(d) 


(e) 


Express the following fractions as partial fractions: 


2 
x + 7x + 13 12x—4 
(a) ————~— (b) 
x+4 2x — 1 
6 + 8x42 
6) ee 
x2 +6x+1 
(a) x9 — 2x? + 3x —3 
x2 —2x+1 
2x3 + 2x2 —2x-1 
(e) = 2a - 
XW +X 
(b) 2x 1 
x245x+5  2x+3 
1 3 
©) etl 242 
5 4x —3 
d 
@) 2x—3 2x? 4+ 5x+7 
(ec) 1 1 
e 
xw—xt+1 4x41 
(a) 3+ : (b) 6+ z 
ane x44 x —1 
2x+1 
ce) 1+ ——— 
(©) + 6x41 
(d) 2 1 
oT eet pee 
(e) 2 1 1 
e epi en mili 
x x+1 
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Technical Computing Exercises 1.7 


1 Use a technical computing language such as The result is: 


MATLAB® to verify the solutions to the problems in r= 
Exercises 1.7. In MATLAB®, the function residue 

calculates the partial fraction expansion. For example, 4.0000 
exercise 1(a) would be solved by typing the following: 2.0000 
b = [6 14]; p= 

a= [1 4 3]; 

[r,p,k] = residue(b, a) -1 
Notice how the coefficients of the numerator are input “2 

in the formb = [6 14]; this is known as a row k= 
vector. The concept of a vector will be discussed in [] 


later chapters. For now it is adequate to treat this as a 


horizontal list of numbers which are passed to Examining the solution we note that the output for 


MATLAB® in a specific order. both r and p is arranged as a vertical list. This way of 
representing the output is known as a column vector. 

Similarly, the coefficients of the denominator are We note that the numbers returned in column vector p 

input bya = [1 4 3]. have a negative sign. This is because the result 

Each vector is arranged with the coefficient of the calculated contains the poles of the partial fraction 

highest power of x first. expansion. These are values of the variable which 


make the denominator of the fraction zero. The 
significance of this will become clear later in the text 
but for now it is adequate to note the difference in 
sign from what might have been expected. 


GEE] summation notation 


In engineering we often want to measure the value of a variable, such as current, voltage 
or pressure. 

Suppose we make three measurements of a variable x. We can label these measure- 
ments x,, x, and x,. In this context, the numbers 1, 2, 3 are called subscripts. 

In mathematics, the Greek letter sigma, written > , Stands for a ‘sum’. For example, 
the sum x, + x, + x3 is written 


3 
» x 
k=l 


Note that the subscript k ranges from | to 3. As k ranges from | to 3, x, becomes x, then 
x, and then x, and the sigma sign tells us to add up these quantities. 
In general, 


N 
De aM tte tay 
k=1 


This notation is often used to express some of the fundamental equations of electrical 
circuit analysis. Sometimes ‘Summation Notation’ is known as ‘Sigma Notation’. 
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Engineering application 1.8 


Kirchhoff’s current law 


Kirchhoff’s current law, often abbreviated to KCL, provides one of the fundamental 
equations for analysing electrical circuits. The law states that the sum of the currents 
flowing out of any junction, or node, in a circuit must equal the sum of the currents 
flowing into it. 

This principle is intuitive as it has a direct analogy with fluid flow in connected 
water pipes. Currents flowing into a junction are considered positive; those flowing 
out of a junction are negative. It is then valid to say that the sum of the currents 
at a junction is zero. If there are N currents at the junction, denoted J,, J,,..., I), 
then 


ar ap ise Ss al eral) 


This can be expressed using the summation notation as 


N 
0 
Kl 


Here J, means ‘the current, /, in branch k’. The first equation can be produced from 
the summation notation by first substituting k = 1, then k = 2, right up tok = N. 
The expression below the summation symbol tells you where to start and the variable 
to be substituted, and the number above the summation symbol indicates where to 
stop counting. Summation notation is a very compact and precise way of expressing 
KCL for any number of currents at a node. 

Consider the node shown in Figure 1.5. 


Branch | Branch 3 
1A 2A 
3A 2A Figure 1.5 
A circuit node with four separate branches. The currents are 
Branch 2 Branch4 given in amperes (or amps, A). 


It can be seen that the total current flowing into the node is 1 + 3 = 4 amps. The 
current flowing out of the node is 2 + 2 = 4 amps. Clearly, 


Total current flowing into node = total current flowing out of node 


Alternatively, using the summation form of KCL we have 


4 
Yok =h+h+h+l,=0=14+3-2-2 
k=l 
Note that for currents flowing out of the node a negative sign is used and for currents 
flowing into the node a positive sign is used. This is equivalent to considering the 
currents separately as inward and outward flowing currents and equating the two. 
Suppose for a moment that we did not know the current in branch 4 and, 
furthermore, it was not labelled with an arrow to show the direction of current flow. 
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This situation is likely to occur in a circuit problem in electronics. There are 
two options for labelling the current flow direction, and these are summarized in 


Figure 1.6. 
1A AN 1A aN 
3A i 3A I, Figure 1.6 
Two different ways of defining the 
Branch 4 Branch 4 current direction in Branch 4. 
qe 3—=2= 1, =O Isp 3=2ar/h =O 
iy, =2 i 
Note that the two solutions are both correct but J, = —2 has a negative sign, which 


simply indicates that the current flows in the opposite direction to the arrow drawn 
on the right-hand diagram. It does not matter which way round the arrow is marked, 
as long as we observe the sign. 


Engineering application 1.9 


Kirchhoff’s voltage law 


Kirchhoff’s voltage law, often abbreviated to KVL, provides another of the funda- 
mental equations for analysing electrical circuits. The law states that the sum of the 
voltages around a closed loop equals zero. It is often written down in the form of a 
summation, as follows: 


N 
Da 
k=1 


For the circuit shown in Figure 1.7 there are three possible loops to which we 
could apply KVL. 


Figure 1.7 
A simple circuit to illustrate Kirchhoff’s voltage law. 


In this example an ideal voltage source and resistors are used, although any compo- 
nents could be substituted as KVL applies universally. Note that we ‘walk around’ 
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the circuit when writing down the equations. If the arrow is in the direction of travel 
then it is given a positive sign; if it opposes the direction of travel it is given a 
negative sign. 

The equations are 


— 


Vi | Vy 


V,-V,-V, =0 et) V,-V,—-V, =0 


a e 


If the equations are solved and the voltage has a negative sign it indicates that the 
polarity is opposite to the direction of the voltage arrow drawn on the diagram. KVL 
and KCL are the fundamental circuit laws that allow networks of electronic compo- 
nents to be mathematically analysed. Although they are simple in concept they are 
very powerful techniques. 


EXERCISES 1.8 


1. Write out fully what is meant by each of the following 4 Determine the current, /, at each of the following 
expressions: circuit nodes: 
(a) Dea % (b) Dei (a) (b) 
© Thais @ rh a ie ae 
(©) Y410j;-2%  — () YpipQn+ 1? 1 I 

2 Write out fully (c) 
@ Tepe — &) DED a 

3 Write the following sums more concisely by using I 


sigma notation: 
(a) 3423433 +---+10° 
1 


Cer 
1 2°3 4° 12 
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5 Find V, in each of the following circuits: 6 Find V, and V, using KVL. 
(a) 


ty [hy 


2V aN fiv 


(b) 
[ety 
2V 
{¥ 
Solutions 
1 (a) x) +X) +243 424 (b) x) +X +23 + X4 4 Allsolved using KCL 
(C) xy +X +43 4X4 +5 +X +7 (a) 3-1-l1=0../=2 
b) 34+1-/J=0.7,1=4 
@d) s+4+% (b) 3+ Ae 


(c) 34+14+7=0..1=-4 


3 3 3 3 
(©) (& — 2)" + @ — 2)? + 3 — 2)° + (4 — 2) (d) h+h+h-1=0, JT=h+tht+h 
@ 14324747 orl = Dia Ik 
2 (a) -1+2-3+4 5 Both solved using KVL 


(a) 2-1-V,=0.-.V,=1 
(b) 24+ (-1)4+V,=0..V,=-1 
6 3-1-V,=0..V,=2 


(b) 1-44+9- 16425 
ma yew 


(b) 12 (—1)*t! 3-1-1-YV=0..Yy=1 
k=1—_k or V, —1—V, =0 .”. by substitution for V,, 
3 4 = 
(c) R=6 st or nel oT My ; 
REVIEW EXERCISES 1 
1. Simplify each of the following as far as possible: 2 Simplify as far as possible: 
6 -1 
674 oF 4)-2 X 
(a) 7°7 (b) =) (c) (3*) (a) x! x73 (b) (x2)4 (c) (“) 


3 
(d) 3462 (e) (32/3.41/3)6 (f) a (d) (y~2)7! (e) yl Byy2 


3 Remove the brackets and simplify: 


_2\7! 
(a) 2xy)3 (b) (a2 Ye)? (c) @ 


=? 
=) ee) 
@) (@y7)5 © (2 ~ 
= 


4 Express the following as their partial fractions: 


3x+ 11 —3-x 
oO E-37Gen won 
©) 6x? — 2 @ 2x? —x—7 
2x2 — x (x+ 1)(«-— 1)@4 2) 
4x —11 
) 92 4 ne7 
5 Convert the following into a single fraction: 
1 3 6 
OD go Gad 
(b) -+5+e — 
sos 85 +6 
6 10 s+1 ' s—1 
oo 2 Geese) bt OoLs 


6 Express the following as partial fractions: 


) 5x b) 3x+2 
©) G+ G3) OY Sea 
y+3 1 
= y2 + 3y+2 ) 124 3t +2 

ies 222 + 152+ 30 
(z+ 2)(z+3)(z+ 6) 
i 24x? + 33x +11 
(2x + 1)(3x + 2)(4x + 3) 
s+3 Urtk+d 
©) (s +1)? (h) BH-k 
; el ‘ t+5 
@ +41 (t +3)? 
s2 8x — 15 
) stl ®) Ax? — 12x +9 
6d? + 15d +8 2x7 + x43 
ON) saat, | eae 
(d+ 1)*(d+ 2) x? + 2x41 
-y-1 s* —85—5 
Eanes) “ @eepEss 
2 =9 253 352 — _ 
(q) t+t () s+ 3s S 


(t —2)2(t + 1) 
+42 + 7x +5 


(s) x2 43x42 


srts—1 


11 


a2; 


13 


Review exercises 1 


Solve the following quadratic equations using the 
quadratic formula: 


(a) x7+10x+2=0 
(b) y? —6y-3=0 
(c) 212 4+2r-9=0 
(d) 3z2—9z-1=0 
(e) 5v°+v—-6=0 


Solve the quadratic equations in Question 7 by 
completing the square. 


Solve 
x — 4° — 25x +28 =0 
given x = 7 is a root. 


Solve the following inequalities: 


(a) 6&r-1<4 (b) —6 < 3r <6 
2 

@ (=% e448 (d) 2< 7 

(e) (x—2)? > 36 (f) x2 —2x -3 <0 
==" 30 (h) x2 —8x+5<0 
x 3 x2 —2x-3 

i <- j) ————— 50 

@) D x @ x—5 ‘ 


Express each fraction in its simplest form. 


3ab" 6x7 y?z ot +6 
(a) (b) — (c) 

12ab 3xy3z 12 — 31 
(a) 3x7 + 3x (e) xyz — 2x7y*z 

ee ey eee 

6x2 — 3x x2y2 — Ix3y 3 


Express each fraction in its simplest form. 


ep a5 y +4y—12 
Cees () == — 
x“ — 2x —3 yo + 13y + 42 
2x7 + 7x —4 3x71 + 3xt — 3¢ 
eel aaa @ = 
2x- —3x+1 Ax¢z + 4xz — 4z 
(e) we -—W+4+x-2 
a). 
e4+x24+x41 
Express as a single fraction in its simplest form. 
x+1l %*x+6 
(a) x+6 . x+2 


3x-6 xyt+3y 
x 
xy+2y 4x-8 


(b) 


2-1 x-1 
4° 6 

x? — 9x x-9 

x+1 7 8432 

x? —5x-6 | x2 -1 
xe+x—42 ° x2+6x-—7 


(c) 


(d) 


(e) 
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14 Express as a single fraction: 


5 3 
(a) +6 x41 
3x 4 
(b) —— - —~ 
2x —1 x+5 
Solutions 
1 (a) 7/0 (b) 6 (c) 378 
(d) 326 (e) 344? (f) 10 
2 (a) x* (b) x8 (c) Yx 
(d) y? (ey y 
3 (a) 8x°y3 (b) 36a*b%c (c) 2y2 
9x4 
(d) xy () > 
y 
4 (a) 2 
" x-3 x+7 
3 4 
(b) — — 
x x-l 
ts) 342 1 
; ee 
x 2x—1 
2 1 1 
(d) 
x+1 x—1 x+2 
3 2 
(e) = 
x+7 2x+1 
19x? + 22x +4 
5 (a) Pi ie Bier Dalal 
Qx(x + 2)(2x + 1) 
(by 387 12s? + 22s + 12 
2s? (4s +3) 
‘és 2(3s* + 30s? + 120s? + 202s + 120) 
Cc 
s*(s + 2)(s +3)(s +4) 
1 3 
6 
) Fal ps 
7 4 
(b) a 
x+3 x4+2 
é 2 1 
ey ee 
ytl y+2 
Xs 1 1 
t+l t+2 
2 1 1 
(e) 
z+2 72+3 7<+6 
1 3 2 
© +1 3x42 4x +3 


(c) 
(d) 


(e) 


(g) 


(h) 


(i) 


Q@) 


(k) 


@) 


(m) 


(n) 


(0) 


1 r 2 
stl (s+1)? 


oe 
1 " 2 
t+3  (t+3)2 


4 3 
2x—-3 (2x—3)2 
4 1 2 


i 
d+1 (d+1)? 
3 i 4 
xt1l 0 («+1)? 


@) 53.7 = a 


(q) 


(r) 


(s) 


(a) 
(b) 
(c) 
(d) 
(e) 


(a) 
(b) 


(c) 


11 4 


"d+2 


2 


9(t — 2) a 3(t — 2)2 
3 
szt+s—1 


1 1 
| ne ere 
— vat) has 


2s+1— 


—9.7958, —0.2042 
—0.4641, 6.4641 
—2.6794, 1.6794 
—0.1073, 3.1073 
=1.2 51 


(x +5)? -23=0 
(y —3)?-12=0 


[oy 2] 


9(t+ 1) 


9 


11 


Solutions same as for Question 7 


x=-—4,1,7 


(a) t< (b) -2<r<2 
(c) v>-l (d) x>8 

(e) x< —4o0rx>8 

(ff) -l<x<3 

(g) x <—lorx>3 

(h) 4—VJ11 <x<44V11 
@) O0<x<v6,x< —-V6 


G) x>Sor-l<x<3 


5 
6 


12 


13 


14 


(a) 


(d) 


(a) 


(c) 


(a) 


(b) 


(c) 


(d) 


(e) 


Review exercises 1 


x+5 y-2 x+4 
x+1 (b) yt7 (©) x-1 
3t x—2 

dy (e) xo+1 
x+1 (b) 3(x + 3) 
x+2 4(x+ 2) 

et) we ©) 1 

2 

8x + 23 

(x+ l)(@+6) 

3x2 +7x+4 

(2x — 1)(x +5) 

5x* — 29x + 3 

(x — 6)(x +3) 

x2 —2x-1 

x-3 


2x4 — x3 — x2 -2x-2 
(x+1)0? +1) 
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INTRODUCTION 


The study of functions is central to engineering mathematics. Functions can be used to 
describe the way quantities change: for example, the variation in the voltage across an 
electronic component with time, the variation in position of an electric motor with time 
and the variation in the strength of a signal with both position and time. 

In this chapter we introduce several concepts associated with functions before going 
on to catalogue a number of engineering functions in Section 2.4. Much of the material 
of Section 2.4 will already be familiar to the reader and so this section should be treated 
as a reference section to be dipped into whenever necessary. A number of mathematical 
methods are also included in Section 2.4, most of which will be familiar but they have 
been collected together in order to make the book complete. 

When trying to understand a mathematical function it is always useful to sketch a 
graph in order to obtain an idea of its behaviour. The reader is encouraged to sketch such 
graphs whenever a new function is met. Graphics calculators are now readily available 
and they make this task relatively easy. If you possess such a calculator then it would be 
useful to make use of it whenever a new function is introduced. Software packages are 
also available to allow such plots to be carried out on a computer. These can be useful for 
plotting more complicated functions and ones that depend on more than one variable. 
We examine functions of more than one variable in Chapter 25. 

Throughout the book we make use of the term mathematical model. When doing so 
we mean an idealization of an engineering system or a physical situation so that it can be 
described by mathematical equations. To reflect an engineering system very accurately, 
a sophisticated model, consisting of many interrelated equations, may be needed. Al- 
though accurate, such a model may be cumbersome to use. Accuracy can be sacrificed 


2.2 Numbers and intervals 55 


in order to achieve a simple, easy-to-use model. A judgement is made as to when the 
right blend of accuracy and conciseness is achieved. For example, the most common 
mathematical model for a resistor uses Ohm’s law which states that the voltage across a 
resistor equals the current through the resistor multiplied by the resistance value of the 
resistor, that is V = JR. However, this model is based on a number of simplifications. It 
ignores any variation in current density across the cross-section of the resistor and as- 
sumes a single current value is acceptable. It also ignores the fact that if a large enough 
voltage is placed across the resistor then the resistor will break down. In most cases it is 
worth accepting these simplifications in order to obtain a concise model. 

Having obtained a mathematical model, it is then used to predict the effect of chang- 
ing elements or conditions within the actual system. Using the model to examine these 
effects is often cheaper, safer and more convenient than using the actual system. 


NUMBERS AND INTERVALS 


Numbers can be grouped into various classes, or sets. The integers are the set of numbers 
{esi = 3,'=2;,=1;:0;.1, 2,332.2} 


denoted by Z. The natural numbers are {0, 1, 2, 3, ...} and this set is denoted by N. The 
positive integers, denoted by N‘, are given by {1, 2, 3,...}. Note that some numbers 
occur in more than one set, that is the sets overlap. 

A rational number has the form p/g, where p and gq are integers with g 4 0. For 
example, 5/2, 7/118, —1/9 and 3/1 are all rational numbers. The set of rational numbers 
is denoted by Q. When rational numbers are expressed as a decimal fraction they either 
terminate or recur infinitely. 


cole Nin 


can be expressed as 2.5 | These decimal fractions terminate, 


can be expressed as 0.125 that is they are of finite length. 


JR el 


can be expressed as 0.111 111... | These are infinitely 


can be expressed as 0.090909... recurring decimal fractions. 


1 


A number which cannot be expressed in the form p/q is called irrational. When 
written as a decimal fraction, an irrational number is infinite in length and non-recurring. 
The numbers 7 and V2 are both irrational. 

It is useful to introduce the factorial notation. We write 3! to represent the product 
3 x 2 x 1. The expression 3! is read as “factorial 3’. Similarly 4! is a shorthand way of 
writing 4 x 3 x 2 x 1. In general, for any positive integer, n, we can write 


n! = n(n — 1)(n—2)(n — 3)... (3)(2)(1) 


It is useful to represent numbers by points on the real line. Figure 2.1 illustrates some 
rational and irrational numbers marked on the real line. Numbers which can be repre- 
sented by points on the real line are known as real numbers. The set of real numbers 
is denoted by R. This set comprises all the rational and all the irrational numbers. In 
Chapter 9 we shall meet complex numbers which cannot be represented as points on the 


Figure 2.1 
Both rational and irrational numbers are represented on the real line. 
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6 +4 -1 O 2 3 4 


Figure 2.2 
The intervals (—6, —4), [—1, 2], (3, 4] depicted on the real line. 


real line. The real line extends indefinitely to the left and to the right so that any real 
number can be represented. 

Sometimes we are interested in only a small section, or interval, of the real line. We 
write [1, 3] to denote all the real numbers between 1 and 3 inclusive, that is 1 and 3 are 
included in the interval. Thus the interval [1, 3] consists of all real numbers x, such that 
1 <x < 3. The square brackets, [ ], are used to denote that the end-points are included 
in the interval and such an interval is said to be closed. The interval (1, 3) consists of 
all real numbers x, such that | < x < 3. In this case the end-points are not included 
and the interval is said to be open. Brackets, (), denote open intervals. An interval may 
be open at one end and closed at the other. For example, (1, 3] is open at the left and 
closed at the right. It consists of all real numbers x, such that 1 < x < 3, and is known 
as a semi-open interval. Open and closed intervals can be represented on the real line. 
A closed end-point is denoted by e; an open end-point is denoted by o. The intervals 
(—6, —4), [—1, 2] and (3, 4] are illustrated in Figure 2.2. 

An upper bound of a set of numbers is any number which is greater than or equal 
to every number in the given set. So, for example, 7 is an upper bound for the set [3, 6]. 
Clearly, 7 is greater than every number in the interval [3, 6]. 

A lower bound of a set of numbers is any number which is less than or equal to every 
number in the given set. For example, 3 is a lower bound for the set (3.7, 5). 

Note that upper and lower bounds are not unique. Both 3 and 10 are upper bounds 
for (1, 2). Both —1 and —3 are lower bounds for [0, 6]. 

Technical computing languages such as MATLAB® usually have functions that auto- 
matically generate a set of numbers within a particular interval. In MATLAB® we could 
generate a set of time values, t, by typing: 


t= 020.124 


This generates a set of real numbers from the interval [0 , 1] stored in a row vector t, each 
individual number being separated by an increment of 0.1. The values of t generated 
are: 


0 0.1000 0.2000 0.3000 0.4000 0.5000 0.6000 0.7000 
0.8000 0.9000 1.0000 


BASIC CONCEPTS OF FUNCTIONS 


Loosely speaking, we can think of a function as a rule which, when given an input, 
produces a single output. If more than one output is produced, the rule is not a function. 
Consider the function given by the rule: “double the input’. If 3 is the input then 6 is the 
output. If x is the input then 2x is the output, as shown in Figure 2.3. 

If the doubling function has the symbol f we write 


fix 2x 
or more compactly, 


ff) = 2x 


Example 2.1 


Solution 


2.3.1 


Example 2.2 


Solution 


Example 2.3 


2.3 Basic concepts of functions 57 


f f 
3 double ps double dx 
the input the input 
Figure 2.3 


The function: ‘double the input’. 


The last form is often written simply as f = 2x. If f(x) is a function of x, then the value 
of the function when x = 3, for example, is written as f(x = 3) or simply as f(3). 


Given f(x) = 2x + | find 


(a) f(3) (b) f() 
(ce) f(-)) (d) f(a) 
(e) fa) (f) ft) 
(g) fe +1) 


(a) f(3) =2(3)+1=7 

(b) f(0) =2(0)+1=1 

(ce) f(-l =2(-I)+1=-1 

(d) f(q@) is the value of f(x) when x has a value of a, hence f(a) = 2a + 1 
(e) fe) =2(e)+1=4a+1 

(f) fi) =2t+1 

(g) ft+1)=2¢4+14+1=21+3 


Observe from Example 2.1 that it is the rule that is important and not the letter being 
used. Both f(t) = 2f + 1 and f(x) = 2x + | instruct us to double the input and then 
add 1. 

Argument of a function 


The input to a function is often called the argument. In Example 2.1(d) the argument is 
a, while in Example 2.1(e) the argument is 2a. 


Given f(x) = - write down 


(a) f (Sx) (b) f(—x) 
(c) f@+2) id) f@?) 
Ox x 
@) foy= =x (b) f(-) =—3 
9: g) 
©) fa+2) == @ foe =% 


Given y(t) = t? +t, write down 


(a) y¢+2) (5) 
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Solution 


2.3.2 


x 


Figure 2.4 
The function: f(x) = 2x. 


Example 2.4 


(a) yt +2) = (+27 +042) =P 4+5t+6 
‘ t\ t\? i 2 4 
95) =(5) *()=545 


Graph of a function 


A function may be represented in graphical form. The function f(x) = 2x is shown in 
Figure 2.4. Note that the function values are plotted vertically and the x values horizon- 
tally. The horizontal axis is then called the x axis. The vertical axis is commonly referred 
to as the y axis, so that we often write 


y= f(x) =2 


Since x and y can have a number of possible values, they are called variables: x is the 
independent variable and y is the dependent variable. Knowing a value of the in- 
dependent variable, x, allows us to calculate the corresponding value of the dependent 
variable, y. To show this dependence we often write y(x). The set of values that x is al- 
lowed to take is called the domain of the function. A domain is often an interval on the 
x axis. For example, if 


f@) =3x+1 —-S5<x< 10 (2.1) 


the domain of the function, f, is the closed interval [—5, 10]. If the domain of a function 
is not explicitly given it is taken to be the largest set possible. For example, 


g(x) =x -4 (2.2) 


has a domain of (—0o, oo) since g is defined for every value of x and the domain has not 
been given otherwise. The set of values that the function takes on is called the range. 
The range of f(x) in Equation (2.1) is [—14, 31]; the range of g(x) in Equation (2.2) is 
[—4, oo). 

We now consider plotting the function f(t) = 7 for 0 < ft < 100 ina technical com- 
puting language. First we generate a number set as shown in Section 2.2. 


t= 0:1:100 


Then we produce a graph of the function by using the MATLAB® plot command to 
give the following 


plot(t, t.*2) 


Consider the function, f, given by the rule: ‘square the input’. This can be written as 


f@=x 


The rule and the graph of f are shown in Figure 2.5. The domain of f is (—oo, oo) and 
the range is [0, 00). 
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f 


square 
the input 


x —> +2 


Figure 2.5 
The function: ‘square the input’. 


Many variables of interest to engineers, for example voltage, resistance and current, can 
be related by means of functions. We try to choose an appropriate letter for a particular 
variable; so, for example, t is used for time and P for power. 


Engineering application 2.1 


Function to model the power dissipation in a resistor 


Recall from Engineering application 1.1 that the power, P, dissipated by a resis- 
tor depends on the current, J, flowing through the resistance, R. The relationship is 
given by 
(Pas iFin 

The power dissipated in the resistor depends on the square of the current passing 
through it. In this case J is the independent variable and P is the dependent variable, 
assuming R remains constant. The function is given by the rule: “square the input and 
multiply by the constant R’, and the input to the function is /. The output from the 
function is P. This is illustrated in Figure 2.6, for the cases R = 4 and R = 2. 


square the 
I ; input and 
multiply 
by R 
Figure 2.6 


The function: P = [7R. 


This model for a resistor only approximates the behaviour of the device. In prac- 
tice, changes in the temperature of the resistor lead to slight changes in the resistance 
value. If the current through the resistor is excessively high then the resistor over- 
heats and is permanently damaged. It no longer has the correct resistance value. The 
amount of power that a resistor can handle depends on the materials that have been 
used in its construction. A good circuit designer would calculate the amount of power 
to be dissipated and then allow a suitable safety margin to ensure that the resistor can- 
not be overloaded. 
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2.3.3 


2.3.4 


Figure 2.7 
The inputs x,, x», x3 and x, all produce the same Each input produces a different output and so 
output, therefore g(x) is a many-to-one function. h(x) is a one-to-one function. 


One-to-many 


Some rules relating input to output are not functions. Consider the rule: ‘take plus or 
minus the square root of the input’, that is 


x > tx 


Now, for example, if 4 is the input, the output is +/4 which can be 2 or —2. Thus a 
single input has produced more than one output. The rule is said to be one-to-many, 
meaning that one input has produced many outputs. Rules with this property are not 
functions. For a rule to be a function there must be a single output for any given input. 

By defining a rule more specifically, it may become a function. For example, consider 
the rule: ‘take the positive square root of the input’. This rule is a function because there 
is a single output for a given input. Note that the domain of this function is [0, oo) and 
the range is also [0, 00). 


Many-to-one and one-to-one functions 


Consider again the function f(x) = x? given in Example 2.4. The inputs 2 and —2 both 
produce the same output, 4, and the function is said to be many-to-one. This means that 
many inputs produce the same output. A many-to-one function can be recognized from 
its graph. If a horizontal line intersects the graph in more than one place, the function is 
many-to-one. Figure 2.7 illustrates a many-to-one function, g(x). The inputs x,, x,, x; 
and x, all produce the same output. 

A function is one-to-one if different inputs always produce different outputs. A hori- 
zontal line will intersect the graph of a one-to-one function in only one place. Figure 2.8 
illustrates a one-to-one function, h(x). 

Both one-to-one functions and many-to-one functions are supported in technical com- 
puting languages. For example, in MATLAB® the function f(x) = x? can be defined by 
using the command: 


f = @(x) x°2; 

It is now possible to type: 
f£ (3) 

or 


f (-3) 


h(x) 


Figure 2.8 


2.3.5 


2.3.6 
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both of which have the same result: 
ans = 9 
affirming that f(x) = x? is a many-to-one function. 


Notice that the variable used by the function is defined in brackets after the @ sign. This 
indicates that the input to the function is x and the command creates a function handle, 
f. Giving the function a handle enables it to be used elsewhere in the program. 

More complicated functions are usually created in a separate file and saved on the 
computer’s internal storage devices. They can be easily reused to create sophisticated 
programs. In MATLAB® these files are saved with the file name extension .m and are 
often termed m-files. 


Parametric definition of a function 


Functions are often expressed in the form y(x). For every value of x the corresponding 
value of y can be found and the point with coordinates (x, y) can then be plotted. Some- 
times it is useful to express x and y coordinates in terms of a third variable known as a 
parameter. Commonly we use ¢ or @ to denote a parameter. Thus the coordinates (x, y) 
of the points on a curve can be expressed in the form 


x=fQ) y=st) 
For example, given the parametric equations 
2 y=2t 0<t<5 


S71 


we can calculate x and y for various values of the parameter rf. Plotting the points (x, y) 
produces part of a curve known as a parabola. 


Composition of functions 


Consider the function y(x) = 2x”. We can think of y(x) as being composed of two func- 
tions. One function is described by the rule: ‘square the input’, while the other function 
is described by the rule: ‘double the input’. This is shown in Figure 2.9. 


g h 
2 
2 square x double x2 
the input the input 


Figure 2.9 
The function: y(x) = h(g(x)). 
Mathematically, if h(x) = 2x and g(x) = x° then 
yx) = 2x7 = 2(g(x)) = h(g(a)) 


The form /(g(x)) is known as a composition of the functions h and g. Note that the 
composition h(g(x)) is different from g(h(x)) as Example 2.5 illustrates. 
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Example 2.5 If f(t) = 2 +3 and g(t) = — write expressions for the compositions 
(a) f(s()) 
(b) g(f(t)) 


+1 
Solution (@) fe) = s(“S*) 


The rule describing the function f is: ‘double the input and then add 3’. Hence, 
t+1 t+1 
= 3=1+4 
(S) =A) 3-14 


f(gt)) =t+4 


So 


(b) s(f(@)) = g(2t + 3) 
The rule for g is: ‘add 1 to the input and then divide everything by 2’. So, 


a 


g(2t+3) = t+2 


Hence 
e(f(t)) =t+2 


Clearly f(g(t)) 4 g(f()). 


2.3.7. Inverse of a function 


Consider a function f(x). It can be thought of as accepting an input x, and producing an 
output f(x). Suppose now that this output becomes the input to the function g(x), and 
the output from g(x) is x, that is 


e(f(x)) =x 


We can think of g(x) as undoing the work of f(x). Figure 2.10 illustrates this situation. 
Then g(x) is the inverse of f, and is written as f—'(x). Since f~'(x) undoes the work 
of f(x) we have 


ff '@) =f'GF¢@) =x 


x f(x) af =x 


—> —_—_—_—_—_—_> ——» Figure 2.10 
The function g is the inverse of f. 


Example 2.6 


Solution 


Example 2.7 


Solution 
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If f(x) = 5x verify that the inverse of f is given by f a) = = 


The function f receives an input of x, and produces an output of 5x. Hence when the 
inverse function, f—!, receives an input of 5x, it produces an output of x, that is 


PF Gn =< 


We introduce a new variable, z, given by 


2= 5x 
sO 
re 
3s 
Then 
apy = 
f @=x 5 


Writing f~' with x as the argument gives 


P@=s 


If f(x) = 2x +1, find f-!(x). 


The function f receives an input of x and produces an output of 2x + 1. So when the 
inverse function, f~!, receives an input of 2x + 1 it produces an output of x, that is 


f iQx+1) =x 
We introduce a new variable, z, defined by 
z=2x4+1 


Rearranging gives 


el 
aimee 


So 


fi@ax=i 


Writing f~! with x as the argument gives 


x-1 


i, oe 5 
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Example 2.8 


Solution 


2.3.8 


—1 
Given g(x) = — find the inverse of g. 


x— 
so that 


1 f{*x-1 x 
We know g(x) = , and so g “=. =x, Lety = 


g'Q)=x 
But, 
x=2y+1 
and so 
g'0) =2y+1 
Using the same independent variable as for the function g, we obtain 


gi(x) =2x4+1 


We note that the inverses of the functions in Examples 2.7 and 2.8 are themselves func- 
tions. They are called inverse functions. The inverse of f(x) = 2x +1 is f-'(x) = 


= and the inverse of g(x) = - is g'(x) = 2x4 1. This illustrates the impor- 


tant point that if f(x) and g(x) are two functions and f(x) is the inverse of g(x), then 
g(x) is the inverse of f(x). It is important to point out that not all functions possess an 
inverse function. Consider f(x) = x*, for —00 <x < ©. 

The function, f, is given by the rule: ‘square the input’. Since both a positive and 
negative value of x will yield the output x’, the inverse rule is given by: ‘take plus or 
minus the square root of the input’. As discussed earlier, this is a one-to-many rule and 
so is not a function. Clearly not all functions have an inverse function. In fact, only one- 
to-one functions have an inverse function. Suppose we restrict the domain of f(x) = x? 
such that x > 0. Then f is a one-to-one function and so there is an inverse function. The 
inverse function is f—!(x) given by 


f° @) = +vx 
Clearly, 
i UG)=f @i=% 


where x is the positive square root of x”. Restricting the domain of a many-to-one func- 
tion so that a one-to-one function results is a common technique of ensuring an inverse 
function can be found. 


Continuous and piecewise continuous functions 


We now introduce in an informal way the concept of continuous and piecewise continu- 
ous functions. A more rigorous treatment follows in Chapter 10 after we have discussed 


1 
limits. Figure 2.11 shows a graph of f(x) = —. Note that there is a break, or discontinu- 
x 


1 
ity, in the graph at x = 0. The function f(x) = — is said to be discontinuous at x = 0. 
x 
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F(x) 
x 
1 3: t 
Figure 2.11 i Figure 2.12 
The function f(x) = ; has a discontinuity at The function f(t) is a piecewise continuous 
x =0; function with a discontinuity at t = 1. 


g(t) 


1 


Figure 2.13 


The function g(t) is a 
continuous function 


on (0, 3). 


2.3.9 


If the graph of a function, f(x), contains a break, then f(x) is discontinuous. 


A function whose graph has no breaks is a continuous function. 


Sometimes a function is defined by different rules on different intervals of the domain. 
For example, consider 


@) 2 O0<t<l 
fO= : i<fe3 


The domain is [0, 3] but the rule on [0, 1) is different to that on [1, 3]. The graph of f(t) 
is shown in Figure 2.12. Recall the convention of using e to denote that the end-point 
is included and o to denote the end-point is excluded. Note that f(t) has a discontinuity 
at t = |. Each component, or piece, of the graph is continuous and f(t) is said to be 
piecewise continuous. 


A piecewise continuous function has a finite number of discontinuities in any given 
interval. 


Not all functions defined differently on different intervals are discontinuous. For example, 


2 O<t<l 


(2 
BO =o Le 23 


is a continuous function on the interval (0, 3), as shown in Figure 2.13. 


Periodic functions 


A periodic function is a function which has a definite pattern which is repeated at regular 
intervals. More formally we say a function, f(t), is periodic if 


f@)=fE+T) 


for all values of t. The constant, T, is known as the period of the function. 


66 


Chapter 2 Engineering functions 


Example 2.9 Figure 2.14 illustrates a periodic waveform. It is often referred to as a triangular wave- 


form because of its shape. The form of the function is repeated every two seconds, that 
is 
FO) = ft +2) 


and so the function is periodic. The period is 2 seconds, that is T = 2. Note that this 
function is continuous. 


SO 


Figure 2.14 
The triangular waveform is a periodic function. 


Engineering application 2.2 


Saw-tooth waveform 


Figure 2.15 illustrates a saw-tooth voltage waveform. It is called a saw-tooth wave- 
form because its shape is similar to that of the teeth on a saw. It has many uses in 
electronic engineering. One use would be to provide a signal to sweep a beam of elec- 
trons across a cathode ray tube in a uniform way and then quickly move the beam 
back to the start again. This technique is used in an analogue oscilloscope and forms 
a signal for the time base. 

The form of the function is repeated every three seconds, that is 


v(t) = v(t +3) 


v(t) 


=3 0 3 6 %) t 


Figure 2.15 
The saw-tooth waveform is a periodic function. 
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Technical computing languages often have a range of built-in functions for pro- 
ducing waveforms. Sometimes specialist functions are provided in a separate soft- 
ware package. In MATLAB®, these software packages are known as toolboxes. The 
signal processing toolbox has a function for generating saw-tooth waves. This can be 
accessed by typing, for example: 


t=(-2*pi:0.1:2*pi) ; 
plot(t, sawtooth(t)) ; 


This will plot two periods of a saw-tooth wave. The first line generates a set of time 
values —27 < t < 27tina vector form with a spacing of 0. 1 between each point. The 
second line plots ¢ against the result of passing the vector ft to the sawtooth function. 
The sawtooth function always produces a wave with a period of 27. It highlights 
the need to read the manual pages carefully before using a function to understand 
how it will behave. 


Engineering application 2.3 


Square waveform 


Periodic functions may be piecewise continuous. Consider the function g(t) defined 
by 


il O<preil od 
g(t) = < period = 


The function g(t) is periodic with period 2. A graph of g(t) is shown in Figure 2.16. 
This function is commonly referred to as a square waveform by engineers. In Fig- 
ure 2.16 the open and closed end-points have been shown for mathematical correct- 
ness. Note, however, that engineers tend to omit these when sketching functions with 
discontinuities and usually they use a vertical line to show the discontinuity. This 
reflects the fact that no practical waveform can ever change its level instantaneously: 
even very fast rising waveforms still have a finite rise time. The function has discon- 
CNG Stati — ed) le eA ee 


g(t) 


= o-.« 1 ea e— 


Figure 2.16 
The function g(t) is both piecewise continuous and periodic. 


The square waveform is often used in electronic engineering, particularly in digital 
electronic systems. One example is the clock signal that is generated to ensure that 
all of the digital electronic circuits switch around the same time and so remain in 
synchronisation. 
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EXERCISES 2.3 
Represent the following intervals on the real line: 
(a) [1, 3] (b) [2,4) 
(c) (0, 3.5) (d) [—2, 0) 
(e) (-1, 1] (ff) 2<x<4 
(g) O<x<2 (h) —3<x< -l 
Gi) O<x <3 


Describe the rule associated with the following 
functions, sketch their graphs and state their domains 
and ranges: 


@f@=27 

(b) f(x) =x? -1 
(c) g(t) = 3t -—4 
@y@) =x 

(e) f(t) = 0.5t +2 
(f) z(x) = 3x —2 

If f(x) = 5x + 4, find 


(a) f(3) 

(b) f(—3) 

(c) f(@) 

(d) fat) 

(e) fa) 

(f) FQ?) 

If g(t) = 51? — 4, find 


(a) g(0) 

(b) g(2) 

(c) g(—3) 
(d) gi) 

(e) g2r—1) 


The reactance, X;, offered by a capacitor is given by 


O<x 


O0<t 


1 
Xo= On where f is the frequency of the applied 


alternating current, and C is the capacitance of the 
capacitor. If C = 10° F, find Xc when f = 50 Hz. 


Classify the functions in Question 2 as one-to-one or 
many-to-one. 


Find the inverse of the following functions: 
(a) f@)=x4+4 

(b) g(t) =3r+1 

(©) ya)=x 


t-—8 
(d) h(t) = 3" 


oj6=— 

e) f(t)= 3 

(f) A(x) =x -1 
(g) kv) =7-v 
(h) m(n) = 3 (1 — 2n) 


Given f(t) = 2t, g(t) =t — landh(f) = t? write 
expressions for 
(a) f(g@)) 
(c) g(h(t)) 
(e) A(gt)) 
(g) FFM) 
(i) A(A(t)) 
(k) g(f(AW)) 


(b) fA@)) 
) sf@) 
(f) ACFO) 
(h) g(g(t)) 
Q) F(g(h@))) 
() h(g(f@)) 


Given f(t) = 7? +1, g(t) = 3t +2 and A(t) = = 
write expressions for ’ 
(a) f(g@) 

(c) g(h(t)) 

(e) f(g(h@))) 


Given f(t) = 2f, g(t) = 2r+ 1, h(t) = 1 — 3f, write 
expressions for the following: 


@f io  e'® 


(b) FAG) 
(d) A(f()) 


(c) h7!(t) 


Given a(x) = 3x — 2, b(x) = = c(x) =1+ : write 
expressions for . * 
@atx) © b'G) @Wc'@) 

Given f(t) = 2t + 3, g(t) = 3t and A(t) = f(g(t)) 
write expressions for 

(a) A(t) 

(o) f'O 

() g'@) 

(d) h-!(t) 

@) s*¢'@) 

What do you notice about (d) and (e)? 


Sketch the following functions: 


tf O<pr<3 
(a) ro={, eed 
—x O<x<l 
(b) w=[? i2g25 
1-t O<t<l 
- a= | j2b<2 
2 O<x<l 
(d) mf l<x<2 
3-x 2<x<3 


14 Sketch 


t 0 
2 


ro={\ 


Is the function piecewise continuous or continuous? 
State, if they exist, the position of any discontinuities. 


15 The function h(f) is defined by 


t 


2-t O0<t<2 
A(t) = 
2-4 2<t<3 


Solutions 


(a) Square the input and then multiply by 2; domain 


(—00, 0), range [0, co) 


(b) Square the input, then subtract 1; domain [0, 00), 


range [—1, co) 

(c) Multiply input by 3 and subtract 4; domain 
[0, 00), range [—4, oo) 

(d) Cube the input; domain (—oo, 00), range 
(—00, 00) 

(e) Multiply input by 0.5 and then add 2; domain 
[—2, 10], range [1, 7] 


(f) Multiply input by 3 and then subtract 2; domain 


[3, 8], range [7, 22] 


(a) 19 (b) —11 (c) 5a +4 
(d) 5x+9 (e) I5Sa+4 (f) 5x°4+4 
(a) —4 (b) 16 (c) 41 
(d) 5x2-—4 (e) 201? —20r+1 

3183 ohms 


(a) many-to-one (b) one-to-one 


(c) one-to-one (d) one-to-one 


(e) one-to-one (f) one-to-one 


(a) f'@)=x-4 
) gg) =f" 
3 


Oy @=x" 

(d) h-'(t) =3t +8 

(ce) f-'() =3r+1 

® A'@=@+p4 
(g) KW) =7-v 


(h) mn) = = 
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and h(t) has period 3. Sketch h(t) on the interval 
[0, 6]. 
16 The function g(t) is defined by 
w 1 0<t<il 
A 
= I~ dered 
and g(t) has period 2. Sketch g(t) on the interval 
[—1, 4]. State any points of discontinuity. 
8 (a) 2(t—1) (b) 272 (c) 2-1 
(d) 2r—1 (e) (t—1)? (f) 412 
(g) 4t (h) t-2 (i) t+ 
(j) 22 — 1) (k) 217-1 (l) (2t—1)2 
il 
9 (a) 974+ 12t+5 (b) 541 
() 242 G 
ce) = Je 
t e+ 
9 12 
t t 
t t-—1 1-t 
10 = b) —— — 
(a) 5 (b) 5 (c) 3 
x+2 2 1 
11 (a) (b) = (c) 
3 x x—1 
12 (a) 6 +3 Cael (c) 
a —— c) = 
3 . 3 : 
t— t— 
d) —— ars 
(d) ra (e) G 
13 See Figure S.1. 
f 
3 &§ 
2 2 oe 
1 1 
(a) 0 123 4 t (b) 0 12 3 x 
a b 
1 2 
1 
(c) 0 I 2 t (d) 0 lL 2 3 x 
Figure S.1 


70 Chapter 2 Engineering functions 


14 Piecewise continuous; discontinuity at t = 2. See 16 Discontinuities at t = 0, 2. See Figure S.4. 
Figure S.2. 


-1 0123 4 tf 


Figure S.2 Figure S.4 


15 See Figure S.3. 


h 
2 

1 

0 3 4 SG t 
Figure S.3 


REVIEW OF SOME COMMON ENGINEERING FUNCTIONS 
AND TECHNIQUES 


This section provides a catalogue of the more common engineering functions. The im- 
portant properties and definitions are included together with some techniques. It is in- 
tended that readers will refer to this section for revision purposes and as the need arises 
throughout the rest of the book. 


2.4.1 Polynomial functions 


A polynomial expression has the form 
A,X" +4, 4X! +a, 9X ++ tax +axt ay 


where n is a non-negative integer, a,, a,_,,...,@,, @ are constants and x is a vari- 
able. A polynomial function, P(x), has the form 


P(x) = a,x" +a, x" | ta, 9X"? +++ Fax +a,x+ aq (2.3) 


Examples of polynomial functions include 


P,(x) = 3x2 —x+2 a) 
P@=Td+2-1 29) 
P,(t) = 3¢ +9 (2.6) 


P,(t) =6 (2.7) 
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where x, z and ¢ are independent variables. It is common practice to contract the term 
polynomial expression to polynomial. By convention, a polynomial is usually written 
with the powers either increasing or decreasing. For example, 


3x +9 —7 +42 
would be written as either 
ae +90 3x42 or 243x490 =r 


The degree of a polynomial or polynomial function is the value of the highest power. 
Equation (2.4) has degree 2, Equation (2.5) has degree 4, Equation (2.6) has degree | and 
Equation (2.7) has degree 0. Equation (2.3) has degree n. Polynomials with low degrees 
have special names (see Table 2.1). 


Table 2.1 

Polynomial Degree Name 

ax* + bx +ex* +dx+e 4 Quartic 
ax? + bx? +ex+d 3 Cubic 
ax? + bx +e 2 Quadratic 
ax+b 1 Linear 

a 0 Constant 


Typical graphs of some polynomial functions are shown in Figure 2.17. 


Degree 2 
P(x) 


Degree 3 


Degree 0 


Figure 2.17 
Some typical polynomials. 


Engineering application 2.4 


Ohm’s law 


Degree | 


Recall from Engineering application 1.1 that the current flowing through a resistor is 
related to the voltage applied across it by Ohm’s law. The equation is 


WW TUR 
where V = voltage across the resistor; 
7 = current through the resistor; 


R = resistance value of the resistor, which is a constant 
for a given temperature. 
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Note that the voltage is a linear polynomial function with J as the independent 
variable. 

This equation is only valid for a finite range of currents. If too much voltage is 
applied to the resistor, then the current flowing through the resistor becomes sufficient 
for the resistor to overheat and breakdown. 


Engineering application 2.5 


A non-ideal voltage source 


An ideal voltage source has zero internal resistance and its output voltage, V, is in- 
dependent of the load applied to it; that is, V remains constant, independent of the 
current it supplies. It is called an ideal voltage source because it is difficult to create 
such a source in practice; it is in effect an abstraction that is useful when develop- 
ing engineering models of real electronic systems. A non-ideal voltage source has 
an internal resistance. Due to this internal resistance, the output voltage from such a 
source is reduced when current is drawn from the source. The voltage reduction in- 
creases as more current is drawn. Figure 2.18 shows a non-ideal voltage source. It is 
modelled as an ideal voltage source in series with an internal resistor with resistance 
R,. The output voltage of the non-ideal voltage source is v, while vp is the voltage 
drop across the internal resistor and i is the load current. Using Kirchhoff’s voltage 


law, 
V=vUg +0, 

and hence by Ohm’s law, 
VY Sak. sr W, 
vo = V — ik, 


Note that V and R, are constants and so the output voltage is a linear polynomial 
function with independent variable i. The equation gives the output voltage across 
the load as a function of the current through the load. The output characteristic for the 
non-ideal voltage source is obtained by varying the load resistor R, and is plotted in 
Figure 2.19. Notice that the output voltage of the non-ideal voltage source decreases 
as the load current increases and is equal in value to the ideal voltage source only 
when there is no load current. 


Vo i 


non-ideal voltage source i 


Figure 2.18 Figure 2.19 
A non-ideal voltage source connected to a load Output characteristic of a non-ideal 
resistor, Ry . voltage source. 
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This is why it is called a non-ideal voltage source. Engineers would prefer to have 
a source that maintained a constant voltage no matter how much current was drawn 
but it is not possible to build such a source. 


Engineering application 2.6 


Wind power turbines 


Wind turbines are an important source of electrical power. The most common type, 
and the ones which are usually found in offshore installations, resemble a desktop fan 
and are called horizontal axis turbines. The wind driving a turbine blade consists 
of many molecules of air, each having a tiny amount of mass. This mass passing the 
blade area each second carries kinetic energy, which is the source of the wind power. 
The wind power, P, can be calculated using the formula 


1 
P= —Mv" 
5) 0) 


where M is the total mass of air per second passing the blade in kg s~! and v is the 
velocity of the air in ms7!. 
The mass per second can be calculated by considering the area swept out by the 


blade, A, the density of the air, 0, and the velocity: 
M = pAv 


This equation can be substituted in the power equation 
1 yl! 3 
Pe 5 (pAv) vv = ia (2.8) 


The available wind power therefore increases with the cube of the velocity. Note that 
the power is a cubic polynomial function of the independent variable, v. 

At 20°C the air density is approximately 1.204 kg m~3. Consider the case of an 
offshore turbine that has a swept area of 6362 m? and a rated wind speed of 15 ms7!. 
The maximum theoretical power at the rated speed is therefore 


1 1 
P= 5 PAu =5* 1.204 x 6362 x 15° = 12.93 MW 


The actual rated power of the device is approximately 3 MW because other physical 
processes and losses have to be accounted for, yet Equation (2.8) remains one of the 
most fundamental in the study of wind power. 


Many excellent computer software packages exist for plotting graphs and these, as well 
as graphics calculators, may be used to solve polynomial equations. The real roots of 
the equation P(x) = 0 are given by the values of the intercepts of the function y = P(x) 
and the x axis, because on the x axis y is zero. 

Figure 2.20 shows a graph of y = P(x). The graph intersects the x axis atx = x,, 
x = x, and x = x,, and so the equation P(x) = 0 has real roots x,, x, and x,, that is 
P(x,) = P(x) = P(x;) = 0. 
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1 


P(x) ih 


EXERCISES 2.4.1 


State the degree of the following polynomial 
expressions: 


(a) 2 +272 —8 +4 13z 
(b) 2? —5P 4+2-8f 


Solutions 


y@ 3 (b)5 ©@4 @d2 (2 


Technical Computing Exercises 2.4.1 


Use a technical computing language such as MATLAB® to 
do the following exercises. 


1 


2 


(a) Plot y =x? and y = 4 — 2x in the interval 
[-3, 3]. Be aware that in MATLAB®, to carry out 
operations on individual elements of a vector, 
special notation is used. For example to multiply 
each element in vector a by the corresponding 
element in vector b (having the same dimension 
as vector a) you would type a. *b. Other 
functions such as raising to a power also require a 
dot prefix if they are to be carried out on each 
individual element, rather than the whole matrix. 
Note the x coordinate of the point of intersection. 


(b) Draw y = x3 + 2x — 4. Note the coordinate of 
the point where the curve cuts the x axis. 
Compare your answer with that from (a). Explain 
your findings. 


Plot the following functions: 


(a) y=3r—x* 42x41 —2<x<2 
x? 5x2 

(b) y=+5 ji +x-1 3<x<2 

(ec) y=ex—-x? 42 -2<x<2 


(f) 3 


Figure 2.20 
A polynomial function which cuts 
the x axis at points x,, x, and x3. 


x3 


(c) 3w—5w? + 12w* 
(d) 7x — x2 

(e) 3(2r2 — 9+ 1) 

(f) 2z(2z+ 1)(2z— 1) 


Hence estimate the real roots of 


0= 3x —x? 42x41 2<x<2 
3 2 
x 5x 
Gag) 4 = age 
3 2 
—3-<¥<2 
O=r-xr42 2<xK<2 


Use the roots function in MATLAB® or equivalent 
to calculate a more accurate value for the real roots 
estimated in question 2 and confirm your answers are 
correct. 


(a) Draw y = 2x? and y = x° + 6 using the same 
axes. Use your graphs to find approximate 
solutions to x3 — 2x2 +6 =0. 

(b) Add the line y = —3x + 5 to your graph. State 
approximate solutions to 
(i)  +3x+1=0 
(ii) 2x? + 3x -—5 =0 
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2.4.2 Rational functions 


A rational function, R(x), has the form 


12 
Reve 
Q(x) 
where P and Q are polynomial functions; P is the numerator and Q is the 
denominator. 
The functions 
x+6 p-1 22 el 
1@) e+1 2) 2t +3 ae) 2437-2 


are all rational. When sketching the graph of a rational function, y = f(x), it is usual to 
draw up a table of x and y values. Indeed this has been common practice when sketching 
any graph although the use of graphics calculators is now replacing this custom. It is still 
useful to answer questions such as: 


“How does the function behave as x becomes large positively?’ 
“How does the function behave as x becomes large negatively?’ 
“What is the value of the function when x = 0?’ 
“At what values of x is the denominator zero?’ 

1+ 2x 1 


Figure 2.21 shows a graph of the function y = = — +2. As x increases, the 
x x 


value of y approaches 2. We write this as 
yo>2 as x—> 00 


and say ‘y tends to 2 as x tends to infinity’. Also from Figure 2.21, we see that 


y > +00 as x—>0 


As x — 00, the graph gets nearer and nearer to the straight line y = 2. We say that y = 2 
is an asymptote of the graph. Similarly, x = 0, that is the y axis, is an asymptote since 
the graph approaches the line x = 0 as x —> 0. 

If the graph of any function gets closer and closer to a straight line then that line is 
called an asymptote. Figure 2.22 illustrates some rational functions with their asymptotes 
indicated by dashed lines. In Figure 2.22(a) the asymptotes are the horizontal line y = 3 


Figure 2.21 
1+2x 1 


The function: y = =-—+2. 
x x 
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(a) 


Example 2.10 


Solution 


(b) 


Figure 2.22 
Some examples of functions with their asymptotes: 
3x+1 1 x— 44x42 1 
=3+—;b)y= (ly = ar 
x x+2 x+1 x+ 1 


(ay= 


and the y axis, that is x = 0. In Figure 2.22(b) the asymptotes are the horizontal line 
y = | and the vertical line x = —2; in Figure 2.22(c) they are y = x + 3 and the vertical 
line x = —1. The asymptote y = x + 3, being neither horizontal nor vertical, is called an 
oblique asymptote. Oblique asymptotes occur only when the degree of the numerator 
exceeds the degree of the denominator by one. 

We see that the vertical asymptotes occur at values of x which make the denominator 
zero. These values are particularly important to engineers and are known as the poles 
of the function. The function shown in Figure 2.22(a) has a pole at x = 0; the function 
shown in Figure 2.22(b) has a pole at x = —2; and the function shown in Figure 2.22(c) 
has a pole at x = —1. 


If the graph of a function approaches a straight line, the line is known as an asymp- 
tote. Asymptotes may be horizontal, vertical or oblique. 


Values of the independent variable where the denominator is zero are called poles 
of the function. 


x 


Sketch the rational function y = —————_., 
wr+tx—-2 


For large values of x, the x* term in the denominator has a much greater value than the 
x in the numerator. Hence, 

y> 0 as x—> oO 

y>0 as x —> —00 


Therefore the x axis, that is y = 0, is an asymptote. Writing y as 
i 


o—@=DeF2) 
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we see the function has poles at x = | and x = —2; that is, there are vertical asymptotes 
at x = 1 and x = —2. Substitution into the function of a number of values of x allows a 
table to be drawn up: 


x -=3 2.5 2.1 1.9 1.5 1 0 0.5 0:9° 11 15 2 3 
y —0.75 —1.43 —-6.77 655 1.20 0.50 0 —0.40 —3.10 3.55 0.86 0.50 0.30 


The graph of the function can then be sketched as shown in Figure 2.23. 


Figure 2.23 
The function: y = 


wrtx—-2 


Engineering application 2.7 


Equivalent resistance 
Recall from Engineering application 1.2 that the formula for the equivalent resistance 
of two resistors in parallel is given by: 
1 1 1 
Re OR, 4 R, 
Consider a circuit consisting of two resistors in parallel as shown in Figure 2.24. 


One has a known resistance of 1 Q and the other has a variable resistance, R Q. The 
equivalent resistance, R,, Q, satisfies 


1 ee ks 


fom le oe 
Hence, 

Pimp tee 

ea 


Thus the equivalent resistance is a rational function of R, with domain R > 0. 
The graph of this function is shown in Figure 2.25. When R = O we note 
that R, = 0, corresponding to a short circuit. As the value of R increases, that 
is R + ov, the equivalent resistance R,, approaches | so that R,, = | is an asymptote. 
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R 10 Rr 
Figure 2.24 Figure 2.25 
Two resistors in parallel. The equivalent resistance, Rp, 
increases as R increases. 
EXERCISES 2.4.2 
1 State the poles of the followi tional functions: = _ 
ate the poles of the following rational functions (d) Gt) = a (e) H(s) = = 
x+3 2x+1 
(a) ya) = —> (b) yx) = — 

x—-2 x+7 (f) yx) = 5 
202" ean ie 
ares: a ee f+2 

(2) w(f) == 
(e) H(s) : es 
e s) = —~———_ 
s?+6s+5 e+t+1 
h) P(t) = =~ 
(f) je @) PO) 12 +6¢t+9 
s2 +35 —18 . 
@) T@)= 
ST a 2x-1 
6+r 
2t — 6 @) 00) ==" 
h t) = —=————_ Dn pics 
(h) p® 12 +10t + 25 f r 12, 
2 Describe the horizontal asymptote of each of the & Describe the oblique asymptote of each of the 
following functions: following functions: 
1 2 1 
(a) yx) =6+— — (b) A(t) = 5-1 (a) yo) =x +34 — 
2 6+ 
c =3-— d) v(t) = — 3 
©) y@) 5 ee, (b) y(x) = 2x14 
7 2u—5 ae 
(e) rv) = a aes . a 1 
c x)==- 

?4+2t+1 204 0 2x+7 

() a) =—S— : 
d =3x-—1+ 

10 — 2s — 352 i) Wal Se +2 

(g) m(s) = —S5 
x+2 


3. Describe the vertical asymptotes of each of the ©) yl ae- ts x 


following functions: 


_ 35 
3x+1 (b) sian 3 (f) y(x) = 3x oe 
x—2 4a+4 

9 


3 
2 ae ara es (@) y@) = 4-204 


(a) ya) = 


5 Show that 


2% 


I(x) = 
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23 


3 


9 9(3x + 2) 


can be expressed in the equivalent form 


2x —x+1 
3x+2 


Sketch the rational function /(x) and state any 


asymptotes. 


6 Show that 


1 


1 


1 
P(x) =2x+—+ 


Solutions 
(a) 2 
(d) —2,2 
(g) —1,0,1 
(a) y=6 
(c) y=3 
ae 
(e) r= 3 
3 
(g) m= 3 
(a) x=2 
(c) s=—2,1 
(e) s=—l,1 
(g) f= —3,2 
(i) x=0.5 


9 
DD ea 
(b) —7 (c) -; 
(e) —5,-1 (f) —6,3 
(h) —5 
(b) h=-1 
(d) v=1 
(f) a=1 
(b) t=—-1 
idgt=<=<1)1 
@ <=<11 
(h) t= —3 
(j) r=—3,4 


Technical Computing Exercises 2.4.2 


Use a technical computing language to plot the 


following rational functions. State any asymptotes. 


(2x +1) 


(a) fx) = G3) —-4<x<4 
Ss 

(b) Oo) = Fan —3<s<3 
Zz 

(c) ea —3<z<3 


can be written in the equivalent form 
Ax? + Tx +6 
2x +3 
Sketch the rational function p(x) and state any 
asymptotes. 


Show that the function 


Tx 
yo) =x+ a 
can be expressed in the equivalent form 
+247 
243° 
Sketch the rational function y(x) and state any 
asymptotes. 


(a) y=x4+3 (b) y=2x-1 
x 3 
co a aa (d) y=3x-1 
(e) y=2x-1 (f) y= 3x-2 
(g) y=4—2x 
2 2x 7 
x=--, =—- 
3 3. 9 
Sao pH? ae 
LS 2 pH 2% 3 
y=x 
3g e525 
iia. = 42ers 
(x— 1) — 2) 


Plot the functions given in Question 4 in 
Exercises 2.4.2 for —10 < x < 10. 
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2.4.3 Exponential functions 


An exponent is another name for a power or index. Expressions involving exponents are 
called exponential expressions, for example 3*, a”, and m”. In the exponential expres- 
sion a*, ais called the base; x is the exponent. Exponential expressions can be simplified 
and manipulated using the laws of indices. These laws are summarized here. 


Ke il 
aa" = ant" a = a a = 1 a” ee. (Gey = an 
ae ge 
Example 2.11 Simplify 
3x 2x y)\2 —6z 
(a) — ) @d-a+a®e © @ @ i 
a* 2a” a7 
(2a?")7a?" ava’ 3a) at 
(e) 3g (f) a (g) oe 
3x: 2% 5x 
Solution (a) * < = a 
a x a x 


(b) a(1 = a) +a* = a _ a +a" = at 


( ay a @ 


O oo 7 ie = 

(2a7)2a" Aaa 4a a!” 
©) 3a = Gq = 3g = 3 
(f) oe = gta 2p 


Exponential functions 


An exponential function, f(x), has the form 
f@)=a 


where a is a positive constant called the base. 


Some typical exponential functions are tabulated in Table 2.2 and are shown in Fig- 
ure 2.26. Note from the graphs that these are one-to-one functions. 

An exponential function is not a polynomial function. The powers of a polynomial 
function are constants; the power of an exponential function, that is the exponent, is the 
variable x. 
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Table 2.2 
Values of a* for a = 0.5, 2 and 3. 


x 0.5* 2* oP 
—3 8 0.125 0.037 
—2 4 0.25 0.111 
-1 2 0.5 0.333 
0 1 1 1 
1 0.5 2 3 
2 0.25 4 9 


Some typical exponential functions. 


The most widely used exponential function, commonly called the exponential func- 
tion, is 


fa) =e 


where e is an irrational constant (e = 2.718281 828...) commonly called the 
exponential constant. 


Most scientific calculators have values of e* available. The function is tabulated 
in Table 2.3. The graph is shown in Figure 2.27. This particular exponential function 
so dominates engineering applications that whenever an engineer refers to the expo- 
nential function it almost invariably means this one. We will see later why it is so 
important. 

As x increases positively, e* increases very rapidly; that is, as x — 00, e* — oo. This 
situation is known as exponential growth. As x increases negatively, e* approaches zero; 
that is, as x — —oo, e* > 0. Thus y = 0 is an asymptote. Note that the exponential 
function is never negative. 

Figure 2.28 shows a graph of e~. As x increases positively, e~* decreases to zero; that 
is, asx — 00, e* — 0. This is known as exponential decay. The function is tabulated 
in Table 2.4. 


Table 2.3 

The values of the exponential 
function f(x) = e* for 
various values of x. 


x e* S 
3 0.050 
5 0.135 : 
-1 0.368 
0 1 
1 2.718 Bi 
2 7.389 : 
3 20.086 Figure 2.27 


Graph of y = e* showing exponential growth. 
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Table 2.4 
The values of the exponential 
function f(x) = e~* for 


various values of x. -_ 

x e* 

3 20.086 

—2 7.389 I 

—1 2.718 
0 1 a 
1 0.368 x 
2 0.135 : 
3 0.050 Figure 2.28 


Graph of y = e~* showing exponential decay. 


Engineering application 2.8 


Discharge of a capacitor 


The capacitor is another of the three fundamental electronic components. It is a 
device that is used to store electrical charge. It consists of two parallel conducting 
plates separated by an insulating material, known as a dielectric. A build up of a net 
positive charge on one plate and a net negative charge on the other plate creates an 
electric field across the dielectric, allowing electrical energy to be stored that has the 
potential to do useful work. The symbol for a capacitor is two parallel lines that are 
perpendicular to the conductors in the circuit. 

Consider the circuit of Figure 2.29. Before the switch is closed, the capacitor has 
a voltage V across it. Suppose the switch is closed at time t = 0. A current then flows 
in the circuit and the voltage, v, across the capacitor decays with time. The voltage 
across the capacitor is given by 


V t<0O 
bee Ve (RC) t > 0 


The quantity RC is known as the time constant of the circuit and is usually denoted 


by t. So 
4 faa) 
— 
Ve" t>0 
v 
T larger 
V 
G R 
t 
Figure 2.30 
Figure 2.29 The capacitor takes longer to discharge for a 
Circuit to discharge a capacitor. larger circuit time constant, T. 


If t is small, then the capacitor voltage decays more quickly than if t is large. 
This is illustrated in Figure 2.30. 
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Engineering application 2.9 


The diode equation 


A diode is an electronic device that allows current to flow with ease in one direction 
and with difficulty in the other. In plumbing terms it is the equivalent of a non-return 
valve. It is constructed by sandwiching two different types of semiconductor material 
together, known as p-type and n-type semiconductors. The resultant construction is 
referred to as a p-n junction. These different types of semiconductors are created 
by a procedure known as doping. The technical details are complex but it is one 
of the most fundamental processes underpinning the electronics revolution that has 
transformed society in recent decades. 
A semiconductor diode can be modelled by the equation 


1=1,(e# -1) 


where V = applied voltage (V); 
I =diode current (A); 
I, = reverse saturation current (A); 
eke 10m Ke 
g=1.60 x 10°” C; 
T = temperature (K); 


n= ideality factor. 


This equation relates the current through the diode to the voltage across it. The ide- 
ality factor, n, typically ranges between | and 2 depending on how the diode is man- 
ufactured and the type of semiconductor material used. We will consider the case 
n = 1, which corresponds to that of an ideal diode. At room temperature g/(kT) ~ 40 
and so the equation can be written as 


T=1(e" -1) 


Figure 2.31 shows a graph of / against V. Notice that for negative values of V, the 
equation may be approximated by 


since e*°” ~ Q. The diode is said to be reverse saturated in this case. In reality, 


I, is usually quite small for a practical device, although its size has been exagger- 
ated in Figure 2.31. This model does not cater for the breakdown of the diode. 
According to the model it would be possible to apply a very large reverse voltage 
to a diode and yet only a small saturation current would flow. This illustrates an 
important point that no mathematical model covers every facet of the physical device 
or system it is modelling. A different model would be needed to deal with breakdown 
characteristics. 
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‘1 Simplify (a) R=19,C=1pF 
or 3 (b) R=10Q,C=1 uF 
e*(et + e*) ; e-3en! (d) R=56 Q, C=0.1 uF 
(©) e2t k¢) e6e—2 Calculate the time constant, tT, in each case. 
©) (e2)3(e")4 Bl Sketch the following functions, using the same axes: 
10r 
e yae* yael? yoo 
(2 Consider the RC circuit of Figure 2.29. Given an _ fie: ewes 
initial capacitor voltage of 10 V plot the variation in 
capacitor voltage with time, using the same axes, for 4 Sketch a graph of the function y = 1 — e~ for 
the following pairs of component values: x>0. 
a —6 —5 —6 
Heat wr ote 2 @ 10 (b) 10 (c) 3.3 x 10 
(d) 5.6 x 10-6 


(d) el4 (e) et 


ts Plot y = AX for k = —3, —2, -1, 0, 1, 2, 3, for a Plot y=5—x? andy = * for —3 < x < 3. For which 
—3<x<3. values of x is eX < 5 —x?? 
(2 Plot y = ke* for k = —3, -2, -1, 0, 1, 2, 3, for “G Plot y = x4 and y = e* for —1 <x < 9. For which 


—3<x<3. values of x is (a) e* < x*, (b) e* > x4? 


2.4.4 


Example 2.12 


Solution 


Example 2.13 
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Logarithm functions 


Logarithms 
The equation 16 = 2* may be expressed in an alternative form using logarithms. In 
logarithmic form we write 
log, 16 =4 
and say ‘log to the base 2 of 16 equals 4’. Hence logarithms are nothing other than 
powers. The logarithmic form is illustrated by more examples: 
i5=5° so lop. 1253 
64 = 8° so log, 64 = 2 
16=4° so log, 16=2 
1000 = 10° so log,, 1000 = 3 


In general, 
ifc =a’, thenb=log,c 
In practice, most logarithms use base 10 or base e. Logarithms using base e are called 
natural logarithms. Log,,x and log, x are usually abbreviated to log x and In x, respec- 
tively. Most scientific calculators have both logs to base 10 and logs to base e as pre- 
programmed functions, usually denoted as log and In, respectively. Some calculations 
in communications engineering use base 2. Your calculator will probably not calculate 
base 2 logarithms directly. We shall see how to overcome this shortly. 
Focusing on base 10 we see that 


if y= 10° then x= logy 
Equivalently, 

if x = logy then y= 10° 
Using base e we see that 

ify=e then x =Iny 
Equivalently, 

ifx =Iny then y=" 


Solve the equations 


(a) 16 = 10° (b) 30=e (c) logx = 1.5 (d) Inx = 0.75 
(a) 16 = 10° (b) 30=e 
log 16 =x In30=x 
x = 1.204 x = 3.401 
(c) logx = 1.5 (d) Inx = 0.75 
x= 10!5 x= 0-75 
= 31.623 = 2.117 


Solve the equations 
(a) 50 = 9(10*) 
(c) log’? — 1) =2 


(b) 3e-2+) — 10 
(d) 3In(4x +7) = 12 
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Solution 


Example 2.14 


Solution 


(a) 50=9(10*) (b) 3e°?+D = 10 
10 = ou et) — 10 
9 | 10 
a OE al eter = ls 
1 50 10 
x = ~log — = 0.372 2x = —In——-I1 
2 9 3 
2x = —2.204 
x = —1.102 
(c) log@’? —1) =2 (d) 3In(44x+ 7) = 12 
x —1=10? = 100 4x+7=e' 
= 101 4x = et —7 
4 
= : -—7 
x = +10.050 os e ~ 11.900 


Logarithmic expressions can be manipulated using the laws of logarithms. These laws 
are identical for any base, but it is essential when applying the laws that bases are not 
mixed. 


log, A + log, B = log, (AB) 
A 
log, A — log, B = log, (5) 


nlog,A = log, (A”) 
los,a= Il 


We sometimes need to change from one base to another. This can be achieved using the 
following rule. 


In particular, 
log) X _ log, X 


log, X = = 
OF  Jog.y2 0.3010 


Simplify 
(a) logx + log x? 


(b) 3logx + log x? 


(c) 5Inx+ in(~) 
x 


(d) log(xy) + logx — 2logy 


4\ 1 
(e) In(2x°) — in( ) + 5 In27 


2 
(a) Using the laws of logarithms we find 
logx + logx* = log x* 
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(b) 3logx + logx? = logx* + log x” = logx° 


1 P 1 x” 4 
(c) 5Inx+ Inf — ) = Inx + In{ — } = In{ — }) =Inx 
x x x 


(d) logxy + logx — 2logy =log xy + log x — logy” 


-| XX = x 
SN a, poe ay 
¥ y 


4\  1n27 2x3 
(e) In(2x3) in( :) oor =1n( =) +1n27'3 
Xx Xx 


3 


Example 2.15 Find log, 14. 


Solution Using the formula for change of base we have 


log) 14 1.146 


= ——— = 3.807 
log;92 (0.301 


log, 14 = 


Engineering application 2.10 


Signal ratios and decibels 


The ratio between two signal levels is often of interest to engineers. For example, the 
output and input signals of an electronic system can be compared to see if the system 
has increased the level of a signal. A common case is an amplifier, where the output 
signal is usually much larger than the input signal. This signal ratio, known as the 
power gain, is often expressed in decibels (dB) given by 


power gain (dB) = Lotoe( 7°) 
i 
where P, is the power of the output signal and P, is the power of the input signal. The 
term gain is used because if P, > P,, then the logarithm function is positive, corre- 
sponding to an increase in power. If P, < P, then the gain is negative, corresponding 
to a decrease in power. In this situation the signal is said to be attenuated. 

The advantage of using decibels as a measure of gain is that if several electronic 
systems are connected together then it is possible to obtain the overall system gain 
in decibels by adding together the individual system gains. We will show this for 
three systems connected together, but the development is easily generalized to more 
systems. Let the power input to the first system be P.,, and the power output from 
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the third system be P,,. Suppose the three are connected so a the power output 
from system 1, P.,, is used as input to system 2, that is P, = P.,. The power output 
from system 2, P,,, is then used as input to system 3, that is P, = P.,,. We wish to 
find the overall power gain, 10 log(P,,/P,,). Now 


eer 
Pr Ps Po Pa 
because P,, = P,, and P., = P.,. Therefore, 


Ps Pos Poo Pei 
LOikog|| = |] = IOios, === — 
Py Px Po Pi 


that is 


P.3 IPs 
10 log va Solos = sP 101og( 22 + 10log{ — 
il Ps Po le 


using the laws of logarithms. 

It follows that the overall power gain is equal to the sum of the individual power 
gains. Often engineers are more interested in voltage gain rather than power gain. 
The power of a signal is proportional to the square of its voltage. We define voltage 
gain (dB) by 


; MG Ve 
voltage gain (dB) = 10 log y)a 20 log V 


i 


Engineering application 2.11 


The use of dBm in radio frequency engineering 


In the previous engineering application it was shown how the decibel can be used to 
express a ratio of the power of two signal levels P, and P.. It is possible to specify 
a fixed value for the input signal P,. This is termed a reference level. When this is 
done the decibel becomes an absolute quantity. The notation is normally changed 
slightly to indicate the assumed reference level. For example, dBm is used as an 
absolute measure of power in the field of radio frequency (RF) engineering. A mobile 
telephone handset, a microwave oven and a radar transmitter on an airfield are all 
devices that might have their output power quoted in dBm. The definition of power 
gain measured in dBm is as follows: 


IP. 
power gain (dBm) = 10 log ( Te ) 


Here the reference level chosen is 1 mW or 107? W. 
If a device is quoted as having an output power of 15 dBm we can convert this 
into a power value in watts as follows: 


lige 
15 (dBm) = 10 log (2 :) 


Dividing both sides by 10 


Ir 
1.5 = log ( es) 


2.4 Review of some common engineering functions and techniques 89 


And so 


10!> = FE 
10-3 


Therefore the actual power is 


P10? 105 = 105? = 0.0316 = 32 mw. 


This is the typical amount of RF power that might be transmitted by a laptop computer 
with WiFi capability. 


Engineering application 2.12 


Attenuation in a step-index optical fibre 


Fibre optical cables are used to guide high-bandwidth light signals generated by 
lasers. One design is the step-index optical fibre. This consists of a glass core of 
silicon dioxide with a high refractive index, obtained by doping the glass with the el- 
ement germanium. A surrounding sheath of lower refractive index glass ensures that 
nearly all of the light remains within the core. Transmission occurs by total internal 
reflection at the interface between the two types of glass. 

The losses that do occur in the optical fibre can be described by the following 
equation: 


Kg) =e 


where the intensity of the light /(z) is a function of the distance along the fibre, z, 
from the light source. The intensity of the light source at the insertion point is J) and 
a is an attenuation factor. 

Note that @ varies with the wavelength of the light and so the rate of attenuation 
depends on the colour of the light. It is possible to obtain plots for the variation of the 
attenuation factor with wavelength from the manufacturers. The attenuation factor is 
often expressed in units of dB km7!. A typical value for a cable operating at a light 
wavelength of 1550 nm is 0.3 dB km~!. 

An alternative measure for gain/attenuation is that of the neper (Np). Like the 
decibel this is also a dimensionless ratio that has a logarithmic form. Whereas the 
decibel is defined in terms of base 10 logarithms, the neper is defined in terms of 
natural logarithms, that is base e logarithms. The gain measured in Np is defined by 
the following expression: 


voltage gain (Np) = In Z =InV, — InV, 
where V, is the signal value in volts after gain/attenuation and V, is the reference 
signal value in volts. 
It is possible to derive a conversion factor between Np and dB by considering 
the case when V, is a factor of 10 greater than V,. Using the neper measurement this 
corresponds to 


: 10V, 
voltage gain (Np) = In ys In 10 


i 
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Using the dB measurement for voltage gain (see Engineering application 2.10) 


oO 


this corresponds to voltage gain (dB) = 20log V 


— which equals 20log 10. So, 
In 10 (Np) = 20 log 10 (dB). Thus, 


20 log 10 
1 Np = —~2— dB ~ 8.685 89 dB 
In 10 
Using the value quoted earlier of an attenuation factor of 0.3 dB km™|, this corre- 
sponds to 


0.3 + 8.685 89 Np km7! = 0.034539 Np km! 


1 


It is more usual to quote values of Np m~ and so this value becomes 3.4539 x 


10-> Np m“!. 


Engineering application 2.13 


Reference levels 


We saw in Engineering application 2.11 that the suffix ‘m’ is used in dBm to indi- 
cate the provision of a specific reference level. Alternative suffixes are used to denote 
other reference levels and quantities, which do not necessarily have to be related to 
electrical power. For example, when measuring sound pressure, P, in air the conven- 
tional reference level for sound pressure is 20 Pa r.m.s. This is chosen to correspond 
to the approximate threshold of human hearing for a | kHz sinusoidal signal. The unit 
for sound pressure is therefore quoted with reference to an input pressure of 20 Pa. 
This is commonly written as dB re 20 uPar.m.s. or using the shorthand dB SPL (dB, 
sound pressure level). In other words we have 


[P 

sound pressure level (dB SPL) = 20 log (x Sane ;) 

As aconsequence of the choice of the human hearing threshold as the reference level, 
a negative value of dB SPL corresponds to a sound that is too quiet to be heard by the 
average person; 0 dB SPL is a sound that can just be heard and anything above this is 
fully audible. An office might have an ambient (background) level of 30 dB SPL and 
a person talking to you at the next desk might produce 60 dB SPL, both quantities 
being measured at your hearing position. 


Logarithm functions 


The logarithm functions are defined by 
f(x) = log, x x>0 
where a is a positive constant called the base. 
In particular the logarithm functions f(x) = logx and f(x) = Inx are shown in Fig- 


ure 2.32 and some values are given in Table 2.5. The domain of both of these functions is 
(0, oo) and their ranges are (—0o, 00). We observe from the graphs that these functions 
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Table 2.5 
Some values for logarithm 
functions log x and In x. 


x logx Inx 
0.1 -1 —2.303 
0.5 —0.301 —0.693 
1 0 0 
2 0.301 0.693 
2) 0.699 1.609 
10 1 2.303 
50 1.699 3.912 Figure 2.32 
Graphs of Inx and log x. 


are one-to-one. It is important to stress that the logarithm functions, log, x, are only 
defined for positive values of x. The following properties should be noted: 


logx — oo logx + —oo 

as xX > 0O asx > 0 
Inx > oo Inx — —oo 
log! =Inl =0 log lO=1 Ine=1 


Connection between exponential and logarithm functions 


The exponential function, f(x) = a*, is a one-to-one function and so an inverse function, 
f-'(), exists. Recall 


fF) =x 
So 
fl @)=x 
Now 
log (a) =xlog,a using laws of logarithms 
=% since log,a = | 


Hence the inverse of f(x) = a’ is f~'(x) = log, x. By similar analysis the inverse of 
f(x) = log, x is fl@=a. 


The inverse of the exponential function, f(x) = a‘, is the logarithm function, that 
ise (Noe ue 

The inverse of the logarithm function, f(x) = log, x, is the exponential function, 
thatis f'@) =a. 


In particular: 


If f(x) =e’, then f—!(x) = Inx. 
If f(x) = Inx, then f-!(x) = e*. 
If f(x) = 10", then f-!(x) = logx. 
If f(x) = logx, then f-'(x) = 10°. 
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Example 2.16 


Solution 


log y 

x y 

- 6 
1 1 
2 64 - 
3 729 4 
4 4096 3 
5 15 625 5 
6 46 656 
7 117649 1 
8 262 144 0 
9 531441 12 3 4 5678 9 10 x 
0 oe Figure 2.33 


The function y = x® plotted on a log-linear graph. 


Use of log-log and log-linear scales 
Suppose we wish to plot 


y(x) = x° 1<4< 10 


This may appear a straightforward exercise but consider the variation in the x and y 
values. As x varies from 1 to 10, then y varies from 1 to 1 000000, as tabulated above. 
Several of these points would not be discernible on a graph and so information would be 
lost. This can be overcome by using a log scale which accommodates the large variation 
in y. Thus log y is plotted against x, rather than y against x. Note that in this example 


logy = logx® = 6 log x 


so as x varies from | to 10, log y varies from 0 to 6. A plot in which one scale is loga- 
rithmic and the other is linear is known as a log-linear graph. Figure 2.33 shows log y 
plotted against x. In effect, use of the log scale has compressed a large variation into one 
which is much smaller and easier to observe. 


Consider y = 7* for —3 < x < 3. Plot a log—linear graph of this function. 


We have 
f= 2 
and so 
log y = log(7*) = xlog7 = 0.8451x 


Putting Y = log y we have Y = 0.8451x which is the equation of a straight line passing 
through the origin with gradient log 7. Hence when log y is plotted against x a straight 
line graph is produced. This is shown in Figure 2.34. Note that by taking logs, the range 
on the vertical axis has been greatly reduced. 


A plot in which both scales are logarithmic is known as a log-log plot. Here log y is 
plotted against log x. 
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Ys 


x y Y = logy 
—3 0.003 —2.54 
—2 0.020 —1.69 
—-1 0.143 —0.85 
0 1 0 
1 7 0.85 A 
2 49 1.69 Figure 2.34 
3 343 2.54 A log-linear plot of y = 7* produces a 


straight line graph. 


Example 2.17 Consider y = x’ for 1 < x < 10. Plot a log-log graph of this function. 


Solution We have 
y=x’ 
and so 
logy = log(x’) = 7logx 


We plot log y against log x in Figure 2.35 for a log—log plot. Putting Y = logy and 
X = logx we have Y = 7X which is a straight line through the origin with gradient 7. 


x y X = logx Y = logy 

1 1 0 0 

2 128 0.301 2.107 

3 2187 0.477 3.340 

4 16 384 0.602 4.214 

5 78 125 0.699 4.893 

6 279 936 0.778 5.447 

7 823543 0.845 5.916 

8 2097 152 0.903 6.322 

9 4.782.969 0.954 6.680 
10 10000000 1 a Fagure 2:89 


A log-log plot of y = x’ 
produces a straight line graph. 


Examples 2.16 and 2.17 illustrate the following general points: 


A log-linear plot of y = a* produces a straight line with a gradient of log a. 
A log-log plot of y = x” produces a straight line with a gradient of n. 
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96» +————_—_—_————\— _ Second cycle 


First cycle 


Figure 2.36 
Two-cycle log-linear graph paper. 


Use of log-linear and log-log graph paper 

The requirement to take logarithms is a tedious process which can be avoided by using 
special graph papers called log-linear graph paper and log-log graph paper. An example 
of log-linear graph paper is shown in Figure 2.36. 

Note that on one axis the scale is uniform; this is the linear scale. On the other, the 
scale is not uniform and is marked in cycles from | to 9. This is the logarithmic scale. On 
this scale values of y are plotted directly, without first taking logarithms. On the graph 
paper shown in Figure 2.36 there are two cycles but papers are also available with three 


Example 2.18 


Solution 
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or more cycles. To decide which sort of graph paper is appropriate it is necessary to 
examine the variation in size of the variable to be plotted measured in powers of 10. If, 
for example, y varies from | to 10, then paper with one cycle is appropriate. If y varies 
from 1 to 107, two-cycle paper is necessary. If y varies from 107! to 10*, then paper 
with 4 — (—1) = 5 cycles would be appropriate. To see how log-linear paper is used in 
practice, consider the following example. 


During an experiment the following pairs of data values were recorded: 


A B Cc D 
x 0 1 5 12 
y 4.00 5.20 14.85 93.19 


It is believed that y and x are related by the equation y = ab*. By plotting a log-linear 
graph verify the relationship is of this form and determine a and b. 


If the relationship is given by y = ab‘, then taking logarithms yields 
logy = loga+xlogb 


So, plotting log y against x should produce a straight line graph with gradient log b and 
vertical intercept log a. The need to find log y is eliminated by plotting the y values di- 
rectly on a logarithmic scale. Examining the table of data we see that y varies from 
approximately 10° to 10? so that two-cycle paper is appropriate. Values of y between 1 
and 10 are plotted on the first cycle, and those between 10 and 100 are plotted on the 
second. The points are plotted in Figure 2.37. Note in particular that in this example the 
‘1’ at the start of the second cycle represents the value 10, the ‘2’ represents the value 20 
and so on. From the graph, the straight line relationship between log y and x is evident. 
It is therefore reasonable to assume that the relationship between y and x is of the form 
y=ab. 

To find the gradient of the graph we can choose any two points on the line, for example 
C and B. The gradient is then 


log 14.85 —log5.20 _ log(14.85/5.20) 
5-1 7 4 


= 0.1139 


Recall that log b is the gradient of the line and so 


log b = 0.1139, that is b = 10°''? = 1.2999 


The vertical intercept is log a. From the graph the vertical intercept is log 4 so that 


loga = log4, that isa = 4 


We conclude that the relationship between y and x is given by y = 4(1.3)’. 
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Figure 2.37 
The log-linear graph is a straight line. 


Engineering application 2.14 


Bode plot of a linear circuit 


It is possible to find out a great deal about an electronic circuit by measuring how 
it responds to a sinusoidal input signal. We will examine the sinusoidal functions in 


1 
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against frequency. 


linear— log graph. 


100000 
AG) i 


1+j= 
TS 


complex numbers. 


f=1:1:10000; 


Section 3.4. The usual procedure is to apply a range of fixed-amplitude sinusoidal 
signals with different frequencies in order to obtain information about the circuit or 
system. In Section 23.9 we will see that if the circuit is linear then, after it has settled 
down, the output signal is also a sinusoidal signal of the same frequency but with a 
different amplitude and phase (see Section 3.7 for details of these terms). 

A Bode plot consists of two components: 


(1) The ratio of the amplitudes of the output signal and the input signal is plotted 


(2) The phase shift between the input and output signals is plotted against frequency. 


A log scale is used for the frequency in order to compress its length; for example, 
a typical frequency range is 0.1 to 10° Hz which corresponds to a range of —1 to 6 
on a log scale. A log scale is also used for the ratio of the signal powers as this is 
calculated in decibels. Phase shift is plotted on a linear scale. So the signal amplitude 
ratio versus frequency is a log—log graph and the phase shift versus frequency is a 


An operational amplifier is an example of a linear circuit. In a technical computing 
language such as MATLAB’ it is usually easy to produce a Bode plot. 
An example function which describes the behaviour of such a device is: 


which gives the input-output voltage function. Do not worry about the meaning of 
the ‘j’ in the equation for now. This will be covered in Chapter 9 when we discuss 


We could produce a Bode plot by typing: 


semilogx(f, 20*log10(abs(100000./(1+j*£/8)))); 


The function semilogx plots a graph with a logarithmic scale on the x-axis. Chapter 
21 examines Bode plots in more detail. 


EXERCISES 2.4.4 
Evaluate 
(a) log, 8 (b) log, 15 
(c) log), 50 (d) logy, 123 
(e) logs 23 (f) logs 47 
Simplify each of the following to a single log term: 


(a) log7+ logx 


(b) logx + logy + log z 
(c) Iny —In3 

(d) 2logy+logx 

(e) In(y) + InG”) 


Ym 


Q) 


In@x+y) —Iny 
log(2x?) + log(4x) 

3 logy + 2logx 

5 log x* —2logx 
3Inx+2Iny + 4Inz 
logz— 2logx +3 logy 
logr? — log(2r) + 2logt 


Simplify each of the following to a single log term: 


(a) 3In¢ — Int 


(b) Glog? + 4logt 
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(c) nGy®) —2In3 +Iny 
(d) In(6x + 4) — In(3x + 2) 


log (9x) 2 
(e) 7 le (=) 


Sketch graphs of the following functions, using the 


same axes: 
y = In(2x), y=Inx 0<x<10 


Measure the vertical distance between the graphs for 
x = 1,x =2 and x = 8. Can you explain your 
findings using the laws of logarithms? 


Solve the following equations: 


(a) eX = 70 (b) eX = : 

(c) e*=1 (d) 3e* = 50 
(e) e* =50 Gc = 300 
(g) e*+! = 0.75 (h) 2ee2* = 50 
. 3 

(i) S.i- 0.6 @O sa = 06 
(k) (e*)? = 200 () Ve2* =2 


Xx 


(m) /e2* +4 =6 iw — 


ey +2 
(p) 2e* =9 
(r) Be* — 6)? = 8 


(t) 2e2* —7e* +3 =0 


12 
(v) & —-74+—=0 
e* 


0.7 


(0) e?* = 7e* 
(q) (e° +3)? = 25 
(s) e2* —3e° +2 =0 


(u) e*(5—e*) =6 


Solve the following equations: 
(a) 10* = 30 

(c) 4(10*) = 20 
(e) 10°? = 20 
(g) 1073* = 0.02 


(b) 10* = 0.25 
(d) 107* = 90 
(f) 3(10°*3) = 36 
(h) 711072") = 1.4 


(i) 10°? = 20 Gj) 103*+! = 75 
(k : =6 1) (107*)? = 40 
) To = (1) ¢ : = 
(m) 10% = 3 (n) — = 

2+10-* 2 


(0) /102%+6=5 (p) 10% = 30(103*) 
(q) (10-* +2)? =6 (r) 6(1073*) = 10 
(s) 102° —7(10") +10 =0 

(t) 10** — 8(107*) + 16 =0 

(u) 10°—5+6(10-*) =0 

(v) 4(107") — 8(10*) +3 =0 


10 


11 


12 


13 


14 


15 


16 


Solve 

(a) logx = 1.6 

(c) log(2+x) = 1.6 
(e) log(2x — 3) = 0.7 


(b) log 2x = 1.6 
(d) 2log(x”) = 2.4 


Solve 
(a) Inx = 2.4 


(c) 2InQx-—1) =5 
(e) In FF) = 0.9 


(b) In3x=4 
(d) In(2x?) = 4.5 


Solve 
(a) &* =21 (b) 107% = 6.7 

= 0.3 d) 2e¢/2) -1 =0 
(c) ae (d) 2e 


(e) 3(100-#49) = 17 
(f) (et 153 ot ee = 500 


(g) V/102* + 100 = 3(10*) 


Calculate the voltage gain in decibels of the following 
amplifiers: 


(a) input signal = 0.1 V, output signal = 1 V; 
(b) input signal = 1 mV, output signal = 10 V; 
(c) input signal = 5 mV, output signal = 8 V; 
(d) input signal = 60 mV, output signal = 2 V. 


An audio amplifier consists of two stages: a 
preamplifier and a main amplifier. Given the 
following data, calculate the voltage gain in decibels 
of the individual stages and the overall gain in 
decibels of the audio amplifier: 


preamplifier: input signal = 10 mV, 


output signal = 200 mV 


main amplifier: input signal = 400 mV, 


output signal = 3 V 


A Bluetooth radio system operating in Class 3 has a 
maximum output power of 0 dBm at 2.45 GHz. Find 
the maximum output power in watts (W). 


A microwave oven has an output power of 1 kW. 
Express this figure in dBm. 


Express 0 dB SPL as a sound pressure in 
micropascals (Pa). 


A car audio speaker has an output sound pressure 
level of 55 dB SPL when measured at a distance of 
1 m. Calculate the sound pressure at that point in 
pascal r.m.s. 


An active sonar system fitted to a boat produces a 
source level of 220 dB re 1 Pa at 1 m. Calculate the 
sound pressure in kPa. 
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17 By using log-linear paper find the relationship 
between x and y given the following table of values: 


x 15 17 32 39 43 49 
y 85 9.7 27.6 448 59.1 89.6 


Solutions 

1 (a) 3 (b) 3.9069 
(c) 1.4110 (d) 1.7356 
(e) 1.5079 (f) 1.8515 

2 (a) log7x (b) log xyz 
(c) (3) (d) log xy? 


(e) In(xy?) 


0 n=) 
y 


(h) log x*y3 
(j) Inx3y2z4 


tt 
d) log 3 


(g) log 8x3 
G) logl=0 


3 
(k) e( =) 


3 (a) 2Intor In??? (b) 16logt 
y! 
(c) ™(] (d) In2 
9x3/2 
(e) nl 5 
5 (a) 4.2485 (b) —1.0986 
(c) 0 (d) 2.8134 
(e) 1.3040 (f) 1.3519 
(g) 1.2877 (h) 1.1094 
(i) 1.3863 (j) 0.6094 
(k) 1.7661 () 0.6931 
(m) 1.7329 (n) 1.5404 
(0) 1.9459 (p) —1.5041 
(q) 0.6931 (r) —0.9808 
(s) 0, 0.6931 (t) —0.6931, 1.0986 


(u) 0.6931, 1.0986 (v) 1.0986, 1.3863 


11 


12 
13 
14 
15 
16 
‘lz 
18 


By using log-log paper find the relationship between 
x and y given the following table of values: 


x 20 25 30 35 40 4.5 
y 13.0 19.0 25.9 33.6 42.2 51.6 


(a) 1.4771 (b) —0.6021 
(c) 0.6990 (d) 0.9771 
(e) 1.1003 (f) —1.9208 
(g) 0.5663 (h) 0.3495 
(i) 3.3010 (j) 0.2917 
(k) —0.1761 (1) —0.8010 
(m) 0.2386 (n) —0.3010 
(0) 0.6395 (p) 0.4924 
(q) 0.3473 (r) —0.0739 


(s) 0.3010, 0.6990 (t) 0.3010 
(u) 0.3010,0.4771  (v) —0.3010, 0.1761 


(a) 39.81 (b) 19.91  (c) 37.81 
(d) £3.98 (e) 4.01 

(a) 11.02 (b) 18.20. (c) 6.59 

(d) £6.71 — (e) 6.38 

(a) 1.0148 — (b) —0.4130 (c) —0.2877 
(d) —1.3863 (e) 1.3117 (f)_ 2.0553 
(g) 0.5485 

(a) 20dB = (b) 80dB_——(c) 64.1 dB 
(d) 30.46 dB 


Preamplifier gain = 26.02 dB, main amplifier gain 
= 17.50 dB, total gain = 43.52 dB 


10-3 W or 1 mW 
60 dBm 

20 Pa 

0.011 Pa or 11 mPa 
10° or 100 kPa 

y = 3(2*) 


y= 4yh7 


100 


1 
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Technical Computing Exercises 2.4.4 


Use a technical computing language to draw 
y = log(kx) for 0.5 < x < 50 fork = 1, 2, 3 and 4. 


1 
Draw y = Inx andy = in(*) for 
x 


0.5 < x < 20. What do you observe? Can you explain 


3 Draw y = Inx and y = 1 — 5 for 


0.5 < x < 4. From your graphs state an approximate 
solution to 


x 
Inx=1--= 
3 


your observation using the laws of logarithms? 


2.4.5 The hyperbolic functions 


The hyperbolic functions are y(x) = coshx, y(x) = sinhx, yx) = tanhx, yx) = 
sechx, y(x) = cosechx and y(x) = cothx. Cosh is a contracted form of ‘hyperbolic 
cosine’, sinh of ‘hyperbolic sine’ and so on. We define cosh x and sinh x by 


e+e~* ; e*—e* 
y(x) = coshx = ——— y(x) = sinhx = ——— 
2 2 
Note: 
cosh(—x) = eae = coshx 
; ee“ =e ; 
sinh(—x) = 5 = — sinhx 


so, for example, cosh 1.7 = cosh(—1.7) and sinh(—1.7) = — sinh 1.7. Clearly, hyper- 
bolic functions are nothing other than combinations of the exponential functions e* and 
e-*. However, these particular combinations occur so frequently in engineering that it is 
worth introducing the coshx and sinhx functions. The remaining hyperbolic functions 
are defined in terms of cosh x and sinh x. 


Gn ihe sinh x = e*—e” 
coshx e*+e~* 
1 2 
ON cosh x oy Crem: 
= Free ee. D 
MISONO re ~ ee 
cosh x = 1 ees 


x) = cole = = = 
y@) sinh x tanhx e*—e* 


Values of the hyperbolic functions for various x values can be found from a scientific 
calculator. Usually a Hyp button followed by a Sin, Cos or Tan button is used. 


Example 2.19 


Evaluate 

(a) cosh3 (b) sinh(—2) 
(c) tanh 1.6 (d) sech(—2.5) 
(e) coth 1 (f) cosech(—1) 
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Solution (a) cosh3 = 10.07 
(b) sinh(—2) = —3.627 
(c) tanh(1.6) = 0.9217 
(d) sech(—2.5) = 1/cosh(—2.5) = 0.1631 
(e) coth1 = 1/tanh1 = 1.313 
(f) cosech(—1) = 1/ sinh(—1) = —0.8509 


Ve YH 


Graphs of the functions sinhx, coshx and tanhx can be obtained using a graphics 
calculator. 


Hyperbolic identities 
Several identities involving hyperbolic functions exist. They can be verified algebraically 
using the definitions given, and are listed for reference. 


cosh’ x — sinh? x = 1 
1 — tanh? x = sech’x 
coth’ x — 1 = cosech?x 
sinh(x + y) = sinhx cosh y + coshx sinh y 
cosh(x + y) = coshxcoshy + sinhx sinh y 
sinh 2x = 2 sinh x coshx 
cosh 2x = cosh’ x + sinh? x 


cosh 2x + 1 

ieee 
cosh” x 5 

cosh 2x — 1 

inh? i 
sinh x 5 


Note also that 


e“ = coshx + sinhx e ~“=coshx — sinhx 


Hence any combination of exponential terms may be expressed as a combination of 
cosh x and sinh x, and vice versa. 


Example 2.20 Express 


(a) 3e* — 2e™~ in terms of cosh and sinh x, 


x 


(b) 2 sinhx + cosh x in terms of e* and e™. 


Solution (a) 3e* — 2e~* = 3(coshx + sinh x) — 2(coshx — sinhx) = coshx + 5 sinh x. 
ee+e* 3e*-—e* 


b) 2sinh hx =e*—e~* = 
(b) 2sinhx + coshx = e* —e* + 5 5 
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Example 2.21 


Solution 


Inverse hyperbolic functions 
The inverse hyperbolic functions use the familiar notation. Both y = sinhx and y = 
tanh x are one-to-one functions and no domain restriction is needed for an inverse to be 
defined. However, on (—oo, ©), y = cosh is a many-to-one function. If the domain is 
restricted to [0, oo) the resulting function is one-to-one and an inverse function can be 
defined. 

The inverse of the function sinh.x is denoted by sinh~' x. Here the —1 must not be 
interpreted as a power but rather the notation we use for the inverse function. Similarly 
the inverses of cosh.x and tanh. x are denoted by cosh”! x and tanh™' x respectively. 

Values of sinh~!x, cosh~!x and tanh~!x can be obtained using a scientific 
calculator. 


Evaluate 
(a) cosh7!(3.7) (b) sinh! (—2) (c) tanh7!(0.5) 


Using a calculator we get 
(a) 1.9827 (b) —1.4436 (c) 0.5493 


Engineering application 2.15 


Capacitance between two parallel wires 


Although it may not seem obvious, a small capacitance exists between two wires that 
run close to each other and at certain frequencies this can be a significant factor for 
some electrical systems. 

The mutual capacitance per metre, C, between two long parallel wires in air each 
having a radius r metres and with the wire centres separated by d metres is calculated 
using 

——_ 
cosh (d/2r) 


This expression includes an inverse hyperbolic function. In the equation d > 2r, oth- 
erwise the wires would be overlapping. The constant ¢) is a fundamental physi- 
cal constant called the permittivity of free space. It has an approximate value of 
8.85 x 10°? Fm |. 

Recall the general expression for the hyperbolic function cosh x: 


e*+e~* 
2 


To derive the inverse of this hyperbolic function, we need to restrict the domain to 
[0, co), that is x > 0. We let y = cosh x and then solve for x in terms of y: 


coshx = 


- e*+e~* 
Se) 
dy=e*+e”* 


1 
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In order to solve this equation we first multiply both sides by e*, to give 
0=e'e* —e'2y+e'e * 

Using the laws of indices, 
0 =e” —2e*y4+ 1 


We can now write this as a quadratic equation in e* and solve it using the standard 
formula. This gives 


(e*)? — 2y(e*) +1 =0 


_ —btVP—4ac  —(—2y) + /(—2y — 40) 0) 
a 2a a 2(1) 


2y+ /4—4 
ee 


e 


As x > 0, the function e* > 1 so we reject the negative root and it follows that 


e=y+/y—-1 


from which we get 
x=in(y+ Vy? = 1) 
Recall that y = cosh x and so x = cosh” 'y. Therefore 


cosh"! y = In(y + v2 — 1) 


Hence the equation for the pair of wires can also be written in terms of logarithms as 
TE 


An [(a/2r) 4+ /@/2r —1 | 


Cc 


A word of warning about the inverse of coshx is needed. The calculator returns a 
value of 1.9827 for cosh”! (3.7). Note, however, that cosh(—1.9827) = 3.7. The value 
—1.9827 is not returned by the calculator; only values on [0, co) will be returned. This 
is because the domain of y = cosh x is restricted to ensure it has an inverse function. 


EXERCISES 2.4.5 
Evaluate the following: 3 Express 
(a) sinh 3 (b) cosh 1.6 (a) 6e* + 5e~* in terms of sinh x and cosh x, 


(c) tanh0.95 
(e) cosech2 
(g) cosh(—3) 
(i) sinh(—2) 
(k) tanh(—1.4) 


Evaluate 
(a) sinh! 3 
(b) cosh! 2 


(c) tanh~!(—0.25) 


d) sech 1 

in ar 1.5 (b) 4e2* — 3e~2* in terms of sinh 2x and cosh 2x, 
(h) cosech(—1.6) (c) 2e73* — 5e?* in terms of sinh 3x and cosh 3x. 
(j) coth(—2.7) 4 Express 


(1) sech(—0.5) : ; 
(a) 4sinhx + 3coshx in terms of e* ande™*, 


(b) 3 sinh 2x — cosh 2x in terms of e2* and e~2*, 


(c) 3cosh 3x — 0.5 sinh 3x in terms of e** and e~3*. 


5 Express ae* + be~*, where a and b are constants, in 
terms of cosh x and sinh x. 
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6 Express acoshx + bsinhx, where a and b are x y =1 


constants, in terms of e* ande~. 


7 Show that the point x = coshu, y = sinhu lies on the 8 Prove the hyperbolic identities listed in the box earlier 
curve in this section. 
Solutions 
1 (a) 10.0179 = (b) 2.5775 (c) 0.7398 4 (a) 3.5e* —0.5e% 
(d) 0.6481 — (e) 0.2757. -~— (f):1.1048 (b) e* —2e7* 
3 —3x 
(g) 10.0677. = (h) —0.4210 (i) —3.6269 (c) 1.25e°* + 1.75e~"* 
G) —1.0091 (k) —0.8854 (1) 0.8868 5 (a+b) coshx + (a —b) sinhx 
2 (a) 1.8184 § (b) 1.3170 = (c) —0.2554 a+b. a—b 
6 5 e+ ; <7 


3 (a) 1lcoshx-+ sinhx (b) cosh 2x + 7 sinh 2x 
(c) —3 cosh 3x — 7 sinh 3x 


Technical Computing Exercises 2.4.5 


1 Use a technical computing language to draw 3 Draw 
(a) y = sinhx (b) y = coshx 


(c) y = tanhx for —-5<x<5. (a) y = sinh7! x —5<x<5 
e* 
2 Draw graphs of y = sinh x, y = coshx and y = cy for (b) y =cosh7! x l<x<5 
0 <x <5. What happens to the three graphs as x , 
increases? Can you explain this algebraically? (c) y= tanh” x -l<x<1l 


2.4.6 The modulus function 


The modulus of a positive number is simply the number itself. The modulus of a negative 
number is a positive number of the same magnitude. For example, the modulus of 3 is 
3; the modulus of —3 is also 3. We enclose the number in vertical lines to show we are 
finding its modulus, thus 


si=3 [Saas 


Mathematically we define the modulus function as follows: 


The modulus function is defined by 


ay se 
fo=t=| se 


Figure 2.38 illustrates a graph of f(x) = |x|. The modulus of a quantity is never negative. 
Consider two points on the x axis, a and b, as shown in Figure 2.39. Then 


ja — b| = |b— | = distance from a to b 


Example 2.22 


Solution 


Example 2.23 


Solution 
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F(x) 
—————_—_—____"-p,_—— 
a 0 b 
md- -------—-------- > 
x la — bl 
Figure 2.38 Figure 2.39 
The function: f(x) = |x|. Distance from a to b = |a — Dl. 


Find the distance from 
(a) x=2tox=9 (b) x= —2tox=9 (c) x= —2tox=—-9 


(a) Distance = |2 —9| = |—7| =7 
(b) Distance = | —2—9|=|—11]=11 
(c) Distance = | — 2 — (—9)| = |7| =7 


From the definition of the modulus function it follows: 


If |x| = a, then either x = a or x = —a. 
If |x| < a, then —a <x <a. 
If |x| > a, then either x > a or x < —a. 


For example, if |x| = 4 then either x = 4 or x = —4. If |x| < 4 then —4 < x < 4; that 
is, x lies between —4 and 4. If |x| > 4, then either x > 4 or x < —4; that is, x is either 
greater than 4 or less than —4. 


Describe the interval on the x axis defined by 
(a) |x| <2 

(b) |x| 23 

(c) |x-—1| <3 

(d) |x+2|>1 


(a) |x| < 2 is the same statement as —2 < x < 2; that is, x is numerically less than 2. 
Figure 2.40 illustrates this region. Note that the region is an open interval. Since the 
points x = —2 and x = 2 are not included, they are shown on the graph as o. 


(b) If |x| > 3 then either x > 3 or x < — 3. This is shown in Figure 2.41. The required 
region of the x axis has two distinct parts. Since the points x = 3 and x = —3 are 
included in the interval of interest, they are shown on the graph as e. 


(c) |x — 1| < 3 is equivalent to —-3 <x—1 <3, thatis -—2 <x <4. 


(d) |x +2] > 1 is equivalent tox+2 > 1 orx+2 < —l, thatisx > —l orx < —3. 
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——_o—. +A ——_—_o— —o—__4I_______o-——_ 
2 0 2 -3 0 3 
i ki<2 > 3 Inl>3 
Figure 2.40 Figure 2.41 
The quantity |x| < 2 is equivalent to —2 < x < 2. The quantity |x| > 3 is equivalent tox > 3o0rx < —3. 


The modulus function can be used to describe regions in the x—y plane. 


Example 2.24 Sketch the region defined by 
(a) |x| <2 and |y| < 1 
(b) be +y"1 <9 
Solution (a) The region is a rectangle as shown in Figure 2.42. The boundary is not part of the 


region as strict inequalities were used to define it. The region |x| < 2, |y| < 1 is the 
same as that in Figure 2.42 with the boundary included. 


(b) |x? +y| < 9 is equivalent to —9 < x?+y* < 9. Note, however, that x” + y” is never 
negative and so the region is given by 0 < x? + y’ < 9. 


Let P(x, y) be a general point as shown in Figure 2.43. Then from Pythagoras’s 
theorem, the distance from P to the origin is \/x? + y?. So, 


(distance from origin)” = x* + y? <9 
Then, 
(distance from origin) < 3 


This describes a disc, centre the origin, of radius 3 (see Figure 2.44). 


P (x, y) 
x r 
O x 
Figure 2.42 Figure 2.43 Figure 2.44 
The region: |x| < 2 and |y| < 1. Point P(x, y) is a general point. The region: |x? + y?| < 9. 


Engineering application 2.16 


Full-wave rectifier 


A rectifier is an electronic circuit that is used to convert an alternating voltage to 
a direct voltage, that is, one that does not change in polarity with time. Usually 
the alternating voltage has the shape of a sinusoidal function. We will examine the 


1 


1 
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sinusoidal functions in Section 3.4. A full-wave rectifier is one that makes use of the 
negative part of the input sinusoidal signal as well as the positive part, by reversing its 
polarity. It contrasts with a half-wave rectifier that merely discards the negative part 
of the input sinusoidal signal. This means that the half-wave circuit is not driven by 
the alternating voltage supply at all during half of the cycle. In both types of circuit 
a capacitor is usually connected across the output to store energy when the signal 
approaches a peak. The capacitor then discharges into the circuit when the rectifier 
output voltage falls. By this means, a relatively constant voltage is produced at the 
power supply output. 

A fully rectified sine wave is the modulus of the sine wave. The circuit for a full- 
wave rectifier is shown in Figure 2.45 together with the input and output waveforms. 
The input signal is v,, and the output signal is v,. Ignoring the voltage drops across 
the diodes gives 


Uo Vin | 
Vin 
> = 
t ji 
Figure 2.45 
A full-wave rectifier. 
EXERCISES 2.4.6 
Sketch the interval defined by (By Tee) 4 Iya dl <3 
(a) |x] > 4 (b) ly- 1] <3 (c) 2 +y*-1] <4 
2 
(e) |t]<—-l 
Also, state the intervals without using the modulus 3 Express the following intervals using modulus 
Signs. notation: 
Sketch the regions defined by (a) —2<x<2 (b) -4<t<4 
(a) |x| > 2, |y| < 3 (c) l<y<3 (d) -6<1t<0 
Solutions 
(a) x >4orx < —4 (b) —-2<y<4 3 (a) |x| <2 (b) |t| <4 
(c) t> —30rt< —9 (d) -3<t<3 (c) |ly-—2| <1 (d) |t+3| <3 


(e) no value of ¢ satisfies this 
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Technical Computing Exercises 2.4.6 


1 Use a technical computing language such as (a) a full wave rectifier circuit 


MATLAB® to produce a plot of the output of: (b) a half wave rectifier circuit 


2.4.7. The ramp function 


fO 
The ramp function is defined by 
ft) a So c constant 
a ~)0 4<0 
Its graph is shown in Figure 2.46. 
Figure 2.46 


The ramp function. 
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Telescope drive signal 


In order to track the motion of the stars large telescopes are usually driven by an elec- 
tric motor. The speed of this motor is controlled in order that the angular position of 
the telescope follows a specified trajectory with time. The whole assembly, including 
telescope, gears, motor, controller and sensors, forms a position control system or 
servo-system. This servo-system must be fed a signal corresponding to the desired 
trajectory of the system. Often this trajectory is a ramp function as illustrated in Fig- 
ure 2.47. The drive motor is started at time tf = f) and the desired angular position of 
the telescope is 0. 


0 


Drive motor started 


Desired telescope 
position 


Figure 2.47 


% ' ~ Tracking signal for a telescope. 


2.4.8 The unit step function, u(t) 


The unit step function is defined by 


ea 
wt ={ 4 <0 
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u (t) u(t—d) 


1 1 eo—_—_—__—_ 


Figure 2.48 Figure 2.49 
The unit step function. Graph of u(t — d). 


Its graph is shown in Figure 2.48. Note that u(t) has a discontinuity at t = 0. The point 
(0, 1) is part of the function defined on t > 0. This is depicted by e. The point (0, 0) is 
not part of the function defined on t < 0. We use © to illustrate this. 

The position of the discontinuity may be shifted. 


1 ted 
we-a= {5 124 


The graph of u(t — d) is shown in Figure 2.49. 


Engineering application 2.18 


RLC circuit 


The inductor is another of the three fundamental electronic components. It is a device 
that is used to store electrical energy in the form of a concentrated magnetic field. It 
consists of a tightly wound coil, often around a ferromagnetic material, which helps 
to strengthen the magnetic field. It is usually depicted on a circuit diagram as a set 
of loops. Figure 2.50 shows a resistor, inductor and a capacitor connected in parallel. 
This is usually known as an RLC circuit. The circuit also contains a switch, which 
here is depicted as an arrow connecting two terminals with the switch in the open 
position. When the switch is open the voltage v across terminals A and B is zero. If 
the switch is closed at t = 0 the voltage across A and B is V;, where Vg is the supply 
voltage. Thus v can be modelled by the function 


vo={y t<0 
Ve 120 


This can be written, using the unit step function, as 


v(t) = Veu(t) 


Figure 2.50 
B RLC circuit. 
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Example 2.25 Sketch the following functions: 


f 
1 o_O 
1 
1 é] t 
Figure 2.53 Figure 2.54 
The function: The function: Figure 2.55 
f =utt — 1) —u(@¢ —3). f =u —3) -—u—-1). The function: f = e’u(t). 
EXERCISES 2.4.8 
1. Sketch the following functions: 2 A ramp function, f(t), is defined by 
(a) f(t) =utt—1) eo: 
(b) f(t) =u(r+1) f= i ped 
(c) f(t) =u — 2) —u(t — 6) 
(d) f(t) =3u(t — 1) Sketch the following for —1 <t <3: 
coer na 
; =u(t—1)—u 
(g) ft) =u +1) — 2ut— 1) ic lta 
(h) f(t) = 2u(¢ +1) —u(t — 1) (c) ut — DFO 
(i) f(t) = 3u(t — 1) — 2u(t — 2) (d) ut — I) f(t) — ut — 2) f@) 
Solutions 


(a) f =u(t — 3) (b) f=u(t—1) 
(c) f=u(tt— 1) —u(t — 3) (d) f=u(t— 3) -—u(t — 1) 
(e) f =e'u(t) 
Solution (a) See Figure 2.51. (b) See Figure 2.52. 
(c) See Figure 2.53. (d) See Figure 2.54. 
(e) See Figure 2.55. 
f f 
1 — 1 —<$£@ 
3 t 1 t 
Figure 2.51 Figure 2.52 
The function: f = u(t — 3). The function: f = u(t — 1). 


2 See Figure S.5. 


NRW SS 


(a) (b) -10 12 


Figure S.5 


2.4.9 


Figure 2.56 
The rectangle function, R(t). The delayed rectangle function, R(t — d). 
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u(t — l)f u(t — 1)f— u(t -2)f 


The delta function or unit impulse function, 5(t) 


Consider the rectangle function, R(t), shown in Figure 2.56. The base of the rectangle 
is h, the height is 1/h and so the area is 1. Fort > h/2 andt < —h/2, the function is 0. 
As h decreases, the base diminishes and the height increases; the area remains constant 
at 1. 

As h approaches 0, the base becomes infinitesimally small and the height infinitely 
large. The area remains at unity. The rectangle function is then called a delta function 
or unit impulse function. It has a value of 0 everywhere except at the origin. 


5(t) = rectangle function as h approaches 0 
We write this concisely as 
d(t) = R(t) as h->0O 


The position of the delta function may be changed from the origin to t = d. Consider a 
rectangle function, R(t — d), shown in Figure 2.57. R(t — d) is obtained by translating 
R(t) an amount d to the right. Again, letting approach 0 produces a delta function, this 
time centred ont = d. 


d(t —d) = R(t —d) as h—>0O 


We have seen that the delta function can be regarded as bounding an area | between 
itself and the horizontal axis. More generally the area bounded by the function 


f(t) = kd(t) 


is k. We say that kd(t) represents an impulse of strength & at the origin, and kd (t — d) is 
an impulse of strength k att = d. It is often useful to depict such an impulse by an arrow 
where the height of the arrow gives the strength of the impulse. A series of impulses is 
often termed an impulse train. 


R(t) 


1 
h 


Figure 2.57 
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Example 2.26 A train of impulses is given by 
f(t) = 8) + 36¢ — 1) + 26(¢ — 2) 


Depict the train graphically. 


Solution Figure 2.58 shows the representation. In Section 22.8 we shall call such a function a 
series of weighted impulses where the weights are 1, 3 and 2, respectively. 


f@O 


3 
2: 
1 .- 
Figure 2.58 
A train of impulses given by 
0 l 7 t f(t) = 6(t) +: 38(t — 1) + 26(¢ — 2). 


Engineering application 2.19 


Impulse response of a system 


An impulse signal is sometimes used to test an electronic system. It can be thought 
of as giving the system a very harsh jolt for a very short period of time. 

It is not possible to produce an impulse function electronically as no practical 
signal can have an infinite height. However, an approximation to an impulse function 
is often used, consisting of a pulse with a large voltage, V, and short duration, T. The 
strength of such an impulse is V7. When this pulse signal is injected into a system 
the output obtained is known as the impulse response of the system. 

The approximation is valid provided the width of the pulse is an order of magni- 
tude less than the fastest time constant in the system. If the value of T required is small 
in order to satisfy this constraint, then the value of V may need to be large to achieve 
the correct impulse strength, V7. Often this can rule out its use for many systems as 
the value of V would then be large enough to distort the system characteristics. 


EXERCISES 2.4.9 


1. Sketch the impulse train given by (b) f(t) = 38(t) + 48(¢ — 2) + 6(t — 3) 
(a) f(t) = S(t — 1) + 28(t — 2) 
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REVIEW EXERCISES 2 


1. State the rule and sketch the graph of each of the (a) y(x) = 3+% (b) y(x) = 2x 
following functions: x ae 
= e+ 1 

(a) f(x) =7x—2 (c) ya) =3-e (d) yQ) = = 


a 
BE Se Aes 10 Simplify each expression as far as possible. 


5 
(c) g(x) =3e°+4 OK<x<2 


(a) eX e3x (b) exe2%e—3x 
(d) y(t) = (e% —1)/2 t>0 a 
; (c) etere (d): == 
(e) f@®) =x +2x+5 —2<x<2 = 
2 
2 State the domain and range of the functions in (e) = (f) In3x+In 2 
Question 1. ere x 
3 Determine the inverse of each of the following (g) 3int + 2In?? (h) ent 
functions: (i) me ie 
(a) y(x) = 2x (b) f(t) = 8-3 (ky fms (1) In(e**) 
2X (m) e2!nx (n) In(e?) + In(e2*) 
(c) hx) = — +1 (d) m(r) = 1-—3r 
: 11. Solve the following: 
(e) H(s) = Fi +2 (f) f(v) = Inv (a) e** = 200 (b) e3*-6 — 150 
x — () — 
(g) fe) =e (h) gv) =Inv +1 me Tas Ree ee 
() gv) =Inw+ 1) @) yt) =3e? © Gpex =o 
4 If f(t) =e! find 12 Solve the following: 
(a) f(2t) (b) f(x) (a) 0.5Int = 1.2 
(c) fa) (d) f(t—A) (b) In(3r+2) = 1.4 
5 If g(t) = In(@?? + 1) find (a) g(A), (b) g(t — A). (c) 3in(t— 1) =6 


(d) logyg(t? — 1) = 1.5 
(e) logyg(nt) = 0.5 


6 Sketch the following functions: 


_|0 #49 (f) In(logyyt) = 0.5 
@ fO= fe t>0 
13 Express each of the following in terms of sinh x and 
4 t<0 cosh x: 
(b) f= yt OK<t<3 (a) Je* + 3e-* (b) 6e* — 5Se~* 
2t 3<t<4 3e* — 2e-¥ 1 
Oo £are 
(©) f@)=lel| —3<1<3 & 
(e) te 
7 The function f(x) is periodic with a period of 2, and ca 
f(x) = |x|, -1 <x < 1. Sketch f for -—3 <x <3. 14 Express each of the following in terms of e* and e*: 
8 Given a(t) = 31, b(t) =t +3 and c(t) = 1? — 3 write (a) Dinh Scosx 
expressions 
(b) tanhx + sechx 
(a) b(c(t)) (b) c(b@)) 3 
(c) a(b(t)) (d) a(c(t)) (c) 2coshx — a sinh x 
(e) a(b(c(t))) — ()_ e(b(a(t))) 1 


(4) —_—_____ 
9 Sketch the following functions, stating any sinh x — 2 cosh. 


asymptotes: (e) (sinh.x)? 


114 Chapter 2 Engineering functions 


15 Describe the interval on the x axis defined by 16 Sketch 
(a) |x| < 1.5 (b) |x| > 2 (a) f =|—2zt| —-3<t<3 
(e) 2-1] <5 
(c) 26(t) —d(@¢ — 1) +36(¢ + 1) 
Solutions 
1 (a) Multiply input by 7, then subtract 2 10 (a) e* (b) 1 
(b) Square the input, then subtract 2 (d) e* (e) e* 
(c) Calculate e*, where x is the input, multiply by 3 (g) Int? (h) x 
and een add 4 . Gia (k) x 
(d) Multiply by 2, calculate exponential, subtract | > 
and then divide by 2 (m) x (n) 3+ 2x 
(e) Cube input, add to twice the input and add 5 11 (a) 1.3246 (b) 3.6702 
2 (a) Domain (—oo, oo) range (—0o, 00) (c) —1.7918 (d) 42.0234 
(b) Domain [0, 5] range [—2, 23] (e) —1.3863 
(c) Domain [0, 2] range [7, 3e2 + 4] 12 (a) 11.0232 (b) 0.6851 
(d) Domain [0, oo) range [0, oo) (c) 8.3891 (d) +5.7116 
(e) Domain me 2] range [—7, 17] (ec) 23.6243 (f) 44.5370 
=f = 
ge @y @=; MM (a) 10coshx +4sinhx 
(b) fiw = t+ 3 (b) cosh x + 11 sinhx 
j 3 8 ) coshx + 5 sinh x 
(©) WM) = S@=1) . 2 
1 
l-r 
@ mas © J eoshx 
3 cosh x + sinh x 
H7! = 
(e) (s) s—2 (e) 1 + coshx + sinhx 
Gf wee 7 x 3o-x 
() f= hint me) oot 
h) gh) =e! Pe et Sl 
@) g'@)=e-1 et fe-* 
t x lle-* 
() y'@ =In( £) +2 ee 
3 8 
—2 
2 x a kh d 
Sa@e (be Ce Me @) ay 3eo 
5 (a) In(a2 +1) (b) In((t — A)? + 1) (<<) 
e 
8 (a) r? 2 
(b) (¢ +3)? —3 ort? + 6 +6 15 (a) -1.5<x<15 
(c) 3(¢+3) (b) x > 2andx < —2 
2 
(d) a —3) (c) -10<x<4 
aor (d) x>3andx< -3 
(f) (3¢ + 3)* —3 or 91? + 187 + 6 


—2<x<3 


Contents 


functions 


3.1 Introduction 115 
3.2 Degrees and radians 116 
3.3 The trigonometric ratios 116 
3.4 The sine, cosine and tangent functions 120 
3.5 The sinc x function 123 
3.6 Trigonometric identities 125 
3.7 Modelling waves using sin¢ and cost 131 
3.8 Trigonometric equations 144 
Review exercises 3 150 


INTRODUCTION 


Many common engineering functions were studied in Section 2.4. However, the trigono- 
metric functions are so important that they deserve separate treatment. Often alternating 
currents and voltages can be described using trigonometric functions, and they occur 
frequently in the solution of various kinds of equations. 

Section 3.3 introduces the three trigonometric ratios: sin@, cos @ and tan @. These are 
defined as ratios of the sides of a right-angled triangle. The definitions are then extended 
so that angles of any magnitude may be considered. Section 3.5 introduces an impor- 
tant function, sinc x. This is really a combination of the familiar functions x and sin x, 
but because this combination occurs frequently in some engineering applications it de- 
serves special mention. Trigonometric identities are introduced in Section 3.6, the most 
common ones being tabulated. These identities are useful in simplifying trigonomet- 
ric expressions. Graphs of the trigonometric functions are illustrated. The application 
of these functions to the modelling of waveforms is an important section. The chapter 
closes with the solution of trigonometric equations. 
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[a DEGREES AND RADIANS 


Angles can be measured in units of either degrees or radians. The symbol for degree 
is °. Usually no symbol is used to denote radians and this is the convention adopted in 
this book. 

A complete revolution is defined as 360° or 27t radians. It is easy to use this fact to 
convert between the two measures. We have 


360° = 27 radians 


2 Tt : 
= — = — radians 
360 180 


° 


180 
1 radian = —— degrees © 57.3° 
a 
Note that 
T radi 90° 
— radians = 
2 
7m radians = 180° 
37 . 
= radians = 270° 


Your calculator should be able to work with angles measured in both radians and degrees. 
Usually the Mode button allows you to select the appropriate measure. 


[se THE TRIGONOMETRIC RATIOS 


Consider the angle 6 in the right-angled triangle ABC, as shown in Figure 3.1. We define 
the trigonometric ratios sine, cosine and tangent as follows: 


sey) side opposite to angle BC 
sind = = 
hypotenuse AC 


9 side adjacent to angle AB 
0s 0 = = 
hypotenuse AC 


ne side opposite to angle — BC 


~ side adjacent to angle AB 


Cc 


<q 


Figure 3.1 


A = \ [J ; : 
BA right-angled triangle, ABC. 
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Note that 
BC BC AC __ sing 
AB AC AB cos0 


Note that when 6 reduces to 0° the length of the side BC reduces to zero and so 


tand = 


sin0° = 0, tan0° = 0 
Also when @ reduces to 0° the lengths of AB and AC become equal and so 
cos 0° = | 


Similarly when 6 approaches 90°, the lengths of BC and AC become equal and the length 
of AB shrinks to zero. Hence 


sin 90° = 1, cos 90° = 0 


Note also that the length of AB shrinks to zero as @ approaches 90°, and so tan@ ap- 
proaches infinity. We write this as 


tan@ — co as 0 > 90° 


The trigonometric ratios of 30°, 45° and 60° occur frequently in calculations. They can 
be calculated exactly by considering the right-angled triangles shown in Figure 3.2. 


1 1 
Simi? = —=, css = —, canta 
/2 /2 
1 1 
rae =e, Reece Me eee 
p) 2) J3 
3 1 
sin 60° = es CoOso0m— 5 tan 60° = V3 


Most scientific calculators have pre-programmed values of sin @, cos 6 and tan @. An- 
gles can be measured in degrees or radians. We will use radians unless stated otherwise. 
If we let 2 ACB = a (see Figure 3.1), then 


sina = —~ =cos@ 
AC 
and 


cosa = —~ = sind 
AC 


Figure 3.2 

The trigonometric ratios for 30°, 
45° and 60° can be found exactly 
from these triangles. 
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But 
+O=5 
a = 
2 


Hence, 


. ™ 
sind = cos| ~ —6@ 

2 

_ {a 
cos@ = sin{ — —0 

2 


Since @ is an angle in a right-angled triangle it cannot exceed 71/2. In order to define 
the sine, cosine and tangent ratios for angles larger than 7t/2 we introduce an extended 
definition which is applicable to angles of any size. 

Consider an arm, OP, fixed at O, which can rotate (see Figure 3.3). The angle, 0, in 
radians, between the arm and the positive x axis is measured anticlockwise. The arm can 
be projected onto both the x and y axes. These projections are OA and OB, respectively. 
Whether the arm projects onto the positive or negative x and y axes depends upon which 
quadrant the arm is situated in. The length of the arm OP is always positive. Then, 


projection of OP onto y axis _ OB 


sin@ = 
OP OP 
projection of OP onto x axis OA 
cos 6 = = 
OP OP 
projection of OP onto yaxis OB 
tan@ = = 


projection of OP onto x axis OA 


In the first quadrant, that is 0 < @ < 7t/2, both the x and y projections are positive, 
so sin@, cos @ and tan@ are positive. In the second quadrant, that is 77/2 < 0 < 7, the 
x projection, OA, is negative and the y projection, OB, positive. Hence sin @ is positive, 
and cos @ and tan @ are negative. Both the x and y projections are negative for the third 
quadrant and so sin@ and cos @ are negative while tan @ is positive. Finally, in the fourth 
quadrant, the x projection is positive and the y projection is negative. Hence, sin @ and 
tan @ are negative, and cos @ is positive (see Figure 3.4). The sign of the trigonometric 
ratios can be summarized by Figure 3.5. 

For angles greater than 271, the arm OP simply rotates more than one revolution be- 
fore coming to rest. Each complete revolution brings OP back to its original position. 


Second quadrant First quadrant 


P 


A O x 


Third quadrant Fourth quadrant 
Figure 3.3 


An arm, OP, fixed at O, which can rotate. 
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Third quadrant 
Figure 3.4 


Fourth quadrant 


Evaluating the trigonometric ratios in each of the four quadrants. 


So, for example, 


sin(8.76) = sin(8.76 — 27) = sin(2.477) = 0.617 
cos(14.5) = cos(14.5 — 47t) = cos(1.934) = —0.355 


Negative angles are interpreted as a clockwise movement of the arm. Figure 3.6 illus- 


trates an angle of —2. Note that 


sin(—2) = sin(27t — 2) = sin(4.283) = —0.909 


since an anticlockwise movement of OP of 4.283 radians would result in the arm being 
in the same position as a clockwise movement of 2 radians. 


y 
sin@>0 sin@>0 
cos 8<0 cos @>0 


tand <0 tan@>0 


sin@ <0 sin@ <0 
cos 8 <0 cos 0>0 
tand>0 tan @<0 


Figure 3.5 
Sign of the trigonometric ratios in 
each of the four quadrants. 


Figure 3.6 
Illustration of the angle 6 = —2. 
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The cosecant, secant and cotangent ratios are defined as the reciprocals of the sine, 
cosine and tangent ratios. 


1 
cosec 90 = —— 
sin 0 

il 

en = 

cos 0 

il 
cot@ = —— 
tan@ 


Example 3.1 An angle 6 is such that sin@ > 0 and cos@ < 0. In which quadrant does 6 lie? 


Solution From Figure 3.5 we see that sin@6 > 0 when 6 is in the first and second quadrants. Also, 
cos @ < 0 when @ is in the second and third quadrants. For both sin@ > 0 and cos@ < 0 
thus requires 9 to be in the second quadrant. Hence sin@ > 0 and cos@ < 0 when @ is 
in the second quadrant. 


EXERCISES 3.3 


1 Verify using a scientific calculator that (b) cos 1.4 = cos(1.4 + 87) = cos(1.4 — 67) 


(a) sin 30° = sin390° = sin 750° (c) tanl = tan(1 + 7t) = tan(1 + 271) = tan(1 + 37t) 
(b) cos 100° = cos 460° = cos 820° (d) sin2.3 = sin(2.3 — 27) = sin(2.3 — 47) 

(c) tan 40° = tan 220° = tan 400° 

(d) sin 70° = sin(—290°) = sin(—650°) 
(e) cos 200° = cos(—160°) = cos(—520°) 


(e) cos2 = cos(2 — 27t) = cos(2 — 47t) 
(f) tan4 = tan(4 — 7) = tan(4 — 27) = tan(4 — 371) 


(f) tan 150° = tan(—30°) = tan(—210°) 3  Anangle 6 is such that cos@ > 0 and tané < 0. In 
2 Verify the following using a scientific calculator. All which quadrant does 6 lie? 
angles are in radians. 4 Anangle a is such that tana > 0 and sina < 0. In 
(a) sin0.7 = sin(0.7 + 27t) = sin(0.7 + 471) which quadrant does a lie? 
Solutions 
3 4th quadrant | 4 3rd quadrant 


THE SINE, COSINE AND TANGENT FUNCTIONS 


The sine, cosine and tangent functions follow directly from the trigonometric ratios. 
These are defined to be f(x) = sinx, f(x) = cosx and f(x) = tanx. Graphs can be 
constructed from a table of values found using a scientific calculator. They are shown in 
Figures 3.7 to 3.9. Note that these functions are many-to-one. 


3.4.1 
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f@) A 
1 


Figure 3.7 Figure 3.8 
Graph of f(x) = sinx. Graph of f(x) = cos x. 


Figure 3.9 
Graph of f(x) = tanx. 


By shifting the cosine function to the right by an amount 7t/2 the sine function is 
obtained. Similarly, shifting the sine function to the left by 71/2 results in the cosine 
function. This interchangeability between the sine and cosine functions is reflected in 
their being commonly referred to as sinusoidal functions. Notice also from the graphs 
two important properties of sin x and cos x: 


sin x = — sin(—x) 


cos x = cos(—x) 


For example, 


. 7 : 7m Tt Tt 
sn = = =sint = — and cos — = Cos} —— 
3 ( :) 3 ( ) 


Note that f(x) = sinx is 0 when x = 0, +71, +27, £37,..., tn7,.... Also f(x) = 
is 0 wh a 3m St +(2 a 9 
cos x is 0 when x = pe gitar n Ree 


Inverse trigonometric functions 


All six trigonometric functions have an inverse but we will only examine those of sin x, 
cos x and tan x. The inverse functions of sinx, cos x and tan x are denoted sin7! x, cos~! x 
and tan~! x. This notation can and does cause confusion. The ‘—1’ in sin~! x is some- 
times mistakenly interpreted as a power. We write (sin.x)~! to denote 1/ sin x. Values of 
sin! x, cos~! x and tan~! x can be found using a scientific calculator. If y = sinx, then 


x = sin! y, as in 


sinx = 0.7654  x=sin' (0.7654) = 0.8717 
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Figure 3.10 


The function sin x is one-to-one if the Figure 3.11 . Figure 3.12 
a The inverse sine A single input produces many 
5° | function, sin7! x. output values. This is not a function. 


domain is restricted to | — 


Note that y = sin x is a many-to-one function. If the domain is restricted to [—7t/2, 7/2] 
then the resulting function is one-to-one. This is shown in Figure 3.10. 

Recall from Section 2.3 that a one-to-one function has a corresponding inverse. So 
if the domain of y = sinx is restricted to [—7t/2, 71/2], then an inverse function ex- 
ists. A graph of y = sin”! x is shown in Figure 3.11. Without the domain restriction, a 
one-to-many graph would result as shown in Figure 3.12. To denote the inverse sine 
function clearly, we write 


= sin”! —-~<ycrz 
x 
y A SIS 5 


Use a scientific calculator to evaluate 


Example 3.2 


Solution 


(a) sin~! (0.3169) 
(b) sin~'(—0.8061) 


(a) sin-!(0.3169) = 0.3225 
(b) sin~'(—0.8061) = —0.9375 


A word of warning about inverse trigonometric functions is needed. The calculator re- 
turns a value of 0.3225 for sin~! (0.3169). Note, however, that sin(0.3225 + 2nm) = 
0.3169, n = 0,1,2,3,..., so there are an infinite number of values of x such that 
sinx = 0.3169. Only one of these values is returned by the calculator. This is because 
the domain of y = sinx is restricted to ensure it has an inverse function. To ensure the 
inverse functions y = cos! x and y = tan™! x can be obtained, restrictions are placed 
on the domains of y = cosx and y = tanx. By convention, y = cosx has its domain 
restricted to [0, 71] whereas with y = tan x the restriction is (—7/2, 7/2). 
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EXERCISES 3.4 
1. Evaluate the following: 3. Sketch y = sinx and y = cos x for x in the interval 
(a) sin7!(0.75) — (b) cos7!(0.625) (c) tan7!3 [—27t, 271]. Mark on the graphs the points where 
(d) sin~'(—0.9)  (e) cos~!(—0.75)  (f) tan~!(—3) x=1,x=1.5,x=—3 andx = 2.3. 
2 Show that 
sin-! x +cos7! x = ds 
2 
Solutions 
1 (a) 0.8481 (b) 0.8957 (c) 1.2490 (d) — 1.1198 (e) 2.4189 (f) — 1.2490 


Technical Computing Exercises 3.4 


1 Using a technical computing language draw y = sinx 2 Draw the graphs of y = cos! x and y = tan™! x. 
and y = cosx for 0 < x < 47 on the same axes. Use 
your graphs to find approximate solutions to the 
equation sin x = cos x. 


THE SINC x FUNCTION 


The sine function is used to define another important function used in engineering. The 
cardinal sine function, sinc x, occurs frequently in engineering mathematics in applica- 
tions ranging from communications, power electronics, digital signal processing (d.s.p.) 
and optical engineering. The standard sinc x function without normalization is plotted 
in Figure 3.13 and is defined by 
sin x 
; — x40 
sinc x = x 


1 x=0 
It is necessary to separately define the value of the function at x = 0 because the 


ae ‘ sin x ee Nevis ‘ : 
definition of the function, , would cause a division by zero at this point. Fortunately 


it can be shown that as x approaches 0, then = approaches 1, and so specifying the 


value sinc x = | at x = 0 is adequate to resolve the problem. Note that the sinc function 
without normalization is 0 when x = ..., —37, —27, —71, 7, 271, 37, ..., so that apart 
from x = 0 where sinc x = | by definition, it has the same zero crossing points as that 
of sin x. 

Some engineers use a different definition of the sinc x function: 


sin 71x 


x40 
sinc x = TX 
1 x=0 
This is commonly termed the normalized sinc function. 
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0.25 


Figure 3.13 
The standard sinc function without normalization. 


Engineering application 3.1 


Zero crossing points of the normalized sinc(x) function 


The sine function often appears in communications engineering and signal process- 

ing. In general, a signal can be regarded as being made of a large number of individual 

frequency components. In the case of a square pulse, the graph showing the amplitude 

of these components against their frequencies takes the form of a sinc function. 
Find the zero crossing points of the normalized sinc function, 


sin(7tx) 
sinc(x) = TX nea 
1 (0) 


Solution 
It is evident from Figure 3.13 that the sinc function which has not been normalized 
has the same zero crossing points as sin x, with the exception of the crossing atx = 0, 
(LANE TS BYE oe = SETS, SET, SEBIMG, oo og SMM on co 

The normalized sinc function is 0 when sin(7tx) is 0. Now, from page 121 we see 


that 

Sins = (0) Wl nvein 32 = 5. 5 Sn, —, rae, (0), oe rae, Sia, coo 
and so 

Sint — Owhenti— i — 3 1) Oe in Tre 
and hence 


sin(7tx) = 0 when x = —3, —2, —1,0,1,2,3... 
Noting that sinc(0) is defined to have a value of 1, then we see that 


sinc(x) = 0 whenx =... —3, —2, —-1,1,2,3,... 


Example 3.3 


Solution 
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or 


sme) = O when 7 = aya tora = I, 2, 3s 00> 


The plot of the normalized sinc function is shown in Figure 3.14. 


Since x 


Figure 3.14 
The sinc function with normalization. 


TRIGONOMETRIC IDENTITIES 


0 
We have seen that tan@d = = for all values of 6. We call this an identity. In an 
identity, the 1.h.s. and the rhs. are always equal, unlike in an equation where the |.h.s. 
and the r.h.s. are equal only for particular values of the variable concerned. Table 3.1 


lists some common trigonometric identities. 


Simplify 
cotA 
cosA 
We know that 
sinA 
tan A = —— 
cosA 
and so 
1 A 
cotA = = 
tanA sinA 
Hence 
cotA 
=cotA x 
cosA osA 
cosA 1 
= x 
sinA cosA 
_ 1 
~ sinA 


This may also be written as cosec A. 
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Table 3.1 
Common trigonometric identities. 


tanA = 


sin(A + B) = sinAcosB+sinBcosA 
cos(A + B) = cosAcosB = sinA sinB 
tanA + tanB 

1 tanA tanB 
2sinAcos B = sin(A + B) + sin(A — B) 
2cosA cos B = cos(A + B) + cos(A — B) 
2sinA sinB = cos(A — B) — cos(A + B) 
sin? A + cos?A = 1 

1 + cot” A = cosec2A 

tan? A +1 =sec?A 

cos2A = 1 — 2sin? A = 2cos”A — 1 = cos* A — sin2A 
sin2A = 2sinAcosA 


tan(A + B) = 


a) 1 —cos2A 

sine A = ——___ 
2 

1 2A 

cos? A = SE 


Note: sin? A is the notation used for (sin A). Similarly cos? A means (cos A)?. 


Example 3.4 Show that 
tanA + cotA 


may be written as 


2 
sin 2A 


Solution We have 


sinA cosA 
’ cotA = — 
cosA sinA 


and so 
sinA cosA 


tanA +cotA = - 
cosA sinA 


sin? A + cos? A 
sinA cosA 
= a using the identity sin’ A + cos?A = 1 
sinA cosA 
wy) 
2sinAcosA 
2 


=-— using the identity sin2A = 2sinAcosA 
sin 2A 
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Example 3.5 Use the identities in Table 3.1 to simplify the following expressions: 
7 37 
inj ~ +6 b —-6 
(a) sin($ + ) (b) cos( 5 ) 
(c) tan(27t — 0) (d) sin(7t — 0) 
Solution (a) The expression sin (5 + 6) is of the form sin(A + B) and so we use the identity 
sin(A + B) = sinAcosB-+ sinBcosA 
Putting A = and B = @ we obtain 
‘ (= 4 6) . 7 Pactie ™ 
sin ( = = sin ~ co sin @ cos = 
2 oo 2 
_ 7 Tt 
We note that sin 5 = 1, cos 3 = 0 and so 


sin (5 z 6) = 1 cos + sin6(0) 
= cosé 
3 
(b) The expression cos (F _ 0) has the form cos(A — B); hence we use the identity 


cos(A — B) = cosAcosB+ sinAsinB 


3m 
Putting A = >” B= 6 we obtain 


3 
cos ea tg eae hi sae 
2 2 2 


37 . 3m 
Now cos cy = 0, sin a = —1andso 


37 ; 
cos a 6 | = Ocosé + (—1) sin@ 
= —sind 
(c) We use the identity 


tanA — tanB 
tan(A — B) = 
1+ tanA tan B 


Substituting A = 27, B = 6 we obtain 


tan 27 — tand 
tan(27 — 0) = 
1 + tan 271 tan@ 


Since tan 27t = 0 this simplifies to 


—tand 
tan(277 — 9) = i = —tand 


(d) We use the identity 


sin(A — B) = sinA cos B — sinBcosA 
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Substituting A = 71, B = 6, this then becomes 
sin(7 — 0) = sin7tcos 6 — sin@ cos 7t 
Now sin 7 = 0, cos 7t = —1 and so we obtain 


sin(7z — 0) = Ocos@ — sin@(—1) = sind 


Example 3.6 Simplify 
(a) cos(7+ 64) 
(b) tan(m — 6) 
(c) sin? B + sinBcos? B 
(d) tanA(1 + cos 2A) 


Solution (a) Using the identity for cos(A + B) with A = 1, B = 0 we obtain 


cos(7 +60) = cos7 cos@ — sin7z sind@ 
= (—1)cosé@ — (0) sin@ 


= —cos@ 


(b 


wm 


Using the identity for tan(A — B) with A = 7, B = 6 we obtain 


tan 7 — tandé 
1 + tan 7ttan@ 
_, = tan? 

~ 1+ (0) tané 


tan(7a— 0) = 


= —tand 


(c) sin’ B + sin B cos? B = sin B(sin’ B + cos” B) 
=sinB since sin’ B + cos? B = 1 


inA 
(d) Firstly we note that tanA = = Also we have from Table 3.1 
cos 


1+ cos2A 
2 


cos? A = 
from which 

1+cos2A =2cos’A 
Hence 


sinA 2 
tanA(1 + cos 2A) = 2cos°A 
cosA 


2sinAcosA 
= sin2A 
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Example 3.7 Show that 


: : _ (A+B A-B 
sinA + sinB = 2sin —o cos 9 


Solution Consider the identities 


sin(C + D) = sinC cos D + sinDcosC 
sin(C — D) = sinC cos D — sinDcosC 


By adding these identities we obtain 
sin(C + D) + sin(C — D) = 2sinC cos D 


We now make the substitutions C + D = A, C — D = B from which 


A+B A-B 
C= ——_, D= —— 
2 2 
Hence 
: . ._ (A+B A-B 
sinA + sinB = 2sin a cos a 


The result of Example 3.7 is one of many similar results. These are listed in Table 3.2. 


Table 3.2 
Further trigonometric identities 


: : . (A+B A 
sinA+sinB =2sin 5 cos 

; F . (A-B A 
sinA —sinB =2sin 5 cos 


A+B A-B 
cosA + cos B = 2cos a cos {| ——— 


_ (A+B\ . (A-B 
cosA — cos B = —2sin = sin =z 


| 
ies] 


Ni+ 
wy 
ee ee 


Example 3.8 Simplify 
sin 70° — sin 30° 
cos 50° 


Solution We note that the numerator, sin 70° — sin30°, has the form sinA — sin B. Using the 
identity for sinA — sin B with A = 70° and B = 30° we see 


7 eo} a re) "} te} oO 
sin 70° — sin30° = 2sin ei cos ee 
2 2 
= 2sin 20° cos 50° 
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1 


Hence 


sin 70° — sin 30° _ 2 sin 20° cos 50° 


cos 50° 


cos 50° 


= 2sin 20° 


EXERCISES 3.6 
Use the identities for sin(A + B), cos(A + B) and 
tan(A + B) to simplify the following: 
(a) sin(o— 7 (b) cos( — = 
a) sin{ 6 — — cos{ 6 — — 
2 2 
(c) tan(@ + 70) (d) sin(@ — 7) 
(e) cos(@ — 71) (f) tan(@ — 37) 
: 37 
(g) sin(@ + 70) (h) cos{| 6 + > 
(i) sin(20+ ==) @ cos(o - 
i) sin — cos{ 6 — — 
2 2 
Tt 
(k) con(§ - 2 
2 
Write down the trigonometric identity for tan(A + @). 
7 Tl 
By letting A > a show that tan (5 + ) can be 
simplified to — cot é. 
(a) By dividing the identity 
sin? A + cos?A = 1 by cos? A show that 
tan? A + 1 = sec? A. 
(b) By dividing the identity 
sin? A + cos?A = 1 by sin? A show that 
1+cot?A = cosec 2A. 
Simplify the following expressions: 
(a) cosA tanA (b) sin@ coté 
Solutions 
(a) —cosé (b) sin@ (c) tané 
(d) —sin@ (e) —cosd (f) tané 
(g) —sind (h) sin@ (i) —cos20 
Gg) —siné (k) —sin@ 
(a) sinA (b) cosé (c) secB 
(d) cosec 2x (e) -l (f) 2sint 


(c) tan B cosec B 


T 
(e) tané@ tan 5 +0 


[Hint: see Question 2.] 


(g) sin? A + 2cos? A 

(i) (1 + cot? X) tan? X 
1 

(k) 5 sin 2A tanA 


sin 2A 
cos 2A 
(0) (tan? 6+ 1) cot2 6 


5 Simplify 
(a) sin 110° — sin 70° 
(b) cos 20° — cos 80° 
(c) sin 40° + sin 20° 
cos 50° + cos 40° 


d 
(d) Va 


6 Show that 
sin 60° + sin 30° 
sin 50° — sin 40° 
is equivalent to 


cos 15° 


sin 5° 


(d) cot 2x sec 2x 


sin 2t 


® cost 
(h) 2cos?B—1 
(j) (sin? A + cos” A)? 
(1) (sec? t—1) cos? t 
sinA 
sin 2A 
(p) cos2A +2 sin? A 


(n 


(g) 1+cos?A  (h) cos2B (i) sec2X 
ij) 1 (k) sin? A (1) sin?+ 
(m) tan2A (n) 5 secA (0) cosec 269 
(p) 1 

5 (a) 0 (b) sin 50° (c) cos 10° (d) cos 5° 
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MODELLING WAVES USING SIN¢ AND COS¢ 


Examining the graphs of sinx and cos x reveals that they have a similar shape to waves. 
In fact, sine and cosine functions are often used to model waves and we will see in 
Chapter 23 that almost any wave can be broken down into a combination of sine and 
cosine functions. The main waves found in engineering are ones that vary with time and 
so t is often the independent variable. 

The amplitude of a wave is the maximum displacement of the wave from its mean 
position. So, for example, sint and cost have an amplitude of 1, the amplitude of 2 sint 
is 2, and the amplitude of A sint is A (see Figure 3.15). 


The amplitude of A sint is A. The amplitude of A cost is A. 


A more general wave is defined by A cos wt or A sin wt. The symbol w represents the 
angular frequency of the wave. It is measured in radians per second. For example, sin 3t 
has an angular frequency of 3 rad s~!. Ast increases by 1 second the angle, 3r, increases 
by 3 radians. Note that sin has an angular frequency of | rad s~!. 


The angular frequency of y = A sin wt and y = Acos wt is w radians per second. 


The sine and cosine functions repeat themselves at regular intervals and so are pe- 
riodic functions. Looking at Figure 3.7 we see that one complete cycle of sint is com- 
pleted every 27t seconds. The time taken to complete one full cycle is called the period 
and is denoted by 7. Hence the period of y = sintf is 27t seconds. Similarly the period of 
y = cost is 27 seconds. Mathematically this means that adding or subtracting multiples 
of 27t to t does not change the sine or cosine of that angle. 


sint = sin(t + 2n7t) (Hs, tl A Se 5 ae 
cost = cos(t + 2n7t) f=, 2,3 
In particular we note that 


sint = sin(t + 27t) 
cost = cos(t + 271) 


We now consider y = A sinwt and y = Acos wt. When t = 0 seconds, wt = 0 radians. 


271 271 : . 
When t = — seconds, wt = w— = 27 radians. We can see that as t increases from 0 


Figure 3.15 
The amplitude of f(t) = Asint is A. 
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27 : : 
to — seconds, the angle wf increases from 0 to 27t radians. We know that as the angle 
a) 


ot increases by 27t radians then A sin wt completes a full cycle. Hence a full cycle is 


27 27 
completed in — seconds, that is the period of y = A sin wf is — seconds. 
a) o 
: . 27 
If y = Asina@t or y= Acosat, then the period T is —. 
o 


In particular we note that the period of y = A sint and y = A cost is 27. 

Closely related to the period is the frequency of a wave. The frequency is the number 
of cycles completed in | second. Frequency is measured in units called hertz (Hz). One 
hertz is one cycle per second. We have seen that y = A sin wt takes 


27 
— seconds to complete one cycle 
o 


and so it will take 


1 second to complete © cycles 
27 


We use f as the symbol for frequency and so 


@ 
fi f= 
requency, f a 

: 3 
For example, sin 3t has a frequency of = Hz. 


Note that by rearrangement we may write 
@ = 2nf 


and so the wave y = A sin wt may also be written as y = A sin 27tft. 
From the definitions of period and frequency we can see that 
1 
period = ————— 
frequency 


that is 


= 
if 


We see that the period is the reciprocal of the frequency. Identical results apply for the 
wave y = Acosat. 

A final generalization is to introduce a phase angle or phase, ¢. This allows the 
wave to be shifted along the time axis. It also means that either a sine function or a 
cosine function can be used to represent the same wave. So the general forms are 

Acos(wt + ¢), Asin(wt + ¢) 


Figure 3.16 depicts A sin(wt + @). Note from Figure 3.16 that the actual movement of 
the wave along the time axis is ¢/a. It is easy to show this mathematically: 


Asin(aot + @) = Asino(1 ae ) 
(a) 


The quantity . is called the time displacement. 
0) 


Example 3.9 


Solution 


Example 3.10 


Solution 
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Figure 3.16 
The generalized wave A sin(wt + @). 


The waves met in engineering are often termed signals or waveforms. There are 
no rigid rules concerning the use of the words wave, signal and waveform, and often 
engineers use them interchangeably. We will follow this convention. 


State the amplitude, angular frequency and period of each of the following waves: 


(a) 2 sin 3t 


ne ee 
(b) 5 0s(21 +2) 


. . 2n 27 
(a) Amplitude, A = 2, angular frequency, w = 3, period, T = — = 
a) 
; . 2n 2x 
(b) Amplitude, A = 0.5, angular frequency, w = 2, period, T= — = a= TL. 
() 


State the amplitude, period, phase angle and time displacement of 


(a) 2 sin(4t + 1) 
(b) =e 0.7) 


2t+1 

(c) s0s( 3 ) 
3. (At 
(d) 7sin() 


27 7 
(a) Amplitude = 2, period = a = F phase angle = | relative to 2sin4r, time 


displacement = 0.25. 


2 2 
(b) Amplitude = 3” period = 271, phase angle = —0.7 relative to 3 cos f, time displace- 
ment = —0.7. 
' 2m 1 2t\ 
(c) Amplitude = 4, period = 273 = 3m, phase angle = 3 relative to 4 cos a) time 


displacement = 0.5. 


: 3 : 2m 3m ; 3. (/4t\ . 
(d) Amplitude = —, period = —-~ = —., phase angle = 0 relative to — sin| — }, time 
4 4/3 2 4 3 
displacement = 0. 
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3.7.1 


Engineering application 3.2 


Alternating current waveforms and the electricity supply 


Alternating current waveforms are often found in engineering. The electricity supply 
to homes and businesses most often takes the form of an alternating current. This is 
because it is far easier to supply alternating current electricity than direct current elec- 
tricity when distributing across long distances. Energy losses along the supply cables 
can be reduced by transforming the electricity to high voltages prior to distribution 
but electricity transformers only work with alternating currents. 

Sine and cosine functions are often used to model alternating current (a.c.) wave- 
forms. The equations for an a.c. waveform are 


I=I[,,sin(at + ¢) or I=I,,cos(at + ¢) 


where /,, = maximum current, w = angular frequency and ¢ = phase angle. In prac- 
tice the functions can be shifted along the time axis by giving ¢ a non-zero value and 
so both the sine and the cosine function can be used to model any a.c. waveform; 
which one is used is usually a matter of convenience. 

The angular frequency, w, can be written as w = 27f, where f is the frequency 
of the waveform in Hertz (Hz). The frequency of the electricity supply in Europe 
and across large parts of the world is 50 Hz, while in the Americas and in areas of 
Asia it is 60 Hz. Alternating current supplies are also found on ships, submarines and 
aircraft but these often use 400 Hz as the operating frequency. 


Combining waves 


There are many situations in which engineers need to combine two or more waves 
together to form a single wave. It is possible to make use of trigonometric identities 
to calculate the resulting waveform when several waves are combined. Consider the 
following example. 


Engineering application 3.3 


Combining two sinusoidal voltage signals 
Two voltage signals, v, (t) and v,(t), have the following mathematical expressions: 
v, (t) = 3sint 
v,(t) = 2cost 
(a) State the amplitude and angular frequency of the two signals. 
(b) Obtain an expression for the signal, v,(t), given by 
U3 (t) = v, (t) + 2v,(¢) 


(c) Reduce the expression obtained in part (b) to a single sinusoid and hence state 
its amplitude and phase. 
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Solution 
(a) v,(¢) has an amplitude of 3 volts and an angular frequency wm = | rads. v,(t) 
has an amplitude of 2 volts and an angular frequency w = 1 rad s~'. Note that 
both of these signals have the same angular frequency. 
(b) v(t) = v,(t) + 2v,(0) 
= 3sint + 2(2 cost) 
=3sint +4cost 
We wish to write v(t) in the form R sin(t + @). The choice of sine is arbitrary. 
We could have chosen cosine instead. R is the amplitude of the single sinusoid 
and ¢ is its phase angle. 
Using the trigonometric identity sin(t + @) = sintcos @ + sing cost found 
in Table 3.1 we can write 


1 


(c 


wa 


Rsin(t+ ¢) = R(sint cos @ + sing cos t) 
= (Rceos ¢) sint + (Rsin @) cost 


Comparing this expression with that for v;(t) we note that, in order to make the 
expressions identical, 


Rcos¢@ =3 (3.1) 

Rsing = 4 (3.2) 
We need to solve (3.1) and (3.2) to obtain R and ¢. Squaring each equation gives 

R’ cos’ ¢ =9 

R’ sin? @ = 16 


Adding these equations together we obtain 
R’ cos’ @ + R’ sin’ @ = 9 + 16 
R’(cos’ @ + sin’ g) = 25 
Using the identity cos” ¢ + sin? @ = | this simplifies to 


Ro —25 
R= 
Next we determine ¢. Dividing (3.2) by (3.1) we find 
Rsing 4 
Reos@ a 3 
tang = x 
3 


From (3.1) and (3.2) we can see that both sing and cos ¢ are positive, and so 
@ must lie in the first quadrant. Calculating tan~'(?) using a calculator gives 
o = 0.927 radians. So we can express v,(t) as 


v3(t) = 3sint + 4cost = 5 sin(t + 0.927) 
Finally v,(t) has amplitude 5 volts and phase 0.927 radians. 
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Engineering application 3.3 illustrates an important property when combining to- 
gether two sinusoidal waves of the same angular frequency. 


If two waves of equal angular frequency, w, are added the result is a wave of the 
same angular frequency, w. 


In fact this result holds true when combining any number of waves of the same angular 
frequency. 


Engineering application 3.4 


Combining two sinusoidal current signals 
Two current signals, i, (¢) and i,(t), have the following mathematical expressions: 
i,(t) = 10sin 4t 
i,(t) = 5 cos 4t 
(a) State the amplitude and angular frequency of the two signals. 
(b) Obtain an expression for the signal, 7,(t), given by 7,(t) = 0.3, (t) — 0.4i, (ft). 
(c) Reduce the expression obtained in part (b) to a single sinusoid in the form 
Rcos(4t + @) and hence state its amplitude and phase. 


Solution 


(a) i,(¢) has an amplitude of 10 amps and an angular frequency w = 4 rad Sau i,(t) 
has an amplitude of 5 amps and an angular frequency w = 4 rad s_'. Note that 
both signals have the same angular frequency. 


(b) i,(t) = 0.37, (¢) — 0.47, (¢) 
=0.3 x 10sin4t — 0.4 x Scos 4t 
= 3sin4t — 2 cos 4t 
(c) Let 
3 sin4t — 2cos4t = Rcos(4t + ¢) 
Then using the trigonometric identity given in Table 3.1 
cos(A + B) = cosAcosB — sinA sinB 
with A = 4t and B = ¢ we find 
3 sin 4t — 2cos4t = Rcos(4t + ¢) 
= R(cos 4t cos @ — sin 4t sing) 
= (Rcos ¢) cos 4t — (Rsin @) sin 4t 


Hence 
3 = —Rsing (3.3) 
—2 = Rcos¢ (3.4) 


By squaring each equation and adding we obtain 
9+4=R*(sin? d+ cos?) = R” 
so that R = V 13. 
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From (3.3) and (3.4), both sing and cos ¢ are negative and so ¢ lies in the 
third quadrant. Division of (3.3) by (3.4) gives 


3 —Rsi 
=, Ring = —tang 
—2 Rcos@ 

tang = 1.5 


Using a calculator and noting that ¢ lies in the third quadrant we find @ = 4.124. 
Finally 


3 sin4t — 2cos4t = V 13 cos(4t + 4.124) 


So i,(t) = V 13 cos(4t + 4.124). Therefore 7,(t) has an amplitude of / 13 amps 
and a phase of 4.124 radians. 


Example 3.11 Express 0.5 cos 3r + sin 3t as a single cosine wave. 


Solution Let 
0.5 cos 3t + sin3t = Rcos(3t + d) 
= R(cos 3t cos @ — sin 3t sing) 
= (Rcos d) cos 3t — (Rsin@) sin 3t 


Hence 
0.5 = Rcosd@ (3.5) 
1 = —Rsing (3.6) 


By squaring and adding we obtain 
Lo = 2" 
R=V1.25=1.1180 (4dp.) 
Division of (3.6) by (3.5) yields 
2=-—tand 


From (3.5), cos ¢@ is positive; from (3.6), sing is negative; and so ¢ lies in the fourth 
quadrant. Hence, using a calculator, ¢ = 5.1760. So 


0.5 cos 3t + sin 3t = 1.1180 cos(3t + 5.1760) 


Example 3.12 If acoswt + bsinwt is expressed in the form Rcos(wt — 6) show that R = Ja? + b? 


and tand = -. 
a 


Solution Let 
acoswt + bsinwt = Rcos(wt — @) 
Then, using the trigonometric identity for cos(A — B), we can write 


acos wt + bsinwt = Rcos(wt — 0) 
= R(cos wt cos @ + sinwt sind) 


= (Rcos@) cos wt + (Rsin@) sin wt 
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3.7.2 


Example 3.13 


Solution 


Equating coefficients of cos wt and then sin wt gives 


a= Rcosdé (3.7) 
b= Rsin@ (3.8) 


Squaring these equations and adding gives 


e+b=R 
that is 
R=V@+b? 


Division of (3.8) by (3.7) gives 
— =tand 
a 


as required. 

Note that this example demonstrates that adding two waves of angular frequency w 
forms another wave having the same angular frequency but with a modified amplitude 
and phase. 


Wavelength, wave number and horizontal shift 


The sine and cosine waves described earlier in this section had ¢ as their independent 
variable because the waves commonly met in engineering vary with time. There are 
occasions where the independent variable is distance, x say, and in this case some of the 
terminology changes. Consider the wave 


y =Asin(kx + ¢) 


As before, A is the amplitude of the wave. The quantity k is called the wave number. It 
plays the same role as did the angular frequency, w, when t was the independent variable. 
The length of one cycle of the wave, that is the wavelength, commonly denoted A, is 


27 
related to k by the formula 7 = —. The phase angle is ¢ and its introduction has the 
effect of shifting the graph horizontally. 


Figure 3.17 shows a graph of y = sin 2x. 
(a) State the wave number for this wave. 
(b) Find the wavelength of the wave. 

(c) State the phase angle. 


(a) Comparing y = sin 2x with y = sin kx we see that the wave number, k, is 2. 


27 
(b) The wavelength, 7 = — = 17. Note by observing the graph that this result is 
consistent in that the distance required for one cycle of the wave is 7« units. 


(c) Comparing y = sin 2x with sin(kx + @) we see that the phase angle, @, is 0. 
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Figure 3.17 
A graph of the wave y = sin 2x. 


Example 3.14 Figure 3.18 shows a graph of sin (2: + *). 


(a) State the phase angle. 


(b) By comparing Figures 3.17 and 3.18 we see that the introduction of the phase angle 
has caused a horizontal shift of the graph (to the left). Calculate this shift. 


Figure 3.18 
A graph of the wave 


3 


. T 
y=sin (2-4 *), 


Solution (a) By comparing sin (2 + 5) with sin(kx + @) we see that the phase angle is 7 


Tt ™ 
(b) By writing y = sin (2 + 5) as sin2( + * we note that this is y = sin2x 


Tt 
shifted to the left by a horizontal distance — units. 


The results of this example can be generalized. The wave y = Asin(kx + #) can be 
written y = A sink(x + @/k) so that a phase angle of ¢ introduces a horizontal shift of 
length ¢/k. (Compare this with the expression for time displacement in Section 3.7.) 


™ 27 
Noting that A = —, then k = — and we may write A sin(kx + #) equivalently 


. (27x . . 27 .  @ re 
as A sin > + @ ]. Again, using k = TT the horizontal shift, Re may similarly be 
written as 


Xr 
horizontal shift = 2 = 2 = oh 
k Q2n/rX =2n 
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from which 
27 x horizontal shift 
Xr 


This result is important in the engineering application that follows because, more gen- 
erally, when any two waves arrive at a receiver it enables the difference in their phases, 
¢, to be calculated from knowledge of the horizontal shift between them. 


phase angle = ¢ = 


d 
The presence of ¢ in y = A sin(kx + @) causes a horizontal (left) shift of e = a 


Note that adding any multiple of 27t onto the phase angle ¢ will result in the same 
graph because of the periodicity of the sine function. Consequently, a phase angle could 


Tt 
be quoted as @ + 2n7. For example, the wave sin (20+ *) is the same wave as 
7 ™ 
sin (2x + S + an), sin (2x + 3 + rt) and so on. Normally, we would quote a value 


of the phase that was less than 27t by subtracting multiples of 271 as necessary. 


Engineering application 3.5 


Two-ray propagation model 


It is very useful to be able to model how an electromagnetic wave emitted by a trans- 
mitter propagates through space, in order to predict what signal is collected by the 
receiver. This can be quite a complicated modelling exercise. One of the simplest 
models is the two-ray propagation model. This model assumes that the signal re- 
ceived consists of two main components. There is the signal that is sent direct from 
the transmitter to the receiver and there is the signal that is received after being re- 
flected off the ground. 

Figure 3.19 shows a transmitter with height above the ground h, together with a 
receiver with height above the ground h,. The distance between the transmitter and 
the receiver along the ground is d. 


_~” Receiver 
P 


Figure 3.19 
ee / A transmitter and receiver at 
Wee different heights above the 
Q ground. 


Note that the quantities h,, h, and d are all known. 


(Die 
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Waves can be considered to propagate between the transmitter and the receiver 
in two ways. There is the direct route between transmitter and receiver. The direct 
distance between transmitter and receiver is d,. We obtain an expression for d, in 
terms of the known quantities h,, h, and d by considering the triangle ARST. In this 
triangle, RS = d and ST = TO — SO = h, — h,. Hence by Pythagoras’s theorem in 
ARST we have 

TR? = RS? + ST? 
di = d+ (h, —h,)? 


and so 


dy = Jd? + (h,—h,)? 


Note that d, is expressed in terms of the known quantities h,, h, and d. 


There is also a route whereby a wave is reflected off the ground at point A before 
arriving at the receiver. The point A on the ground is such that TAO equals RAP. 
The distance travelled in this case is d. = TA + AR. We wish to find an expression 
for d, in terms of the known quantities h,, h, and d. In order to simplify the calculation 
of this distance we construct an isosceles triangle, ATAQ, in which TA = QA and 
LTAO = ZQAO. Note that in this triangle, TO = QO = h,. 

Then the distance travelled by this reflected wave, d,, is 

distance travelled = d. = TA+ AR 
= QA+ AR 
= QR 
Consider now AQSR. QR is the hypotenuse of this triangle. So by Pythagoras’s 
theorem we have 
de OR SR SO, 
We have SR = d and SQ = QO + SO=h, +h,. So 
d? =d° + (h, +h, 


from which 


d, =a + (h, +h,)* 


Now if the wavelength of the transmitted wave is A then we can calculate the phase 
difference between the direct wave and the reflected wave, ¢, by noting the difference 
in the distance travelled, d, — d,. Using the result for phase difference from Section 
3.7.2 we have 

TG 


2 
@ = = x horizontal shift = —(d, ae 


27 5 - 
7. (Je erste) j@ sh iy?) 


2 hth ear ie 
Se fii ie | aanalak aa 
o= 7 (afi (M4) ice 


a 
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Now the binomial expansion for ./(1 + x) is (see Section 6.4) 
UG ota tors oe es oe ee ee See 
m8 16 2) 


Using this expansion in the expression for ¢ and noting that the moduli of both (h, + 
h,)/d and (h, — h,)/d are less than 1, we have 


2rd h, +h.) h, —h.)2 
_ on (ee Bie Geass 


¢ r 2d? Dap 


Expanding the bracketed terms gives 


2nd (he + 2h,h, + h? — he + 2h,h, — h?) 
on 2d? 


So 
_ 4h, h,7 
Ad 


This is a simplified approximation for the phase difference between the direct wave 
and the reflected wave. Note that it depends on the height of the transmitter, the height 
of the receiver and the distance between the transmitter and the receiver. 

This calculation is important because under some conditions the phase difference 
between the two paths means that the directed and reflected waves destructively 
interfere. In severe cases this causes the signal to decrease at the receiver enough so 
that the communications link is lost. The effect is often termed multipath-induced 


fading. 


EXERCISES 3.7 


1 


State the amplitude, angular frequency, frequency, 
phase angle and time displacement of the following 
waves: 

(a) 3 sin 2t 


(b) ; sin 4t (c) sin(t + 1) 


(d) 4cos 3t (e) 2sin(t — 3) (f) 5cos(0.4r) 


(g) sin(100z7r) 
G) 4cos(7t — 20) 


(h) 6cos(S5t+2) (i) = sin(0.51) 


State the period of 
(a) 2 sin 7t (b) 7 sin(2t + 3) 


(c) tan 5 (d) sec 3t 


2t 
(e) cosec(2t — 1) (f) ca 3 + ?) 
A voltage source produces a time-varying voltage, 
v(t), given by 
u(t) = 15 sin(207t + 4) t>0 


(a) State the amplitude of v(t). 


(b) State the angular frequency of v(t). 
(c) State the period of u(r). 
(d) State the phase of u(r). 
(e) State the time displacement of v(t). 


(f) State the minimum value of u(r). 


A sinusoidal function has an amplitude of 2 anda 
period of 2. State a possible form of the function. 


State the phase angle and time displacement of 


(a) 2sin(t + 3) relative to 2 sint 
(b) sin(2t — 3) relative to sin 2t 


t t 
(c) cs + 02) relative to cos — 

2 2 
(d) cos(2 — f) relative to cost 


is (8 +4 


(f) sin(4 — 3r) relative to sin 3t 


: 4. ot 
relative to sin 3 


(g) sin(27t + 7t) relative to sin 27tt 
(h) 3 cos(5z7tt — 3) relative to 3 cos Smt 


~ .. | TE ‘ . et 
(i) sin{ — +2] relative to sin — 
3 3 

(j) cos(37t — ft) relative to cost 


Write each of the following in the form 
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7 Write each of the following in the form 


Acos(t — 6), @ > 0: 
(a) 2sint —3cost 
(b) 9sint + 6cost 
(c) 4cost — sint 
(d) 3sint 


Asin(3t+6),0 > 0: : : : . 
8 Write each of the following expressions in the form 


(i) A sin(wt + 8), (ii) A sin(wt — 0), 
(iii) A cos(wt + 0), (iv) A cos(wt — 0) where 6 > 0: 


(a) 2sin3t + 3 cos 3t 
(b) cos 3f — 2 sin 3t 
(c) sin 3t — 4cos 3t 
(d) —cos 3t — 4sin3t 


(a) Ssint+4cost (b) —2 sin 3t + 2 cos 3t 
(c) 4sin 2t — 6cos 2t (d) —sin 5t — 3 cos 5t 


Solutions 
1 (a) 3,2, 5.0.0 (b) 54.0.0 7 (a) Vi3cos(t — 2.5536) 
TT A 
1 3 (b) /T17 cos(t — 0.9828) 
iit 3414 (d) 4,3, —,0,0 
2m 2m (c) V17cos(t — 6.0382) 
1 1 
©) 21,57,-3.-3 ® 5,04, —,0,0 ‘as reoe(1— 3) 
5 
1, 100 0 (h = 9.0A 
i ae a) (i) VAsin(t + 0.675) 
(i) al 0.5, ca 0 (j) 4, 1, i —20, _ 20 (ii) J/A1 sin(t _ 5.608) 
i: 7 a (ii) V41 cos(t + 5.387) 
A a) = Gx (c) 2n (iv) V41 cos(t — 0.896) 
3m 
b) (@) V8sin{ 3¢ + — 
@ = on ae mee ssin( ~ | 
. a 
Ja 15 (b) 20n )—@) OU (ii) ssin( = =| 
1 
(d) 4 (e) — (f) —15 
57 


lithe) se08 € i ;) 


2 2 
4 3 sin(7t + k) or 3 cos(mt + k) 


. 71 
By (a) 3,3 (b) “i=5 (c) 0.2, 0.4 9 VEen( = | 
(d) —2, —2 (e) a5 (f) —0.858, —0.286 (c) (i) V52sin(2t + 5.300) 


(ii) 52 sin(2r — 0.983) 


(g) 5 (h) “3-5 (i) 2, 2 (iii) /52 cos(2t + 3.730) 
Gg) —37, —37 (iv) /52 cos(2t — 2.554) 
8 (a) Vi3sin(3r + 0.9828) (d) Gi) V10sin(S¢ + 4.391) 
(b) V5sin(3r + 2.6779) (ii) V10 sin(5¢ — 1.893) 
(c) V17sin(3t + 4.9574) (iii) 10 cos(5t + 2.820) 
(d) V17sin(3t + 3.3866) (iv) 10 cos(St — 3.463) 
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Technical Computing Exercises 3.7 


1 Plot y=sin2r forO <t < 27. 5 Plot y=sint+3cost for0 <¢t < 37. By reading 


from your graph, state the maximum value of 


2 Plot y=cos3t for0 <t < 37. sint + 3cost. 


t 


3 Ploty= w»(5) for0 <t <4n. 


2 


6 (a) Plot y = 2sin3t —cos3¢ for0 <t < 27. 
Use your graph to find the amplitude of 
2 sin 3t — cos 3¢. 


4 Ploty= co( 3) for0 <t < 67. (b) On the same axes plot y = sin 3t. Estimate the 


time displacement of 2 sin 3t — cos 3¢. 


Example 3.15 


Solution 


TRIGONOMETRIC EQUATIONS 


We examine trigonometric equations which can be written in one of the forms sin z = k, 
cosz = k or tanz = k, where z is the independent variable and k is a constant. These 
equations all have an infinite number of solutions. This is a consequence of the trigono- 


—7T 
metric functions being periodic. For example, sinz = | has solutions z = ..., > 
—3n 7m Sn . 7 
a pha ee These solutions could be expressed as z = 7 +2nn,n=0,1,2,.... 


Sometimes it is useful, indeed necessary, to state all the solutions. At other times we are 
interested only in solutions in some specified interval, for example solving sin z = | for 
0 <z< 2m. The following examples illustrate the method of solution. 


Solve 
(a) sint = 0.6105 forO <t < 27 (b) sint = —0.6105 forO <t < 27. 


Figure 3.20 shows a graph of y = sinf, with horizontal lines drawn at y = 0.6105 and 
y = —0.6105. 


(a) From Figure 3.20 we see that there are two solutions in the interval 0 < t < 27. 
These are given by points A and B. We have 


sint = 0.6105 
and so, using a scientific calculator, we have 

t = sin '(0.6105) = 0.6567 
This is the solution at point A. From the symmetry of the graph, the second solu- 
tion is 

t = 71 — 0.6567 = 2.4849 


This is the solution at point B. The required solutions are t = 0.6567, 2.4849. 


(b) Again from Figure 3.20 we see that the equation has two solutions in the interval 
0 <t < 27. These are given by the points C and D. One solution lies in the interval 


37 As ; 37 
Tt to >? the other solution lies in the interval a, to 27. We have 


sint = —0.6105 
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Figure 3.20 
A and B are solution points for 
sint = 0.6105. C and D are 
solution points for sint = —0.6105. 
and so, using a calculator, we see 
t = sin !(—0.6105) = —0.6567 
Although this value of f is a solution of sint = —0.6105 it is outside the range of 


values of interest. Recall that 
sint = sin(t + 270) 

that is, adding 271 to an angle does not change the sine of the angle. Hence 
t = —0.6567 + 27 = 5.6265 


is a required solution. This is the solution given by point D. From the symmetry of 
Figure 3.20 the other solution is 


t= 7+ 0.6567 = 3.7983 


This is the solution at point C. The required solutions are t = 3.7983, 5.6265. 


Example 3.16 Solve 


Solution 


7 


(a) cost = 0.3685 for 0 < 2 
0 < 20 


< 
(b) cost = —0.3685 for t 


t 
< 
Figure 3.21 shows a graph of y = cost between t = 0 and ft = 27 together with 
horizontal lines at y = 0.3685 and y = —0.3685. 
Tt 
(a) From Figure 3.21 we see that there is a solution of cost = 0.3685 between 0 and 7 
37 
and a solution between a and 27t. These are given by points A and D. Now 
cost = 0.3685 
and so, using a scientific calculator, we see 


t = cos! (0.3685) = 1.1934 
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cos ¢ 
1 


0.3685 


Figure 3.21 

A and D are solution points for 
cost = 0.3685. B and C are 
solution points for cost = —0.3685. 


Tt 
This is the solution between 0 and 3 that is at point A. Using the symmetry of 


Figure 3.21 the other solution at point D is 
t = 27 — 1.1934 = 5.0898 


The required solutions are t = 1.1934 and 5.0898. 


(b) The graph in Figure 3.21 shows there are two solutions of cost = —0.3685. These 
solutions are at points B and C. Given 


cost = —0.3685 

then using a scientific calculator we have 
t = cos”! (—0.3685) = 1.9482 

This is the solution given by point B. By symmetry the other solution at point C is 
t = 27 — 1.9482 = 4.3350 


The required solutions are t = 1.9482 and 4.3350. 


Example 3.17 Solve 


Solution 


™ 


(a) tant = 1.3100 for0 < 2 
0 < 20 


t 
(b) tant = —1.3100 for0 < 


Figure 3.22 shows a graph of y = tant fort = 0 tot = 27 together with horizontal lines 
y = 1.3100 and y = —1.3100. 


7 
(a) There is a solution of tant = 1.3100 between 0 and es and a solution between 7t and 
370 
oi These are given by points A and C. 


tant = 1.3100 
t = tan! (1.3100) = 0.9188 
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Figure 3.22 
A and C are solution points for tant = 1.3100. B and D are 
solution points for tant = —1.3100. 


This is the solution between 0 and ; given by point A. Using Figure 3.22 we can 
see that the second solution is given by 

t=7m+ 0.9188 = 4.0604 
This is given by point C. 


Tt 
(b) Figure 3.22 shows that there are two solutions of tant = —1.3100, one between ~ 


wm 


37 
and 71, the other between cs and 27. Points B and D represent these solutions. Using 


a scientific calculator we have 
t = tan '(—1.3100) = —0.9188 


This solution is outside the range of interest. Noting that the period of tant is 7t we 
see that 


t = —0.9188 + m = 2.2228 
is a solution between . and 7t. This is given by point B. The second solution is 
t = —0.9188 + 27 = 5.3644 


This is the solution given by point D. The required solutions are t = 2.2228 and 
5.3644. 
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Example 3.18 Solve 


Solution 


Example 3.19 


Solution 


(a) 
(b) 


(a) 


(b) 


sin 2t = 0.6105 for0 <t < 27 
cos(3t + 2) = —0.3685 for0 <t < 27 


Let z = 2t. Ast varies from 0 to 27 then z varies from 0 to 47t. Thus the problem is 
equivalent to solving 


sinz = 0.6105 O0<z<4n 


From Example 3.15 the solutions between 0 and 27t are 0.6567 and 2.4849. Since 
sin z has period 271, then the solutions in the next cycle, that is between 27t and 471, 
are z = 0.6567 + 27 = 6.9399 and z = 2.4849 + 27 = 8.7681. Hence 


Zz = 2t = 0.6567, 2.4849, 6.9399, 8.7681 
and so, to four decimal places, 
t = 0.3284, 1.2425, 3.4700, 4.3840 


Let z = 3t + 2. Ast varies from 0 to 271 then z varies from 2 to 67t + 2. Hence the 
problem is equivalent to solving 


cos z = —0.3685 2<z<6m+4+2 


Solutions between 0 and 271 are given in Example 3.16 as z = 1.9482, 4.3350. 
Noting that cos z has period 27r, then solutions between 27 and 471 are z = 1.9482 + 
27m = 8.2314 and z = 4.3350 + 27 = 10.6182, solutions between 47t and 671 are 
z= 1.9482+ 47 = 14.5146 and z = 4.3350+47 = 16.9014 and solutions between 
67 and 87t are z = 1.9482 + 67 = 20.7978 and z = 4.3350 + 67 = 23.1846. The 
solutions between z = 0 and z = 87 are thus 


Zz = 1.9482, 4.3350, 8.2314, 10.6182, 14.5146, 16.9014, 20.7978, 23.1846 


Noting that 677 + 2 = 20.8496 we require values of z between 2 and 20.8496, 
that is 


Zz = 4.3350, 8.2314, 10.6182, 14.5146, 16.9014, 20.7978 
Finally 


z—2 
—— 


= 0.7783, 2.0771, 2.8727, 4.1715, 4.9671, 6.2659 


A voltage, u(t), is given by 


v(t) = 3sin(t + 1) t>0 


Find the first time that the voltage has a value of 1.5 volts. 


We need to solve 3 sin(t + 1) = 1.5, that is 


sin(t + 1) = 0.5 t>0 


Let z=t+ 1. Since t > 0 then z > 1. The problem is thus equivalent to 


sinz = 0.5 z>l1 
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Using a scientific calculator we have 
z= sin’ (0.5) = 0.5236 
This solution is outside the range of interest. By reference to Figure 3.7 the next solu- 
tion is 
z= 7 — 0.5236 = 2.6180 
This is the first value of z greater than 1 such that sinz = 0.5. Finally 
t=z—1= 1.6180 


The voltage first has a value of 1.5 volts when t = 1.618 seconds. 


EXERCISES 3.8 
1. Solve the following equations for 0 < tf < 27: 5 Solve the following equations for 0 < t < 27: 
(a) sint = 0.8426 (b) sint = 0.2146 (a) cos 2t = 0.4234 
(c) sint = 0.5681 (d) sint = —0.4316 
(e) sint = —0.9042 (f) sint = —0.2491 (b) co(5) = —0.5618 
3 
2 Solve the following equations for 0 < t < 27: 
(a) cost = 0.4243 (b) cost = 0.8040 idl aadau 
(c) cost = 0.3500 (d) cost = —0.5618 3 
(e) cost = —0.7423 (f) cost = —0.3658 (d) cos(2t + 0.5) = —0.8300 
3 Solve the following equations for 0 < t < 27: (e) cos(t — 2) = 0.7431 
(a) tant = 0.8493 (b) tant = 1.5326 (f) cos(mt — 1) = —0.5325 
(c) tant = 1.2500 (d) tant = —0.8437 


6 Solve the following equations for 0 < t < 27: 


(a) tan2t = 1.5234 


(ec) tant = —2.0612 —(f) tant = —1.5731 


4 Solve the following equations for 0 < t < 27: 


(b) wn( 5) = —0.8439 


(a) sin 2¢ = 0.6347 (c) tan(3t — 2) = 1.0641 
(b) sin 3t = —0.2516 (d) tan(1.5¢ — 1) = —1.7300 
t+ 
(c) (5) = 0.4250 ee oa 
7 


(f) tan(5t — 6) = —1.2323 
(d) sin(2t + 1) = —0.6230 


(e) sin(2t — 3) = 0.1684 


7 A time-varying voltage, v(t), has the form 


v(t) = 20sin(50mt +20) 120 


(f) vol = ") = —0.4681 Calculate the first time that the voltage has a value of 
3 (a) 2 volts (b) 10 volts (c) 15 volts 
Solutions 
1 (a) 1.0021, 2.1395 (b) 0.2163, 2.9253 2 (a) 1.1326, 5.1506 (b) 0.6368, 5.6464 
(c) 0.6042, 2.5374 (d) 3.5879, 5.8369 (c) 1.2132, 5.0700 (d) 2.1674, 4.1158 


(e) 4.2711, 5.1537 (f) 3.3933, 6.0314 (e) 2.4073, 3.8759 (f) 1.9453, 4.3379 
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3 


(a) 0.7041, 3.8457 (b) 0.9927, 4.1343 
(c) 0.8961, 4.0376 (d) 2.4408, 5.5824 


(e) 2.0225, 5.1641 (f) 2.1370, 5.2786 


(a) 0.3438, 1.2270, 3.4854, 4.3686 

(b) 1.1320, 2.0096, 3.2264, 4.1040, 5.3208, 6.1984 
(c) 0.8779, 5.4053 

(d) 1.4071, 2.3053, 4.5487, 5.4469 

(e) 1.5846, 2.9862, 4.7262, 6.1278 


(f) no solutions 


(a) 0.5668, 2.5748, 3.7084, 5.7164 
(b) no solutions 

(c) 1.3504 

(d) 1.0250, 1.6166, 4.1665, 4.7582 


Technical Computing Exercises 3.8 


Plot y = sint for0 <t < 27 and y = 0.3500 using 
the same axes. Use your graphs to find approximate 
solutions to 


sint = 0.3500 O<t<2n 


Plot y = cost for0 <t < 27 and y = —0.5500 using 
the same axes. Use your graphs to find approximate 
solutions to 


cost + 0.5500 = 0 O<t<2n 


(e) 1.2669, 2.7331 

(f) 0.9971, 1.6396, 2.9971, 3.6396, 4.9971, 5.6396 
(a) 0.4950, 2.0658, 3.6366, 5.2073 

(b) no solutions 

(c) 0.9388, 1.9860, 3.0332, 4.0804, 5.1276, 6.1748 
(d) 2.0633, 4.1577, 6.2521 

(e) 0.6781, 5.3905 


(f) 0.3939, 1.0222, 1.6505, 2.2788, 2.9071, 3.5355, 
4.1638, 4.7921, 5.4204, 6.0487 


(a) 1.2038 x 1072 
(b) 9.3427 x 1073 
(c) 7.2771 x 1073 


Plot y = sin(2t + 1) and y = 2sin¢ forO <t < 27. 
Use your graphs to state approximate solutions to 
sin(2t + 1) = 2sint O0<t<2n 


Plot y = 2 sin 3t and y = 3 cos 2t forO < t < 27. 
Hence state approximate solutions of 


2 sin 3t = 3 cos 2t O0<t<2n 


REVIEW EXERCISES 3 


1 


Express the following angles in radians: 
(a) 45° (b) 72° (c) 100° 
(e) 440° 


(d) 300° 


The following angles are in radians. Express them in 
degrees. 


Tt 3m 
(a) a (b) 37 (c) 1 (d)2 (e) 3.62 


State the quadrant in which the angle a lies given 
(a) sina > Oand tana > 0 

(b) cosa > Oand sina < 0 

(c) tana > Oand cosa < 0 

(d) sina < Oandcosa < 0 

(e) tana < Oandcosa <0 

Simplify the following expressions: 


(a) sint cosec t 


sin x 
(b) —— 


tan x 
cotA 


cosA 
secA 


(c) 


d 
@) cosec A 
(e) cotxtanx 


Simplify the following expressions: 

(a) cos?A + 1+ sin2A 
2sinAcosA 

Oy oA sin A 

cos- A — sin- A 

(c) Vsec2x— 1 

(d) sintcost + 

1 


cosec 2A — 1 


sect cosec t 


(e) 


Simplify the following expressions: 


(a) (sinx + cos x)? -—1 

(b) tanA sin2A + 1 + cos2A 
sin 46 + sin 20 
cos 20 — cos 4@ 


(d) 4sinA cosA cos 2A 
sint 


State the amplitude, angular frequency, period, 
frequency, phase and time displacement of the 
following waves: 


(a) 2sin 3t 

(b) 4cos 6t 

(c) 0.7 sin(2t + 3) 

(d) 0.1 cos mt 

(e) sin(507t + 20) 

(f) 6cos(1007t — 30) 


(g) + sin( ¢ 
= sin| = 
a a 
(h) 0.25 cos(27t + 1) 
Express the following in the form 
Asin(at + ¢), d > 0: 
(a) 6sin 5t + 5 cos 5t 
(b) 0.1 sint — 0.2 cost 


(c) 7 sin 3t + 6cos 3t 


«aee(5) - ssin(5) 


(e) 3 sin 2t + 15 cos 2t 


Express the following in the form 
Asin(wt — ¢), d > 0: 


(a) 3sin4t + 7 cos 4t 
(b) 3cos 2t — 5 sin 2t 
(c) 4sin 6t — 7 cos 6t 
(d) : t ar t 
5 cost + 3 sin 
(e) 0.75 sin(0.5t) — 1.25 cos(0.5r) 


Express the following in the form A cos(wt + ¢), 


b > 0: 

(a) 10sin3t + 16cos 3t 
(b) —6sin 2t — 3cos 2t 
(c) sint —2cost 

(d) 0.6 cos 4t + 1.3 sin 4t 
(e) cos 7t —5sin7t 


11 


12 


13 


14 


Review exercises 3 


Express each of the following in the form 
Acos(wt — f), @ > 0: 


(a) 2.3 sin 3t + 6.4 cos 3t 
(b) —sin2t — 2 cos 2t 
(c) 2cos9t + 9 sin 9t 
(d) 4sin4t + 5 cos 4t 


t wht 
(e) soo 5] 2sin(5) 


Express each of the following in the form 
Asin(at + @), d > 0: 

(a) sin(t + 1) + cos(t + 1) 

(b) 2sin(2t + 3) — 3cos(2t + 1) 

(c) cos(3t — 1) — 2 sin(3t + 4) 

(d) sin(t+ 1) + sin(t + 3) 

(e) cos(2t — 1) + 3cos(2t + 3) 
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Reduce each of the following expressions to a single 
wave and in each case state the amplitude and phase 


angle of the resultant wave: 


(a) 2coswt + 3 sin wt 


T 
(b) cos{ wt + 4 + sin wt 


(c) 2sin eee +4cos ease 
2 4 
(d) 0.5sin (~ = ;) a sn + ;) 


(e) 3sinot +4 sin(wt + 1) —2cos (~ = 5) 


Solve the following equations, stating all solutions 


between 0 and 27: 
(a) sint = 0.5216 
(b) sint = —0.3724 
(c) cost = 0.9231 


(d) cost = —1 
(e) tant = 0.1437 
(f) tant = —1 


Solve the following equations, stating all solutions 


between 0 and 27: 
(a) sin2t = 0.5421 
(b) cos 2t = —0.4687 


t 
(c) tan 3 = —1.6235 


(d) 2sin4t = 1.5 
(e) 5cos 2t = 2 
(f) 4tan2¢ = 5 
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16 A voltage, v(t), varies with time, t, according to 


v(t) = 240 sin(1007¢ + 30) 


t>0 


Find the first time that v(t) has a value of 


(a) 
(c) 


240 (b) 0 
—240 (d) 100 


Solutions 


a (a) 
(d) 
2 (a) 
(d) 
3 (a) 
(d) 
4& (a) 
(d) 


5 (a) 
(d) 


6 (a) 


(d) 


mm (a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 


(h) 


(b) 
(c) 


Tw) mw 1.7453 
4 5 OS 


5.2360 (e) 7.6794 

60° = (b) 540° (c)_ 135° 
114.6° (e) 207.4° 

Ist (b) 4th (c) 3rd 
3rd (e) 2nd 


1 (b) cosx (c) cosecA 
tanA (e) 1 


2 (b) tan2A (c) tanx 


sin 2t (e) tan? A 


sin 2x (b) 2 (c) coté 


it 
sin 4A (e) 2sin ( ‘) 


2n 3 
Qn’ 
3 
,—,0,0 
1 

1 
0.7, 2, 71, ig? 

Tt 2 


0,0 


’ 


1 
0.1, 7,2, =,0,0 
rg 


2 
1, 507, 0.04, 25, 20, — 
51 
6, 1007t, 0.02, 50, —30, a 
107 
1 1 1 


=> 5747, —,0,0 
2 2 4n 
1 


0.25, 2rr, 1, 1, 1, == 
V61 sin(5t + 0.6947) 
/0.05 sin(t + 5.1760) 
4/85 sin(3t + 0.7086) 


17 


10 


11 


12 


13 


Simplify as far as possible 
(a) cos 100° + cos 80° 


(b) cos 100° — cos 80° 
sin 50° + sin 40° 


() : 
cos 5 
sin 80° — sin 60° 
(d) ————. 
2 sin 10° 


(d) /97 sol | + a) 


(e) /234 sin(2r + 1.3734) 
(a) /58 sin(4t — 5.1173) 
(b) 34 sin(2t — 3.6820) 
(c) 65 sin(6t — 1.0517) 
(d) ; sin(t — 5.6397) 

(e) 1.4577 sin(0.5t — 1.0304) 
(a) V356cos(3t + 5.7246) 
(b) /45 cos(2t + 2.0344) 
(c) V5cos(t + 3.6052) 

(d) /2.05 cos(4t + 5.1448) 
(e) /26cos(7t + 1.3734) 
(a) 6.8007 cos(3t — 0.3450) 
(b) V5 cos(2t — 3.6052) 

(c) /85cos(9t — 1.3521) 
(d) 41 cos(4t — 0.6747) 


(e) vA00m( | — 24603) 


(a) V2sin(t + 1.7854) 

(b) 1.4451 sin(2r + 5.0987) 

(c) 2.9725 sin(3t + 0.7628) 

(d) 1.0806 sin(t + 2) 

(e) 2.4654 sin(2r + 4.8828) 

(a) /13 sin(wt + 0.5880), amplitude = ./13, phase 
angle = 0.5880 


(b) 0.765 sin(@t + 1.1781), amplitude = 0.765, 
phase angle = 1.1781 


(c) 5.596 sin(wt + 2.10), amplitude = 5.596, phase 
angle = 2.10 


(d) 1.581 sin(wt + 0.4636), amplitude = 1.581, 
phase angle = 0.4636 


(e) —3sinwft, amplitude = 3, phase angle = 7 
14 (a) 0.5487, 2.5929 
(c) 0.3947, 5.8885 


(b) 3.5232, 5.9016 


(d) 3.1416, ie. 71 
3x 71 
4° 4 
15 (a) 0.2865, 1.2843, 3.4281, 4.4259 


(e) 0.1427, 3.2843 (f) 


(b) 1.0293, 2.1123, 4.1709, 5.2539 


16 


17 
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(c) 4.2457 


(d) 0.2120, 0.5734, 1.7828, 2.1442, 3.3536, 3.7150, 
4.9244, 5.2858 


(e) 0.5796, 2.5620, 3.7212, 5.7035 

(f) 0.4480, 2.0188, 3.5896, 5.1604 

(a) 9.5070 x 1073 (b) 4.5070 x 1073 
(c) 1.9507 x 10-2. (d): 5.8751 x 1073 
(a) 0 (b) —2sin 10° 

(c) V2 (d) cos 70° 
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INTRODUCTION 


The coordinates of a point describe its position. The most common coordinate system 
is the x—y system: the first number, x, gives the distance along the x axis, the second 
number, y, gives the distance along the y axis. However, this is not the only way to 
describe the position of a point. This chapter outlines several ways in which the position 
of a point can be described. 


CARTESIAN COORDINATE SYSTEM — TWO DIMENSIONS 


The Cartesian coordinate system is named after the French mathematician Descartes. 
The system comprises two axes — the x axis and the y axis — which intersect at right 
angles at the point O. The point O is called the origin. Figure 4.1 shows the Cartesian 
coordinate system. By convention the x axis is drawn horizontally. The positive x axis 
lies to the right of the origin, the negative x axis lies to the left of the origin, the positive 
y axis lies above the origin and the negative y axis lies below the origin. 

Note that this coordinate system can be used only for locating points in a plane, that 
is it has two dimensions. 

Consider any point, P, in the plane. The horizontal distance of P from the y axis 
is called the x coordinate. The vertical distance of P from the x axis is called the y 
coordinate. Either coordinate can be positive or negative. 
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x coordinate 


y coordinate 


Figure 4.1 
23 Cartesian coordinate system in 
two dimensions. 


When stating the coordinates of a point, by convention we always state the x coor- 
dinate first. Thus (3, 1) means that the x coordinate is 3 and the y coordinate is 1. We 
also write, for example, A(3, 1) to mean that the point whose coordinates are (3, 1) is 
labelled A. In Figure 4.1 P has coordinates (2, 3). 


Example 4.1 Plot the points whose Cartesian coordinates are 
(a) (4,2) b=) (c) (-2, 1) 


Solution As plotted in Figure 4.2 


(a) R has coordinates (4, 2). 
(b) S has coordinates (—1, —3). 
(c) T has coordinates (—2, 1). 


Example 4.2 State the coordinates of the points A and B as shown in Figure 4.3. 


Solution A has coordinates (—3, —1); B has coordinates (3, —2). 


Figure 4.2 Figure 4.3 
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Engineering application 4.1 


Electrode coordinates 


Often in science and engineering it is useful to place metallic conductors, known as 
electrodes, inside a glass container from which air has been evacuated. For exam- 
ple, electrical valves, which are also known as vacuum tubes, are constructed in this 
manner. Audio amplifiers that make use of valve technology have made a comeback 
in recent years. Many musicians prefer the sound they generate. 

Figure 4.4 shows two electrodes inside an evacuated glass envelope. State the 
coordinates of the four points A, B, C and D. 


p< 


a 
SoS 


NM Go Cr GN oN 


glass envelope 


Figure 4.4 
3 4 5 6 7 8 9 10 11 12 x Electrode coordinates. 


So 
— 
N 


Solution 
A has coordinates (4, 9). 


B has coordinates (9, 6). 
C has coordinates (8, 10). 
D has coordinates (8, 6). 


To simulate the electric field between these electrodes, and to produce a proper design 
for manufacture, a computer model may be used. Such a model would involve the use 
of mathematical techniques to solve the fields in the space between the electrodes. 
However, the user would have to input the shape of the device. Coordinates would 
be used to specify exactly where each part of the device is located. The coordinates 
could easily be changed in the computer model to test the effects of changing the 
size and position of the electrodes. Computer models speed up the design process by 
avoiding the need for early-stage prototypes. 


EXERCISES 4.2 


1. Plot the following points: A(2, —2), B(—2, 1), 
C(—1, 0), DO, —2). 


2 State the coordinates of the points U, V and W as 
shown in Figure 4.5. 


Figure 4.5 


Solutions 


1. Figure S.6 shows the points A, B, C and D. 


2 U(-2, —2), V(4, Ll), WG, -—1) 
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A point P lies on the x axis. State the y coordinate 
of P. 


A point Q lies on the y axis. State the x coordinate 


of Q. 


y 
2 
B 
e 1 
Cc 
! e ! —_ 
2 -1 O 1 2 x 
-l 
—2 Pp a. 
Figure S.6 


“= ~=CARTESIAN COORDINATE SYSTEM — THREE DIMENSIONS 


Many engineering problems require the use of three dimensions. Figure 4.6 illustrates a 
three-dimensional coordinate system. It comprises three axes, x, y and z. The axes are 
all at right angles to one another and intersect at the origin, O. 

The position of any point in three-dimensional space is given by specifying its x, y 
and z coordinates. By convention the x coordinate is stated first, then the y coordinate and 
finally the z coordinate. For example, P(2, 3, 4) has an x coordinate of 2, a y coordinate 
of 3 and a z coordinate of 4. It is illustrated in Figure 4.6. From the origin, P is located 
by travelling 2 units in the x direction, followed by 3 units in the y direction, followed 


by 4 units in the z direction. 


Note that, as with a two-dimensional system, coordinates can be negative. 


Example 4.3 Plot the following points: A(1, —1, 2), B(O, 1, 2), C(O, 0, 1). 


Solution Figure 4.7 illustrates the points A, B and C. 


We now consider the equation of a plane. Any point on the x—y plane has a z coordinate 
of 0. Hence the equation of the x—y plane is z = 0. Similarly z = 1 represents a plane 
parallel to the x-y plane but | unit above it. Point C in Figure 4.7 lies in the plane 
z = 1. All points in this plane have a z coordinate of 1. 
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P (2, 3, 4) 


Figure 4.6 Figure 4.7 
Cartesian coordinate system in three The points A, B and C are 
dimensions. plotted in three dimensions. 


A and B in Figure 4.7 lie in the plane z = 2. All points in this plane have a z coordinate 


of 2. 
fl Plot the points A(2, 0, —1), BU, —1, 1) and | | State the equation of the plane passing through 
C(-1, 1, 2). (3, 1, 7), (-1, 1, 0) and (6, 1, —3). 


B State the equation of the plane passing through 
(4,7, —1), 3, 0, —1) and (1, 2, —1). 


ai Figure S.7 illustrates the points A, B and C. | 8 y=1 


g==1 


Figure 4.8 
P has polar coordinates r and 6. 
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POLAR COORDINATES 


We have seen how the x and y coordinates of a point describe its location in the x—y 
plane. There is an alternative way to describe the location of a point. Figure 4.8 illustrates 
a point P in the x-y plane. 

P has Cartesian coordinates (x, y). Hence 


OA =x, AP=y 


Consider the arm OP. The length of OP is the distance of P from the origin. We denote 
this by r, that is 


length of OP =r 


Clearly, r is never negative, that is r > 0. 

We note that the angle between the positive x axis and OP is 6. The value of @ lies 
between 0 and 271 radians or 0° to 360° if degrees are used. 

The values of r and 6 are known as the polar coordinates of P. Conventionally, the 
value of r is stated first, then the value of 9. We commonly write these polar coordinates 
as rZé. 

The values of r and @ specify the position of a point. Conventionally, positive values 
of @ are measured anticlockwise from the positive x axis. 


The polar coordinates of a point, P, are r@. The value of r is the distance of P from 
the origin; the value of 6 is the angle between the positive x axis and the arm OP. 


7? 20) 0<0@ <2n (0° < @ < 360°) 


Engineering application 4.2 


Pick and place robot 


Robots are now widely used in factories in order to reduce labour costs. They vary in 
complexity depending on their function. Almost all printed circuit boards found in 
electronic devices such as computers and mobiles are assembled by robots. Figure 4.9 
shows a simple robot that can pick up a surface-mount electronic component in one 
position and place it in another position. 

It consists of a rotating arm the length of which can be extended and contracted. 
On the end of the arm is a hand which can be closed to pick up a component and 
opened to release it. 


zero datum 


—— 


Figure 4.9 
A pick and place robot. 
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It is necessary for a computer to carry out calculations in order to find where a com- 
ponent is located and where to place a component. Decide upon a suitable coordinate 
system to use when carrying out these calculations. 


Solution 

If we examine the geometry of the robot then we see that a polar coordinate system 
would be the most suitable. The centre of the coordinate system should be on the axis 
of rotation. The length of the arm is then given by r and the orientation of the arm is 
given by @ relative to an agreed zero datum mark. 


Example 4.4 Plot the points P, Q and R whose polar coordinates are 


Solution 


(a) 2, 70° 
(b) 4, 160° 
(c) 3, 300° 


Figure 4.10 shows the three points plotted. 


(a) (b) 


Figure 4.10 
A point can be located by the values of its polar coordinates. 


Consider the arm from the origin to the point. The value of r gives the length of this 
arm. The value of 6 gives the angle between the positive x axis and the arm, measuring 
anticlockwise from the positive x axis. 

By studying AOPA, shown in Figure 4.11, we can see that 


cos @ = and so x = rcos@ (4.1) 


sind = 


SIs Se 


and so y = rsin@ (4.2) 


Hence if we know the values of r and 6@, that is the polar coordinates of a point, we 
can use Equations (4.1) and (4.2) to calculate x and y, the Cartesian coordinates of the 
point. 
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Figure 4.11 Figure 4.12 
The polar coordinates are r, 0; the The Cartesian coordinates can be 
Cartesian coordinates are x, y. calculated from the polar coordinates. 


Example 4.5 A point has polar coordinates r = 4, 0 = 210°. Calculate the Cartesian coordinates of 
the point. Plot the point. 


Solution The Cartesian coordinates are given by 


x =rcosé = 4cos 210° = —3.4641 
y=rsiné = 4sin210° = —2 


Figure 4.12 illustrates the point. 


We now look at the problem of calculating the polar coordinates given the Cartesian 
coordinates. Equations (4.1) and (4.2) can be arranged so that r and 6 can be found from 
the values of x and y. Consider a typical point P as shown in Figure 4.11. 

The Cartesian coordinates are (x, y). Suppose that the values of x and y are known. 
The polar coordinates are 7, 6; these values are unknown. By applying Pythagoras’s 
theorem to AOPA we see that 


Party’ 


and so 
r= / x2 + y? 


Note that since r is the distance from O to P it is always positive and so the positive 
square root is taken. 

We now express @ in terms of the Cartesian coordinates x and y. From Figure 4.11 
we see that 


tan@d = y 
% 


and hence 
6 =tan™! (2) 
x 


In summary we have 


r= Je+y 6 =tan7! (=) 


x 


However, we need to exercise a little extra care before calculating tan7! (*) and 
; ; ; x 
reading the result from a calculator. As an illustration note that 


tan 40° = 0.8391 and tan 220° = 0.8391 
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Example 4.6 


Solution 


and so tan7!(0.8391) could be 40° or 220°. Similarly tan105° = —3.7321 and 
tan 285° = —3.7321 and so tan~!(—3.7321) could be 105° or 285°. The value given 


on your calculator when calculating tan! (*) may not be the actual value of 6 we 
x 


require. In order to clarify the situation it is always useful to sketch the Cartesian coor- 
dinates and the angle 6 before embarking on the calculation. 


The Cartesian coordinates of P are (4, 7); those of Q are (—5, 6). Calculate the polar 
coordinates of P and Q. 


Figure 4.13 illustrates the situation for P. 
Then 


r=/42472 = V65 = 8.0623 


Note from Figure 4.13 that P is in the first quadrant, that is 6 lies between 0° and 90°. 
Now 


7 
From a calculator, tan7! ({) = 60.26°. Since we know that 6 lies between 0° and 90° 


then clearly 60.26° is the required value. 
The polar coordinates of P are r = 8.0623, 0 = 60.26°. 
Figure 4.14 illustrates the situation for Q. 

We have 


From Figure 4.14 we see that @ lies between 90° and 180°. Now 


tan~"(*) = tan! (=) = tan™'(—1.2) 
se = 


A calculator returns the value of —50.19° which is clearly not the required value. Recall 
that tan @ is periodic with period 180°. Hence the required angle is 


180° + (—50.19°) = 129.81° 


The polar coordinates of Q are r = 7.8102, 0 = 129.81°. 


-5 O x 


Figure 4.13 Figure 4.14 
P is in the first quadrant. Q lies in the second quadrant. 
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EXERCISES 4.4 
Given the polar coordinates, calculate the Cartesian Given the Cartesian coordinates, calculate the polar 
coordinates of each point. coordinates of each point. 
PS eae (a) (7,11) (b) (-6,-12) (©) (0,15) 
ie arn (d) (-4,6) — @) -4,0) () (-4,0) 
(c) r= 15.7, 0 = 3.7 radians 
i 7a he= 5 maine 
Solutions 
(a) 5.6631, 4.1145 (b) —1.9081, 9.8163 ie) 2 
d) 7.2111, 123.69° 
(c) —13.3152, —8.3184 (d) 0, 1 () 
(e) 4, 0° 
(a) r= 13.0384, 6 = 57.53 (f) 4, 180° 


4.5 


Example 4.7 


Solution 


13.4164, 243.43° 


SOME SIMPLE POLAR CURVES 


Using Cartesian coordinates the equation y = mx describes the equation of a line passing 
through the origin. The equation of a line through the origin can also be stated using 
polar coordinates. In addition, it is easy to state the equation of a circle using polar co- 
ordinates. 


Equation of a line 
Consider all points whose polar coordinates are of the form r/45°. Note that the angle 
@ is fixed at 45° but that r, the distance from the origin, can vary. As r increases, a line 
at 45° to the positive x axis is traced out. Figure 4.15 illustrates this. 

Thus, 6 = 45° is the equation of a line starting at the origin, at 45° to the positive 
X axis. 

In general, 6 = @., where @. is a fixed value, is the equation of a line inclined at 0. to 
the positive x axis, starting at the origin. 


Equation of a circle, centre on the origin 
Consider all points with polar coordinates 3/6. Here r, the distance from the origin, is 
fixed at 3 and @ can vary. As @ varies from 0° to 360° a circle, radius 3, centre on the 
origin, is swept out. Figure 4.16 illustrates this. 

In general r = r, where r, is a fixed value, 0° < 6 < 360° describes a circle of radius 
r., centred on the origin. 


Draw the curve traced out by r = 3, 0° < 6 < 180°. 


Here r is fixed at 3 and 6 varies from 0° to 180°. As @ varies a semicircle is traced out. 
Figure 4.17 illustrates this. 
At P, @ = 0°, while at Q, 0 = 180°. 
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Example 4.8 


Solution 


0 = 45° 


O 


Figure 4.15 
When @ is fixed and r varies, a straight 
line from the origin is traced out. 


Q O ex 


Figure 4.17 
As 6 varies from 0° to 180° a 
semicircle is traced out. 


yh 
3 
3 x 
Figure 4.16 
When r is fixed and 6 varies, a circle is 
swept out. 


O P Q x 


Figure 4.18 
Surface for Example 4.8. 


Describe the surface defined by | <r < 2,0° <0 < 90°. 


Here r varies from | to 2 and 6 varies from 0° to 90°. Figure 4.18 illustrates the surface 


so formed. 


AtP, r=1,6=0°; atQ,r=2,60=0°; atR, r=1,6=90°; atS,r=2,0=90°. 


We have seen some simple polar curves in Figures 4.16 and 4.17. In general a polar 
curve is given by the equation r = f (0), where the radius r varies with the angle 0. 


Engineering application 4.3 


Two-dimensional antenna radiation pattern 


Polar curves are often used to depict radiation patterns from antennas. It is often the 
case that the electric field strength at a fixed distance from an antenna such as shown 
in Figure 4.19 depends upon the angle 0. 

A typical expression for field strength at a particular angle @ could be 


cos(> cos 0) 


sind 
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Figure 4.19 
Electric field strength at a fixed distance from the 
antenna depends upon the angle 0. 


By considering 


cos(4 cos 0) 
=| —<—————— 


sin 6 
we can use a polar curve to depict the field strength at any angle. Graph-plotting 
software is available for producing polar plots such as the one shown in Figure 4.20, 
which shows a typical angle, @, and its associated radius r. 

The radius of the plot for a particular value of 0 represents the transmitted field 
strength and so the antenna in Figure 4.20 has no radiation in the horizontal direction 
and maximum radiation in the vertical direction. 


Figure 4.20 
The transmitted field strength for the antenna, 
r, varies with the angular position, 0. 


EXERCISES 4.5 


1. Describe the curve defined by r = 2,0° < 8 < 90°. 2 Describe the surface defined by 0 < r < 2, 
30° <6 < 45°. 
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Solutions 
This is a quarter circle of radius 2 as shown in 2 Figure S.9 illustrates the surface. OP is set at 30° to 
Figure S.8. the x axis; OQ is at 45° to the x axis. OP = OQ = 2. 
y y Q 
P 

O x O x 

Figure S.8 Figure S.9 
ve ~=CYLINDRICAL POLAR COORDINATES 


Consider the problem of studying the flow of water around a cylinder. A problem like 
this would be studied by engineers when investigating the forces exerted by the sea on 
the cylindrical supports of oil-rigs. It is often mathematically convenient to choose a 
coordinate system that fits the shape of the object being described. It makes sense here 
to select a cylindrical coordinate system. 

Cylindrical polar coordinates comprise polar coordinates with the addition of a 
vertical, or z, axis. Figure 4.21 illustrates a typical point, P, and its cylindrical polar 
coordinates. 

The point Q is in the x-y plane and lies directly below P. Q is the projection of P 
onto the x—y plane. 

Consider a point P in three-dimensional space, with Cartesian coordinates (x, y, z). 
We can also describe the position of P using cylindrical polar coordinates. To do this, 
the x and y coordinates are expressed as their equivalent polar coordinates, while the 
z coordinate remains unaltered. Hence the cylindrical polar coordinates of a point have 
the form (7, 6, z). 

Recall that r is the length of the arm OQ (see Figure 4.21); that is, it is the distance 
of a point in the x—y plane from the origin, and so r > 0. The angle 6 is measured from 
the positive x axis to the arm OQ and so @ has values between 0° and 360° or 271 radians. 
Finally, z is positive for points above the x—-y plane and negative for points below the 
x—y plane. In summary 


r>0, 0< 6 < 2n, -—00 <z< © 


Figure 4.21 
Q The cylindrical polar coordinates of P are (7, 0, z). 
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We can relate the Cartesian coordinates, (x, y, z), to the cylindrical polar coordinates, 
(7, 8, z). The following key point does this. 


i COSHG, a) 
y=r sind O<@o <= Mi 
LS % 


Engineering application 4.4 


Fluid flow along a pipe 


Cylindrical polar coordinates provide a convenient framework for analysing liquid 
flow down a pipe. The radial symmetry of a pipe makes it the natural choice. The 
distance along the pipe is defined using z. In order to utilize the radial symmetry of 
the pipe it is necessary to align the z axis with the centre axis of the pipe. Figure 4.22 
illustrates the arrangement. Distance from the centre of the pipe is defined by r. The 
angle 0 is used in conjunction with z and r to fix the position within the pipe. A typical 
problem that may be analysed is the variation in fluid velocity with distance from the 
centre of the pipe. For smooth flow, liquid tends to travel faster at the centre of a pipe 
than it does near the edge. 


Figure 4.22 
Fluid flow along a pipe. 


Pipes with metal walls are often used to guide electromagnetic waves, rather 
than fluids, in high-powered microwave communications systems. They are termed 
waveguides. Mathematically analysing the waveguide’s propagation modes is made 
much simpler by using cylindrical polar coordinates. 


Example 4.9 The Cartesian coordinates of P are (4,7, —6). State the cylindrical polar coordinates 
of P. 


Solution We have 
x= 4, y=7, z=-6 
Using x = 4 and y = 7 the values of r and @ are found to be r = 8.0623, 6 = 60.26° 


(see Example 4.6). The z coordinate remains unchanged. Hence the cylindrical polar 
coordinates are (8.0623, 60.26°, —6). 


168 


Chapter 4 Coordinate systems 


Example 4.10 Describe the figure defined by 


Solution 


fa) <r < 2,6 S60".1 ee <1 
(b) r=1,0° <6 < 90°,0<z<2 


t > 
P! : y 
x NY 
ge 
Cy i 
— : 
Figure 4.23 Figure 4.24 
AtPRr=1l;atQr=2. On the line AB, z = 1; on the line CD, z = —1. 


(a) Consider the r and 6 coordinates first. The r coordinate varies from 1 to 2 while 0 
is fixed at 60°. This represents the line PQ as shown in Figure 4.23. At P the value 
of ris 1; at Q the value of r is 2. The length of PQ is | and it is inclined at 60° to 


the x axis. 
Now, we note that z varies from —1 to 1. We imagine the line PQ moving in the 
z direction from z = —1 to z = 1. This movement sweeps out a plane. Figure 4.24 


illustrates this. 


(b) The r coordinate is fixed at r = 1. The @ coordinate varies from 0° to 90°. This 
produces the quarter circle, AB, as shown in Figure 4.25. At A,r = 1, 8 = 0°; at 
Br = 1;0-= 90". 


y 
B 
O A x 
Figure 4.25 Figure 4.26 
As @ varies from 0° to 90°, a quarter As z varies from 0 to 2, the curve AB 


circle is swept out. sweeps out the curved surface. 
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Examining the z coordinate, we see that z varies from 0 to 2. As z varies from 
0 to 2, we imagine the curve AB sweeping out the curved surface as shown in Fig- 
ure 4.26. AtC,r = 1,0 = 0°,z = 2; at D, r= 1, 0 = 90°, z = 2. This surface is 
part of a cylinder. 

If the range of values of a coordinate is not given it is understood that that variable 
varies across all its possible values. For example, a curve may be described by r = 1, 
Zz = —2. Here there is no mention of the values that 6 can have. It is assumed that 6 
can have its full range of values, that is 0° to 360°. 


Engineering application 4.5 


Helical antennas 


The helix is a shape commonly found in engineering. For example, the springs used 
in a car’s suspension often have a helical shape. Helical antennas were invented by 
John Kraus in the 1940s and since then have been used extensively in a variety of 
applications including space exploration, satellite communications and mobile tele- 
phony. Developing a mathematical definition of a helix is essential to analysing its 
electromagnetic properties. 

We can set up a cylindrical polar coordinate system with the z axis aligned with the 
axis of the helix as shown in Figure 4.27. If we were to look at the helix along the 
direction of the z axis, all we would see would be a circle. We say that the projection of 
the helix onto the x—y plane is a circle. 


4a 


Figure 4.27 
x Helix along the z axis. 


Suppose a particular helix can be defined parametrically by the Cartesian equa- 
tions 
MO) =J3CS2, MO) SIs, ao)=F 
where ¢ is varied over a particular range in order to generate the finite length helix 


required. By specifying a particular value of t these equations enable us to calculate 
particular values of x, y and z corresponding to a point on the helix. 
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We now develop the equation of the helix in cylindrical polar coordinates. By 
comparing 


x=3cos2t, y=3sin2t with x=rcosé, y=rsind 


(see Equations (4.1) and (4.2)), we have r = 3 and @ = 2r. Note that r = 3 is the 
equation of a circle, radius 3, centre the origin. 

So, the projection of the helix onto the x—y plane is a circle of radius 3. Because 
z(t) = t, the value of z increases as the parameter f increases and the helix is traced 
out. 

We can now state an alternative definition of the helix in terms of cylindrical polar 
coordinates: 


r(t)=3, O@)=2, zt)=t 


As in the Cartesian case, specifying a value of the parameter ¢ enables us to calculate 
particular values of r, 9 and z corresponding to a point on the helix. This provides a 
more elegant definition of the helix than that available using Cartesian coordinates. 
Many problems require the use of these alternative coordinate systems in order to 
simplify analysis. 


EXERCISES 4.6 
Express the following Cartesian coordinates as 3 Describe the surface defined by 
cylindrical polar coordinates. we 2=0 
(a) (-2,-1,4) (b) (0,3,-1) (©) (-4,5, 0) (b) z=-1 
Express the following cylindrical polar coordinates as (c) r=2,z=1 
Cartesian coordinates. (d) 6=90°,z=3 
(a) (3,70°,7) (b) (1,200°,6) (c) (5, 180°, 0) (ec) r=2,0<z<4 
Solutions 
(a) (/5, 206.57°, 4) 3 (a) the x-y plane 
(b) (3, 90°, —1) (b) a plane parallel to the x—-y plane and | unit 
below it 
(c) (V41, 128.66°, 0) ; ; 
(c) acircle, radius 2, parallel to the x—-y plane and 
(a) (1.0261, 2.8191, 7) with centre at (0, 0, 1) 
(b) (—0.9397, —0.3420, 6) (d) a line 3 units above the positive y axis and 
parallel to it 
(c) (—5, 0, 0) 


(e) the curved surface of a cylinder, radius 2, height 4 


“wm SPHERICAL POLAR COORDINATES 


When problems involve spheres, for example modelling the flow of oil around a ball 
bearing, it may be useful to use spherical polar coordinates. The position of a point is 
given by three coordinates, (R, 6, @). These are illustrated in Figure 4.28. 


4.7 Spherical polar coordinates 171 


Figure 4.28 
Spherical polar coordinates are (R, 0, ¢). 


Consider a typical point, P. We look at each of the three coordinates in turn. 

The value of R is the distance of the point from the origin; that is, R is the length of 
OP. Note that R > 0. 

Let Q be the projection of P onto the x—y plane. Then @ is the angle between the 
positive x axis and OQ. Thus, 6 has the same definition as for polar and cylindrical 
polar coordinates. Note that @ can have any value from 0° to 360°. 

Consider the line OP. Then ¢ is the angle between the positive z axis and OP. The 
angle @ can have values between 0° and 180°. When P is above the x—y plane, then ¢ 
lies between 0° and 90°; when P lies below the x—y plane, then ¢ is between 90° and 
180°. When ¢@ = 0°, then P is on the positive z axis; when ¢ = 90°, P lies in the x-y 
plane; when ¢ = 180°, P lies on the negative z axis. 

We can determine equations which relate the Cartesian coordinates, (x, y, z), and the 
spherical polar coordinates, (R, 6, @). 

Note that some books describe spherical polar coordinates as (R, ¢, @), that is the 
definitions of 6 and ¢ are interchanged. Be aware of this when reading other texts. 

Consider AOPQ. Note that 2 OQP is a right angle and so 


OQ = OPsing = Rsing 
OQ lies in the x—y plane and so 


x = OQ cosé = Rsingcosé 
y = OQ sind = Rsing sind 


We also note that 
z= OP cos¢ = Rcosd 
In summary we have 
x = Rsingcosé 
y = Rsing sind 
z=Rcosd 
and 


R20, 0<¢< 7(180°) 0 < @ < 27(360°) 
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Example 4.11 Show that 


R=Vrtyt+2 


Solution From Figure 4.28 
e+y=0e 
From AOPQ 
OP* = OQ + PQ 
But OP = R and PQ =z, so 
Ra=vty+e 
and so 


R=Vrtyt+e2 


Example 4.12 Describe the surface R = 4. 


Solution We have R = 4 and 6 and ¢ can vary across their full range of values. Such points 
generate a sphere of radius 4, centred on the origin. 


Engineering application 4.6 


Three-dimensional radiation pattern of a half-wave dipole 


One of the simplest types of practical antenna is the half-wave dipole. This consists 
of two conductor elements stretched out along a straight line having a combined 
length of approximately half the wavelength at the frequency of the a.c. signal that is 
to be transmitted. The signal is applied to the antenna at the centre of the arrangement 
by a feed cable. The electric field strength produced by the antenna at a fixed distance 
is usually expressed using a spherical coordinate system. The coordinates for the 
antenna and the origin of the radiation itself are assumed to be located at the antenna 
feed point and the electric field strength is represented by the radius, R. Plots produced 
like this are in general termed radiation patterns and are a useful way of visualizing 
the amount of radiated field in a given direction, (0, @), for a particular antenna. 
The half-wave dipole pattern is described by the equation 


cos (5 cos ) 
sing 


IR Shik 


where R represents the electric field strength and K is a constant for a given distance 
from the antenna centre point. The equation for this simple antenna does not involve 
0, which indicates that R does not depend on it, hence there is radial symmetry to the 
pattern. This function is plotted in Figure 4.29. 
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Feeding lines 
attached to 
an a.c source 
Antenna 
elements 3D radiation 
pattern 
Figure 4.29 
The half-wave dipole antenna and its radiation pattern in spherical polar coordinates. 
EXERCISES 4.7 
1. A point has spherical polar coordinates (3, 40°, 70°). 3 Describe the surface R = 1,0° < 6 < 360°, 
Determine the Cartesian coordinates. 0° <¢@< 90°. 
2 A point has Cartesian coordinates (1, 2, 3). Determine 
the spherical polar coordinates. 
Solutions 
1 (2.1595, 1.8121, 1.0261) 3 A hemisphere of radius 1. The flat side is on the x—y 


plane. 


2 R=3.7417, 6 = 63.43°, d = 36.70° 


Technical Computing Exercises 4.7 


Use a technical computing language such as 
MATLAB® to produce a plot similar to Engineering 
application 4.3. You may find it helpful to first generate 


a set of numbers O < t < 271 and then use a built in 


graph of the equation. 


function such as polar or polarplot to generate a 


REVIEW EXERCISES 4 
1 Phas Cartesian coordinates (6, —3, —2). Calculate 4 Describe the surface defined by 
the distance of P from the origin. l<r<4,0° <6 < 90°. 
2 Phas Cartesian coordinates (—4, —3). Calculate the 5 Calculate the cylindrical polar coordinates of a point 


polar coordinates of P. 


with Cartesian coordinates (—1, 4, 1). 


3 Phas polar coordinates (5, 240°). Calculate the 6 Describe the surface x = 0 in three dimensions. 
Cartesian coordinates of P. 
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A point has Cartesian coordinates (—1, —1, 2). 


Calculate the spherical polar coordinates of the point. 


Describe the surface R = 2, 0° < 6 < 180°, 
0° <¢@ < 180°. 


Solutions 


7 
5, 216.87° 
—2.5, —4.3301 


Figure S.10 illustrates the surface. 


Figure S.10 


10 


10 


Describe the three-dimensional surface defined 
byx=y. 


The sphere defined by R = 2 intersects the plane 
defined by z = 1. Describe the curve of intersection. 


(4.1231, 104.04°, 1) 
y-z plane 
R= V6, 6 = 225°, o = 35.26° 


A hemisphere of radius 2. The flat surface is on the 
x-z plane. 


The surface is a plane generated by moving the line 
y =x up and down the z axis. 


A circle of radius V3, centre (0, 0, 1), in the plane 
g= ds 
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INTRODUCTION 


The term discrete is used to describe a growing number of modern branches of mathe- 
matics involving topics such as set theory, logic, Boolean algebra, difference equations 
and z transforms. These topics are particularly relevant to the needs of electrical and 
electronic engineers. Set theory provides us with a language for precisely specifying a 
great deal of mathematical work. In recent years this language has become particularly 
important as more and more emphasis has been placed upon verification of software. 
Boolean algebra finds its main use in the design of digital electronic circuits. Given 
that a very large proportion of electronic circuits are digital rather than analogue, this is 
an important area of study. Digital electronic circuits confine themselves to two effective 
voltage levels rather than the range of voltage levels used by analogue electronic circuits. 
These make them easier to design and manufacture as tolerances are not so critical. Dig- 
ital circuits are becoming more complex each year and one of the few ways of dealing 
with this complexity is to use mathematics. One of the likely trends for the future is that 
more and more circuit designs will be proved to be correct using mathematics before they 
are implemented. Difference equations and z transforms are of increasing importance 
in fields such as digital control and digital signal processing. We shall deal with these 
in Chapter 22. 


SET THEORY 


A set is any collection of objects, things or states. 
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The objects may be numbers, letters, days of the week, or, in fact, anything under discus- 
sion. One way of describing a set is to list the whole collection of members or elements 


and enclose them in braces { }. Consider the following examples. 


A= {1,0} the set of binary digits, one and zero 


B= {off, on} the set of possible states of a two-state system 
C = {high, low} the set of effective voltage levels in a digital electronic 
circuit 


D= ({0, 1, 2,3, 4,5, 6, 7, 8, 9} the set of digits used in the decimal system 


Notice that we usually use a capital letter to represent a set. To state that a particular 
object belongs to a particular set we use the symbol € which means ‘is a member of’. 
So, for example, we can write 


off € B 3€D 
Likewise, ¢ means ‘is not a member of’ so that 
low ¢ B S¢A 


are sensible statements. 

Listing members of a set is fine when there are relatively few but is useless if we are 
dealing with very large sets. Clearly, we could not possibly write down all the members 
of the set of whole numbers because there are an infinite number. To assist us special 
symbols have been introduced to stand for some commonly used sets. These are 


N the set of non-negative whole numbers, 0, 1, 2,3,... 
Nt the set of positive whole numbers, 1, 2, 3,... 

Z the set of whole numbers, positive, negative and zero, 
...—3, -2,-1,0,1,2,3... 

R_ the set of all real numbers 

Rt the set of positive real numbers 

IR~ the set of negative real numbers 

Q the set of rational numbers 


Note that a real number is any number in the interval (—oo, oo). 
Another way of defining a set is to give a rule by which all members can be found. 
Consider the following notation: 


A= {x:x e€Randx < 2} 


This reads ‘A is the set of values of x such that x is a member of the set of real numbers 
and x is less than 2’. Thus A corresponds to the interval (—oo, 2). Using this notation 
other sets can be defined. 

Note that 


Rt = {x:x € Randx > 0} 
R” = {x: x € Randx < 0} 


Example 5.1 


Solution 


5.2.1 


5.2.2 


Example 5.2 
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Use set notation to describe the intervals on the x axis given by 
(a) [0,2] (bv) (0,2) ~=—©) [-9, 9] 


(a) {x:xeRand0O < x < 2} 
(b) {x:xe€RandO < x < 2} 
(c) {x:xeRand-9 < x < 9} 


Sometimes we shall be content to use an English description of a set of objects, such as 


M is the set of capacitors made by machine M 
N is the set of capacitors made by machine N 


Q is the set of faulty capacitors 


Equal sets 


Two sets are said to be equal if they contain exactly the same members. For example, the 
sets {9, 5, 2} and {5, 9, 2} are identical. The order in which we write down the members 
is immaterial. The sets {2, 2,5, 9} and {2, 5, 9} are equal since repetition of elements is 
ignored. 


Venn diagrams 


Venn diagrams provide a graphical way of picturing sets which often aids understanding. 
The sets are drawn as regions, usually circles, from which various properties can be 
observed. 


Suppose we are interested in discussing the set of positive whole numbers between | 
and 10. LetA = {2, 3,4, 5} and B = {1, 3,5, 7, 9}. The Venn diagram representing these 
sets is shown in Figure 5.1. The set containing all the numbers of interest is called the 
universal set, EZ. Eis represented by the rectangular region. Sets A and B are represented 
by the interiors of the circles and it is evident that 2, 3, 4 and 5 are members of A while 1, 
3, 5, 7 and 9 are members of B. It is also clear that 6 ¢ A,6 ¢ B,8 ¢ A, 8 ¢ B. The 
elements 3 and 5 are common to both sets. 


E 


Figure 5.1 
Venn diagram for Example 5.2. 


The set containing all the members of interest is called the universal set E. 


It is useful to ask whether two or more sets have elements in common. This leads to the 
following definition. 
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5.2.3 


5.2.4 


5.2.5 


Example 5.3 


5.2.6 


Example 5.4 


Solution 


Intersection 


Given sets A and B, a new set which contains the elements common to both A and B is 
called the intersection of A and B, written as 


ANB={x:xeAandx e B} 


In Example 5.2, we see that AN B = {3, 5}, thatis3 e AN Band5 € ANB. If the set 
AQB has no elements we say the sets A and B are disjoint and write AM B = ¢, where 
% denotes the empty set. 


A set with no elements is called an empty set and is denoted by ¢. 
If AN B= G, then A and B are disjoint sets. 


Union 


Given two sets A and B, the set which contains all the elements of A and those of B is 
called the union of A and B, written as 


AUB= {x:xeAorx € B or both} 


In Example 5.2, AUB = {1, 2,3, 4,5, 7, 9}. We note that although the elements 3 and 5 
are common to both sets they are listed only once. 


Subsets 


If all the members of a set A are also members of a set B we say A is a subset of B and 
write A C B. We have already met a number of subsets. Convince yourself that 


NcZandZcR 


If M represents the set of all capacitors manufactured by machine M, and M, represents 
the faulty capacitors made by machine M, then clearly M,; C M. 


Complement 


If we are given a well-defined universal set E and a set A with A C E, then the set of 
members of E that are not in A is called the complement of A and is written as A. Clearly 
AUA =E. There are no members in the set ANA, that is ANA = &. 


A company has a number of machines which manufacture thyristors. We consider only 
two machines, M and N. A small proportion made by each is faulty. Denoting the sets 
of faulty thyristors made by M and N by M, and N,, respectively, depict this situation on 
a Venn diagram. Describe the sets M, UN; and M UN. 


Let E be the universal set of all thyristors manufactured by the company. The Venn 
diagram is shown in Figure 5.2. Note in particular that MM N = ¢. There can be no 
thyristors in the intersection since if a thyristor is made by machine M it cannot be made 


5.2.7 


5.2.8 
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M N 


Figure 5.2 
Venn diagram for Example 5.4. 


by machine N and vice versa. Thus M and WN are disjoint sets. Also note M, C M, 
N, C N. The set M,; UN; is the set of faulty thyristors manufactured by either machine M 
or N. The set M U N is the set of thyristors made by machines other than M or N. 


We have seen how the operations M, U can be used to define new sets. It is not difficult 
to show that a number of laws hold, most of which are obvious from the inspection of 
an appropriate Venn diagram. 


Laws of set algebra 


For any sets A, B, C and a universal set E, we have the laws in Table 5.1. From these it 
is possible to prove the laws given in Table 5.2. 


Table 5.1 
The laws of set algebra. 


UB=BUA 
NB=BNA 
U (BUC) = (AUB)UC 
N(BNC)=(ANB)NC 
N (BUC) = (ANB) U(ANC) 
(BNC) = (AUB)N (AUC) 


Commutative laws 
Associative laws 


Distributive laws 


S(> > Se BPS SB BEB 
C 


UP=A . 

AE=A Identity laws 
UA=E 

NA= g Complement laws 
=A 


Table 5.2 
Laws derivable from Table 5.1. 


AU(ANB)=A 


AN (AUB) =| Absorption laws 


(ANB)U(ANB)=A 


(AUB) N(AUB) =A Minimization laws 
AUB=ANB 
ANB=AUB 


De Morgan’s laws 


Sets and functions 


If we are given two sets, A and B, a useful exercise is to examine relationships, given by 
rules, between the elements of A and the elements of B. For example, if A = {0, 1, 4, 9} 
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r:A—+B 


‘take plus or minus the square root of” s:D—+E 


Figure 5.3 
A relation between sets A and B. The relation s maps elements of D to E. 


Example 5.5 


Solution 


s:m—+~3m+1 


Figure 5.4 


and B = {—3, —2, —1, 0, 1, 2, 3} then each element of B is plus or minus the square root 
of some element of A. We can depict this as in Figure 5.3. 

The rule, which, when given an element of A, produces an element of B, is called a 
relation. If the rule of the relation is given the symbol r we write 


r:A->B 


and say ‘the relation r maps elements of the set A to elements of the set B’. For the 
example above, we can writer: 1 > +1,r:4— +2, and generally r: x > +,/x. 
The set from which we choose our input is called the domain; the set to which we map 
is called the co-domain; the subset of the co-domain actually used is called the range. 
As we shall see this need not be the whole of the co-domain. 


A relation r maps elements of a set D, called the domain, to one or more elements 
of a set C, called the co-domain. We write 


iP 2ID => 


If D = {0, 1, 2, 3,4, 5} and E = {1, 4, 7, 10, 13, 16, 19, 22} and the relation with sym- 
bol s is defined by s: D > E,s :m — 3m +1, identify the domain and co-domain of 
s. Draw a mapping diagram to illustrate the relation. What is the range of s? 


The domain of s is the set of values from which we choose our input, that is D = {0, 1, 2, 
3,4, 5}. The co-domain of s is the set to which we map, that is E = {1, 4, 7, 10, 13, 16, 
19, 22}. The rule s : m — 3m+1 enables us to draw the mapping diagram. For example, 
s:3— > 10 and so on. The diagram is shown in Figure 5.4. The range of s is the subset 
of E actually used, in this case {1, 4, 7, 10, 13, 16}. We note that not all the elements of 
the co-domain are actually used. 


The notation introduced is very similar to that for functions described in Section 2.3. 
This is no accident. In fact, a function is a very special form of a relation. Let us recall 
the definition of a function: 


‘A function is a rule which when given an input produces a single output’. 


If we study the two relations r and s, we note that when relation r received input, it 
could produce two outputs. On the mapping diagram this shows up as two arrows leaving 


1 


5.2 Settheory 181 


some elements in A. When relation s received an input, it produced a single output. This 
shows up as a single arrow leaving each element in D. Hence the relation r is not a 
function, whereas the relation s is. This leads to the following more rigorous definition 
of a function. 


A function f is a relation which maps each element of a set D, called the 
domain, to a single element of a set C, called the co-domain. We write 


f:D>-C 


Example 5.6 If M = {off, on}, N = {0, 1} and we define a relation r by 
r:M>N 
r: off > 0 r: on>1 
then the relation r is a function since each element in M is mapped to a single element 
in N. 
Example 5.7 If P = {0, 1} and Q = {high} and we define a relation r by 
r:P>-@Q 
r:1— high 
then r is not a function since each element in P is not mapped to an element in Q. 
All of the functions described in Chapter 2 have domains which are subsets of the real 
numbers R. The input to each function is the particular value of the independent variable 
chosen from the domain and the output is the value of the dependent variable. When 
dealing with continuous domains the graphs we have already considered replace the 
mapping diagrams. 
Example 5.8 Find the domain, D, of the rational function f : D > R given by 
3x 
fix-> ae, 

Solution Since no domain is given, we choose it to be the largest set possible. This is the set 
of all real numbers except the value x = 2 at which point f is not defined. We have 
D={x:xER,x $2}. 

EXERCISES 5.2 
Use set notation to describe the intervals on the x axis 2 Sketch the following sets. [Hint: see Section 2.2 on 
given by open and closed intervals. ] 
(a) (—3, 2) (b) [0, 2] (c) [—2, -1) (a) {x: xe Rand2<x< 4} 


(d) (3, 6] (e) |x| <1 (b) {x:x € Rand-1 <x <0} 
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(c) {x:xeE 
(d) {x:xeE 


R and 0 < x < 2} 
R and 1 < x < 3} 


Using the definitions given in Section 5.2, state 
whether each of the following is true or false: 


(a) 7EZ (b) R7-NQ=¢ 

(c) 0.7¢€Q (d) N‘'UQ=Rt 

(:) R-ON=¢$ (ff) RNZ=Z 

(g) 5€Q (h) NCQ 

IfA = {1, 3,5, 7,9, 11} and B = {3, 4, 5, 6} find 
(a) ANB (b) AUB 

Given A = {1, 2, 3,4, 5, 6}, B = {2, 4, 6, 8, 10} and 


C = {3, 6, 9} state the elements of each of the 
following: 


(a) ANB 
(c) ANC 
(ec) AN(BUC) 


(b) BNC 
(d) ANBNC 
(f) BU(ANC) 


Write out all the members of the following sets: 


(a) A= {x:x Ee Nandx < 10} 

(b) B= {x:x € Rand0 <x < 10 and xis divisible 
by 3} 

The sets A, B and C are given by A = {1, 3,5, 7, 9}, 

B= {0, 2, 4, 6} and C = {1, 5, 9} and the universal 

set; E:= {0, 1,2, 04.4 9}. 


Solutions 


(a) {x: 
(b) {x: 
(c) {x:x 
(d) {x:xeE 
(e) {x:xeE 


See Figure S.11. 


x € Rand —3 <x < 2} 


Rand0 <x < 2} 


xE 


Rand —2 <x < —l} 


R and 3 <x < 6} 


R and —1 <x < I} 


-o—o—__ 


0123 45 x ( -10 1 x 


Figure S.11 


(a) T 
(e) T 


(b) F 
(f) T 


(c) F 
(g) T 


(b) {1,3,4,5, 6,7, 9, 11} 


(d) F 
(h) T 


(a) {3,5} 


10 


11 


(a) Represent the sets on a Venn diagram. 
(b) State AU B. 
(c) State BNC. 
(d) State ENC. 
(e) State A. 
(f) State BNC. 


(g) State BUC. 


Use Venn diagrams to illustrate the following for 
general sets C and D: 


(a) CND ~~ (b) CUD 
(d) CUD (e) CND. 


(c) CND 


By drawing Venn diagrams verify De Morgan’s laws 
ANB=AUB- and AUB=ANB 

For sets A = {0, 1, 2} and B = {3, 4}, draw a mapping 
diagram to illustrate the following relations. 
Determine which relations are functions. For those 
that are not functions, give reasons for your decision. 


(a) r:A>B,r:0373,r:174,r:27>4 


(b) s:A> B,s:0>53,5:0>54,5:1— 3, 
gs:2>3 


(c) t:A—>B,t:0 


>3,t:1734 


IfA = {1,3,5, 7} and B = {1, 2,3, 4}, drawa 
mapping diagram to illustrate the relation r: A > B, 
where r is the relation ‘is bigger than’. Is r a function? 


(a) {2,4,6} — (b) {6} — () {3,6} Gd) {6} 
(e) {2,3,4, 6} (f) {2, 3, 4, 6, 8, 10} 

(a) {0,1, 2,3, 4,5, 6, 7, 8, 9} 

(b) {0, 3, 6, 9} 


(a) See Figure S.12. 


Figure S.12 


(b) {0, 1, 2,3, 4,5, 6, 7, 9} (c) D 
(d) {1,5,9} (e) {0, 2, 4, 6, 8} 
(f) {3, 7, 8} (g) {3, 7,8} 


8 See Figure S.13. 
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C >|] CAN 
GD" (LC 


tek 


Figure S.13 


(e) 


11 ris nota function because | is not mapped to 
anything. 


(b) Not a function since 0 is mapped to two elements 


(c) Not a function since 2 is not mapped to anything 


5.3.1 


LOGIC 


In Section 5.4 we will examine Boolean algebra. This concerns itself with the manip- 
ulation of logic statements and so is suitable for analysing digital logic circuits. In this 
section we introduce the basic concepts of logic by means of logic gates as these form 
the usual starting point for engineers studying this topic. 


The OR gate 


The OR gate is an electronic device which receives two inputs each in the form of a 
binary digit, that is 0 or 1, and produces a binary digit as output, depending upon the 
values of the two inputs. It is represented by the symbol shown in Figure 5.5. 

A and B are the two inputs, and F is the single output. As high (1) or low (0) voltages 
are applied to A and B various possible outputs are achieved, these being defined by 
means of a truth table as shown in Table 5.3. So, for example, if a low (0) voltage is 
applied to A and a high (1) voltage is applied to B, the output is a high (1) voltage at 
F. We note that a ‘1’ appears in the right-hand column of the truth table whenever A 
or B takes the value 1, hence the name OR gate. We use the symbol + to represent OR. 
Because it connects the variables A and B, OR is known as a logical connective. We shall 
meet other logical connectives shortly. This connective is also known as a disjunction, 
so that A + B is said to be the disjunction of A and B. 


Table 5.3 
The truth table for an OR gate 
with inputs A and B. 


A B F=A+B 


A 
F=A+B 1 1 1 
B 1 0 1 
0 1 1 
Figure 5.5 0 0 0 


Symbol for an OR gate. 
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5.3.2 


5.3.3 


5.3.4 


Table 5.4 
The truth table for an AND gate 
with inputs A and B. 


A B F=A-B 


A 
F=A:B ! | i 
B 1 0 0 
0 1 0 
Figure 5.6 0 0 0 
Symbol for an AND gate. 
Table 5.5 
The truth table for 
a NOT gate. 
A —>— F=A A F=A 
1 0 
Figure 5.7 0 I 


Symbol for an inverter. 


The AND gate 


It is possible to construct another electronic device called an AND gate which works 
in a similar way except that the output only takes the value 1 when both inputs are 1. 
The symbol for this gate is shown in Figure 5.6 and the complete truth table is shown in 
Table 5.4. The logical connective AND is given the symbol - and is known as a con- 
junction so that A-B is said to be the conjunction of A and B. 


The inverter or NOT gate 


The inverter is a device with one input and one output and has the symbol shown in 
Figure 5.7. It has a truth table defined by Table 5.5. If the input is A, then the output is 
represented by the symbol A, known as the complement of A. 


The NOR gate 


This gate is logically equivalent to a NOT gate in series with an OR gate as shown in 
Figure 5.8. It is represented by the symbol shown in Figure 5.9 and has its truth table 
defined in Table 5.6. 


A A+B 
B F=A+B 
: Table 5.6 
Figure 5.8 The truth table for a NOR gate. 
A NOT gate in series with an OR gate. — 
A B F=A+B 
A 
F=A+B 1 1 0 
p 1 0 0 
. 0 1 0 
Figure 5.9 0 0 1 


Symbol for a NOR gate. 
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Table 5.7 
Figure 5.10 The truth table for a NAND gate. 
A NOT gate in series with an AND gate. 


A B F=A-B 
. F=A°B 1 1 0 
e 1 0 I 
, 0 1 1 
Figure 5.11 0 0 fl 
Symbol for a NAND gate. 
The NAND gate 


This gate is logically equivalent to a NOT gate in series with an AND gate as shown in 
Figure 5.10. It is represented by the symbol shown in Figure 5.11 and has the truth table 
defined by Table 5.7. 

Although we have only examined gates with two inputs it is possible for a gate to have 
more than two. For example, the Boolean expression for a four-input NAND gate would 
be F = A-B-C-D while that of a four-input OR gate would be F =A+B+C+D, 
where A, B, C and D are the inputs, and F is the output. Logic gates form the building 
blocks for more complicated digital electronic circuits. 


BOOLEAN ALGEBRA 


Suppose A and B are binary digits, that is 1 or 0. These, together with the logical connec- 
tives + and - and also the complement NOT, form what is known as a Boolean algebra. 
The quantities A and B are known as Boolean variables. Expressions such as A + B, 
A+ Band A are known as Boolean expressions. More complex Boolean expressions can 
be built up using more Boolean variables together with combinations of +, - and NOT; 
for example, we can draw up a truth table for expressions such as (A+ B) + (C+ D). 

We shall restrict our attention to the logic gates described in the last section although 
the techniques of Boolean algebra are more widely applicable. A Boolean variable can 
only take the values 0 or |. For our purposes a Boolean algebra is a set of Boolean 
variables with the two operations - and +, together with the operation of taking the 
complement, for which certain laws hold. 


Laws of Boolean algebra 


For any Boolean variables A, B, C, we have the laws in Table 5.8. From these it is possible 
to prove the laws given in Table 5.9. You will notice that these laws are analogous to those 
of set algebra if we interpret + as U, - as M, | as the universal set E, and 0 as the empty 
set %. In ordinary algebra, multiplication takes precedence over addition. In Boolean 
algebra - takes precedence over +. So, for example, we can write the first absorption 
law without brackets, that is 


A+A-B=A 


Similarly, the first minimization law becomes 
A:B+A-B=A 


We shall follow this rule of precedence from now on. 
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Example 5.9 


Solution 


Table 5.8 

Laws of Boolean algebra. 
A+B=B+A ‘ 
A-B=B-A Commutative laws 


A+ (B+C)=(A+B)+C 


A-(B:C) =(A-B)-C 


A-(B+C) =(A-B)+(A-C) 
A+(B-C) =(A+B)-(A+C) 


A+0=A ‘ 

eres | Identity laws 
A+A=1 

A-A=0 Complement laws 
A=A 

Table 5.9 


Associative laws 


Distributive laws 


Laws derived from the laws of Table 5.8. 


A+ (A+B) =A 
A:(A+B)=A 


(A-B)+(A-B)=A 


(A+B)-(A+B)=A 


Absorption laws 


Minimization laws 


a te 5 De Morgan’s laws 


Find the truth table for the Boolean expression A + B-C. 


We construct the table by noting that A, B and C are Boolean variables; that is, they can 
take the values 0 or 1. The first stage in the process is to form all possible combinations of 
A, B and C, as shown in Table 5.10. Then we complete the table by forming C, then B-C 
and finally A + B-C, using the truth tables defined earlier. So, for example, whenever 
C = 1,C = 0. The complete process is shown in Table 5.11. Work through the table to 
ensure you understand how it was constructed. 


Table 5.10 
The possible 
combinations for three 


variables, A, B and C. 


A B 


oococOorFR KR ee 
ocorr COOF Fe 


Q 


Ororeorcr 


Table 5.11 
The truth table for A + B-C. 


A B C 


& 
al 
> 
+ 
& 
al 


rPoOorOorOrFrSO [=| 


ooo OR FP Ree 
OoOOoOrr CO COrFR Fe 
OoOorororcr 
oorocqc”noro 
oOoOrF OR Re ee 
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Table 5.12 
Truth table for (A + B)-(A+C). 


A B Cc Cc A+B A+C (A+ B)-(A+C) 
1 1 1 0 1 1 1 
1 1 0 1 1 1 1 
1 0 1 0 1 1 1 
1 0 0 1 1 1 1 
0 1 1 0 1 0 0 
0 1 0 1 1 1 1 
0 0 1 0 0 0 0 
0 0 0 1 0 1 0 


5.4.2 Logical equivalence 


We know from the distributive laws of Boolean algebra that 
A+(B-C) = (A+B)-(A+C) 


Let us construct the truth table for the r.h.s. of this expression (Table 5.12). If we now 
observe the final column of Table 5.12 we see it is the same as that of Table 5.11. We say 
that A + (B- C) is logically equivalent to (A +B) - (A+C). Figures 5.12 and 5.13 show 
the two ways in which these logically equivalent circuits could be constructed using OR 
gates, AND gates and inverters. Clearly different electronic circuits can be constructed 
to perform the same logical task. We shall shortly explore a way of simplifying circuits 
to reduce the number of components required. 


A+(B:C) 


Figure 5.12 
Circuit to implement A + (B-C). 


A A+B 


B — 
(A + B)*(A+C) 
A 


al 
| 


C A+C 


Figure 5.13 
Circuit to implement (A + B)-(A+C). 


Engineering application 5.1 


Boolean expression and truth table for an electronic circuit 


Find the Boolean expression and truth table for the electronic circuit shown in Fig- 
ure 5.14. 
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Example 5.10 


Solution 
Table 5.14 
The truth table for a 
system with inputs A, B 
and C and an output X. 


A B Cc xX 


SOC ORRR HE 
SCOFFR COC ORF 
Cor Or OF Oe 
BOR OR ORF 


Solution 

By labelling intermediate points in the circuit we see that X = A-B+C. In order to 
obtain the truth table we form all possible combinations of A, B and C, followed by 
A-B,C and finally X = A-B-+C. The complete calculation is shown in Table 5.13. 


Table 5.13 
The truth table for Figure 5.14. 
A B C A-B C X=A-B+C 
i i i 1 0 1 
‘ TM Tie eciee poeei he 881 1 
1 © 1 © @ 0 
x if © ®@ © i 1 
C 0 1 1 +0 0 0 
® 1 © @ ii 1 
© ®@ i © @ 0 
Figure 5.14 0 0 0 0 1 1 


Circuit for Engineering application 5.1. 


What we would now like to be able to do is carry out the reverse process: that is, start with 
a truth table and find an appropriate Boolean expression so that the required electronic 
device can be constructed. 


Given inputs A, B and C, find a Boolean expression for X as given by the truth table in 
Table 5.14. 


To find an equivalent Boolean expression the procedure is as follows. Look down the 
rows of the truth table and select those with an r.h.s. equal to 1. In this example, there 
are five such rows: 1, 2, 4, 6 and 8. Each of these rows gives rise to a term in the required 
Boolean expression. Each term is constructed so that it has a value | for the input values 
of that row. For example, for the input values of row 1, that is 1, 1, 1, we find A+-B-C 
has the value 1, whereas for the input values of row 2, that is 1, 1, 0, we find A+ B- Cc 
has the value 1. Carrying out this process for the other rows we find that the required 
expression is 


X=A-B-C+A-B-C+A-B-C+A-B-C+A-B-C (5.1) 


that is, a disjunction of terms, each term corresponding to one of the selected rows. This 
important expression is known as a disjunctive normal form (d.n.f.). We note that the 
truth table is the same as that of Engineering application 5.1 which had Boolean expres- 
sion (A- B) + C. The d.n.f. we have just calculated, while correct, is not the simplest. 


More generally, to find the required d.n.f. from a truth table we write down an expression 
of the form 


( para dtr + ( ) 


where each term has the value | for the input values of that row. We could now construct 
an electronic circuit corresponding to Equation (5.1) using a number of AND and OR 
gates together with inverters, and it would do the required job in the sense that the desired 
truth table would be achieved. 


Example 5.11 


Solution 
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However, we know from Engineering application 5.1 that the much simpler expres- 
sion (A+ B) + C has the same truth table and if a circuit were to be built corresponding 
to this it would require fewer components. Clearly what we need is a technique for find- 
ing the simplest expression which does the desired job since the d.n-f. is not in general 
the simplest. It is not obvious what is meant by ‘simplest expression’. In what follows 
we shall be concerned with finding the simplest d.n.f. It is nevertheless possible that a 
logically equivalent statement exists which would give a simpler circuit. Simplification 
can be achieved using the laws of Boolean algebra as we shall see in Example 5.11 and 
Engineering application 5.3. 


Engineering application 5.2 


The exclusive OR gate 


We have already looked at the OR gate in Section 5.3. The full name for this type of 
OR gate is the inclusive OR gate. It is so called because it gives an output of 1 when 
either or both inputs are 1. The exclusive OR gate only gives an output of 1 when 
either but not both inputs are 1. The truth table for this gate is given in Table 5.15 and 
its symbol is shown in Figure 5.15. Using the truth table, the d.n.f. for the gate is 


F=A-B+A-B 
The exclusive OR often arises in the design of digital logic circuits. In fact, it is 
possible to buy integrated circuits that contain exclusive OR gates as basic units. 


Table 5.15 
The truth table for an 
exclusive OR gate. 


A Ss ces 
a Ip ut . +) > FACE Ae 
1 1 0 
1 0 1 Figure 5.15 
0 1 1 Symbol for an exclusive OR gate. 
0 0 0 


Use the laws of Boolean algebra given in Table 5.8 to simplify the following expressions. 
(a) A-B+A-B 

(b) A+A-B 

(c) A+A-B-C 

(d) A-B-C+A-B-C+A+B-C+A-B-C+A-B-C 


(a) Using the distributive law we can write 
A-B+A-B=A-(B+B) 
Using the complement law, B + B = 1 and hence 


A-B+A-B=A+l 
=A using the identity law 
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Hence A- B+ A -B simplifies to A. Note that this is the first minimization law given 
in Table 5.9. 


(b) A+A-B=(A+A):(A+B) by the distributive law 
=1-(A+B) by the complement law 
=A+B using the identity law 


(c) Note that A + A-B-C can be written as A + (A-B)-C using the associative laws. 
Then 


A+ (A-B)-C = (A+A-B)-(A+C) by the distributive law 
= (A+B)-(A+C) using part (b) 
—A+B-C by the distributive law 


(d) A-B-C+A-B-C+A+B-C+A-B-C+A-B-C 
can be rearranged using the commutative law to give 


A-B-C+A-B-C+A-B-C+A-B-C+A-B-C 


This equals 


A-B-(C+C)+A-B-(C+C)+A-B-C by the distributive law 
B 


=A-B-1+A-B-1+A-B-C using the complement law 
=A-B+A-B+A-B-C using the identity law 
=A-(B+B)+A-B-C using the distributive law 
=A+A-B-C using the complement and 


identity laws 


Using the result of part (c) this can be further simplified to A + BC. 


Engineering application 5.3 


Design of a binary full-adder circuit 


The binary adder circuit is a common type of digital logic circuit. For example, the 
accumulator of a microprocessor is essentially a binary adder. The term full-adder 
is used to describe a circuit which can add together two binary digits and also add 
the carry-out digit from a previous stage. The outputs from the full-adder consist of 
the sum value and the carry-out value. By connecting together a series of full-adders 
it is possible to add together two binary words. (A binary word is a group of binary 
digits, such as 0111 1010.) For example, adding together two 4-bit binary words 
would require four full-adder circuits. This is shown in Figure 5.16. 

The inputs AO-A3 and BO-B3 hold the two binary words that are to be added. 
The outputs SO—S3 hold the result of carrying out the addition. The lines CO-—C3 hold 
the carry-out values from each of the stages. Sometimes there will be a carry-in to 
stage 0 as a result of a previous calculation. Let us consider the design of stage 2 in 
more detail. The design of the other stages will be identical. First of all a truth table 
for the circuit is derived. This is shown in Table 5.16. 
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Table 5.16 
Truth table for a full-adder. 


Cil AW tp sD 


0 0 0 0 0 

0 0 1 1 0 

0 1 0 1 0 

AO BO A1BI A2 B2 A3 B3 0 1 1 0 1 

1 0 0 1 0 

1 0 1 0 1 

Figure 5.16 1 1 0 0 1 

Four full-adders connected to allow two 4-bit binary words to 1 1 1 1 1 
be added. 


Notice that there are three inputs to the circuit, Cl, A2 and B2. There are also 
two outputs from the circuit, S2 and C2. Writing expressions for the outputs in d.n-f. 
yields 

S2 = C1-A2-B2+C1-A2-B2+C1-A2-B2+C1-A2-B2 
C2 =C1-A2-B2+C1-A2-B2+C1-A2-B2+C1-A2-B2 


It is important to reduce these expressions to as simple a form as possible in order to 
minimize the number of electronic gates needed to implement the expressions. So, 
starting with $2, 


S2 =C1-A2-B2+C1-A2-B24+C1-A2-B2+C1-A2-B2 
= Cl1- (A2-B2+A2-B2)+Cl-(A2-B2+A2-B2) 
by the distributive law (5.2) 


Let 
X =A2-B2+A2-B2 (5.3) 


Notice this is an equation for an exclusive OR gate with inputs A2 and B2. 
Using Equation (5.3) we have 


X =A2-B2+A2-B2 


= (A2-B2)-(A2-B2) by De Morgan’s law 

— (A2 + B2)- (A2 + B2) by De Morgan’s law 

= (A2 + B2)- (A2 + B2) by the complement law 
= A2-A2+A2-B2+ B2-A2+ B2-B2 by the distributive law 
= 0+A2-B2+ B2-A2+0 by the complement law 
= A2-B2+B2-A2 by the identity law 

= A2-B2+A2-B2 by the commutative law 


It is now possible to write Equation (5.2) as 


SD =CloX & Cloak 
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This is the expression for an exclusive OR gate with inputs C1 and X. It is there- 
fore possible to obtain $2 with two exclusive OR gates which are usually available 
as a basic building block on an integrated circuit. The circuit for S2 is shown in Fig- 
ure 5.17. Turning to C2, we have 


C2 = C1-A2-B2+C1-A2-B2+C1-A2-B2+C1-A2-B2 
= A2-B2-(C1+C1)+C1-(A2-B2+A2-B2) _ by the distributive law 
= AW ola) = (Cll o dX by the complement law 


since Cl + Cl = 1, where X is given by Equation (5.3). The output, X, has already 
been generated to produce $2 but can be used again provided the exclusive OR gate 
can stand feeding two inputs. Assuming this is so then the final circuit for the full- 
adder is shown in Figure 5.18. 


A2 xX Figure 5.17 an ee 
B2 $2 Circuit to implement S2 = C1-X +C1-X, 
where X = A2-B2 + A2-B2. 


Figure 5.18 
Circuit to implement the stage 2 full-adder. 


Engineering application 5.4 


Realization of logic gates 


Logic gates are usually constructed from one or more transistors. The most com- 
mon technology used is CMOS (complementary metal-oxide semiconductor) logic, 
which was invented by Frank Wanlass and was perfected whilst he was working 
at Fairchild Semiconductor in the 1960s. CMOS logic makes use of two different 
types of transistor known as PMOS (p-type metal-oxide semiconductor) and NMOS 
(n-type metal-oxide semiconductor) transistors. Both types can be readily manufac- 
tured in vast numbers on a single silicon wafer along with their associated wiring. 
This principle is the constructional basis of modern microprocessors which may con- 
tain billions of individual transistors. The term very-large-scale integration (VLSI) 
was coined to refer to the practice of assembling such a large number of transistor 
devices on a single silicon wafer. 

Figure 5.19 shows the construction of a single NAND gate made from its individ- 
ual transistors. 
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Figure 5.19 
Internal construction of a single 
CMOS NAND gate. 


The diagram shows two PMOS transistors, labelled Q1 and Q2, connected in par- 
allel. These are connected in series with two NMOS transistors, Q3 and Q4. There 
are four source terminals all labelled S, together with four drain terminals labelled D. 
The voltage Vp is the positive voltage supply which is connected to the drain on the 
field effect transistors. The labelling V, is a convention often adopted in this type of 
circuit. Logic levels in a circuit like this are represented by taking a low voltage, close 
to OV, to be a logic 0 and a high voltage, close to Vyp, to be a logic 1. The PMOS 
transistors Q1 and Q2 each carry current between their source and drain terminals 
only when a low voltage (logic 0) is connected at their gate terminal (labelled G). 
The NMOS transistors Q3 and Q4 are a complementary type where current flows, 
which only occurs when a high voltage, corresponding to logic 1, is presented at their 
gates. Thus when both A and B are at logic 0, Q1 and Q2 are switched on and Q3 and 
Q4 are switched off, hence the output is Vj), which represents logic 1. This output 
is still the same if either A or B, but not both, are at logic 1 because although Q3 or 
Q4 will be turned on they are connected in series and individually have no effect. If 
both A and B are at logic 1, then QI and Q2 are switched off, and Q3 and Q4 are 
switched on, hence the output will be approximately 0 V, which represents logic 0. 
This behaviour is consistent with the truth table given in Table 5.7. 

All modern VLSI chips are designed using high-level design languages such as 
VHDL (a specialized computer language for hardware) and the transistor design and 
layout is fully automated. It is now rarely necessary for the microprocessor designer 
to consider individual transistors or even individual gates. 


EXERCISES 5.4 
Write Boolean expressions for the output, F’, from the 3 Design electronic devices which produce the 
electronic devices shown in Figure 5.20. following outputs: 
Write Boolean expressions for the output from the (a) A+B (b) A-(B-C) (c) (C+D)-(A+B) 


devices shown in Figure 5.21. (d) A+B (ec) A‘-B+A-B+B:-C 


194 Chapter 5 Discrete mathematics 


(a) A 
B F 
c 

(b) A 


ies) 
5] 


(c) A 
B 
F 
Cc 
qd) A 
F 
B 
Figure 5.20 
(a) A 
F 
B 
A 
b 
(b) # 
F 
Cc 
Figure 5.21 
4 Draw up the truth tables for the expressions given in (c) A-A-A 
Question 3. (d) (A+A)-(A+A) 
5 Use truth tables to verify that the following pairs of (e) A+0 
expressions are logically equivalent: () (A+1)-A41) 


(a) p+p-q:r+p-qgandp+q 
(b) (A+ B)-(A+C) andA+B-C 
(c) prqer+p-q:r+p-q-rand p:(r+q) 


(g) A+1 


7 Simplify the following Boolean expressions using the 


; laws of Boolean algebra: 
6 Simplify the following Boolean expressions using the 


laws of Boolean algebra: (a) (A+A)-(B+B) 
(a) A-A-A (b) A-(A+B+A-B) 
(b) A-A-A (c) (A+A)+(A+C) 
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(d) A-B-(A+B) normal form. Use the laws of Boolean algebra to 
(ce) A-(A+B)-B simplify your expressions. , 
(f) A-B-C+A+B-C @) ————__ ©) 
A B x A B x 
(g) A-B-C+A+B+C 
8 Construct a truth table showing A - B and A + B in 0 0 J 0 0 : 
: : : ; 0 1 0 0 1 1 
order to verify the logical equivalence expressed in 
De Morgan’s law A- B = A + B. Carry out a similar ; : ! ; ° a 
exercise to verifyA + B=A-B. ! ! l : ! 0 
9 Let B = 1 and then B = 0 in the absorption laws, and (c) 
use the identity laws to obtain (a) A+ A = A and . A B C xX 
(b) A-A =A. Verify your results using truth tables. 
0 0 0 1 
10 Derive Boolean expressions and truth tables for the 0 0 1 1 
circuits shown in Figure 5.22. 0 iT 0 0 
11 Simplify the following Boolean expressions using 0 1 i 0 
Boolean algebra: 1 0 0 0 
= 1 0 1 0 
(a) A-B+A-B+B-C+A-B-C 1 1 0 1 
(b) A-(C+A)+C-B+D4+C+4+B-C4+C-A 1 1 1 1 
(c) A+B-C-D+A-+B-C+A-B-C-D+ 
A+-B-C-D+A-B-C:-D (d) 
12. The truth values of the Boolean expression, X, are A B c x 
given in the following tables. Write X in disjunctive 0 0 0 0 
0 0 1 0 
A 0 1 0 0 
B 0 1 1 1 
a 1 0 0 0 
Cc 1 0 1 1 
D 1 1 0 1 
E 1 1 1 0 
(a) 
A 13 Express A-B+A-B using only conjunction (AND 
B gate) and negation (NOT gate). 
Cc 
D xX 
E 
F 
(b) 
Figure 5.22 
Solutions 
7) (a) (A+B)-C (b) (A-B)-B 2 (a) A-B+A-B which is the same as A-B+A-B 
(c) (A-B)+B+C (d) A+B (b) (A-B+A-B)-(B+C) + (A-B+A-B)- (B+C) 


3 See Figure S.14. 
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Cc A 
D B 
p pp 
A 
(c) 
Cc 
(e) 


A 
sr —— A 
e B 
(b) (d) 
Figure S.14 
BE (a) @ — 
A B A+B ee 
1 1 1 
1 1 
1 0 ° - @ i 
0 1 0 
1 1 
0 0 0 0 0 
(b) (e) ————_ 
A B C A-(B-C) A B CC A+B+A-B+B-C 
1 1 1 0 a 1 
1 1 0 1 1 1 0 1 
1 0 1 1 1 0 1 0 
1 0 0 1 1 0 0 0 
0 1 1 I oO 1 1 0 
0 1 0 1 0 1 0 1 
0 0 1 1 0 oOo 1 1 
0 0 0 1 0 Oo 0 1 
(c) 
A B C D> (C+D)-(A+B) 6 (a)A (b)0 (c)0 A 
A 1 0 
ik A 4 i (e) ) (g) 
1 1 1 0 1 7 (a) A-B (b) A (c) A 
a ae : (d) 0 (e) 0 (f) A-B 
1 1 0 0 0 
101 1 1 (g) A+B-C 
1 0 1 0 1 
1 0 0 1 1 10 (a) X=(A-B)-(C+D+E) 
1 0 0 0 0 (b) X =A-B+A+B+C-D+E+F 
0 1 1 441 0 
ci hw 0 11 (a) A+B-C 
0 1 0 1 0 (b) A+B+C+D 
0 1 0 O 0 (c) (A+B) + (A-B-D) which can be further 
0 oO 1 1 1 simplified to A - (B+ D) 
0 0 1 0 1 
0 0 0 1 1 
0 0 0 0 0 


12 (a) 


1 


(b) 


Classify the following as true or false: 


(a) Ré CR (b) 0.667 € R~ 
(c) 0.667 € Q (d) NUZ=R 
(ce) —-6€Q () 9€R 

(g) NNQ=N () RNQ=4 


Use set notation to describe the following intervals on 
the x axis. 


(a) [—-6, 9] 
(d) (0, 2] 


(b) (-1, 1) 
(e) |x| > 1 


(c) |x| < 1.7 
(f) |x| > 2 


The sets A, B and C are given by 

A = {1,2, 6,7, 10, 11, 12, 13}, B = {3, 4, 7, 8, 11}, 
C = {4,5, 6, 7, 9, 13} and the universal set 
E={x:xeNt,1 <x < 13}. List the elements of 
the following sets: 


(a) AUB (b) BUC (c) ANB 
(d) AN(BUC) (ec) ANBNC 
(f) AU(BNC)  (g) CU(BNA) 


Represent the sets A, B and C described in Question 3 
on a Venn diagram. 


> 


(a) 


Sa BS 


(b) 


Be 


wo) 


(c) 
Figure 5.23 
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(c) A-B-C+A-B-C+A-B-C+A-B-C 
(d) A-B-C+A-B C 


13 (A-B)-(A-B) 


REVIEW EXERCISES 5 


5 


List all the elements of the following sets: 
(a) S={n:n€Z,5 <n? <50} 

(b) S={m:meN,5<m < 50} 

(c) S={m:meN,n? + 2m—15 =0} 
IfA = {n:ne Z,—-10 <n < 20}, 


B={m:m€N,m > 15} list the members of AN B 
and write down an expression for A U B. 


Write Boolean expressions for the output, F’, from the 
electronic devices shown in Figure 5.23. 


Simplify the following Boolean expressions using the 
laws of Boolean algebra: 

(a) A-B-l (b) A+A-B+B 

(c) 14+0 (d) D-(C+B)+C-D 

(e) (A+C)-(C+A) 

(f) A-B+A+B (g) A-B 

(h) A+B (i) A-B+O)-C 

(j) (C+A4+D)-(C-D+A) 


Draw up truth tables to verify that A- B + B and 
A-B + Aare logically equivalent. 
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12 
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Express A + B+ C+ D using the Boolean 
connectives AND (- ) and negation. 


Simplify the following expressions using the laws of 
Boolean algebra: 


(a) (A+A)-(B+B) (b) A-B-A 
(c) (A+B)-(A+B) (d) A-A 

(ec) A-B-C-D-1-0 (f) A-B-C-B 
(g) B-C=1 


Reduce the following expressions using Boolean 
algebra: 


(a) A+C+A-B-(B+C) 


(b) (A+B)-B-C+(A+C)-(B+A-C) 


(c) (A+B)-(A+C) 


The truth tables of the Boolean expression, X, are 
given in the following tables. Write the disjunctive 
normal form of X in each case. 


(a) 


> 
wy 


x 


—-Fe OO 
a) 
ooroe 


Solutions 


1 


(a) T (b) F (c) T (d) F 
(e) T (f) F (g) T (h) F 
(a) {x:xER,-6<x< 9} 

(b) {x:xER,-l<x< 1} 

(c) {x:x ER, -1.7 <x < 1.7} 

(d) {x:xeER,0<x< 2} 

(ce) {x:xER,x> lorx<-—l} 


(ff) {x:xE€R,x>20rx < —2} 


(a) {1,2,3,4, 6,7, 8, 10, 11, 12, 13} 
(b) (3, 4,5, 6, 7, 8,9, 11, 13} 

(c) {7,11} 

(d) {6, 7, 11, 13} 

(e) {7} 

(f) {1,2,4,6, 7, 10, 11, 12, 13} 

(g) {4,5,6,7,9, 11, 13} 


(b) 
A B xX 
0 0 0 
0 1 0 
1 0 1 
1 1 0 
On, Bp c x 
0 0) 0 1 
0 0 1 0 
0 1 0 0 
0 1 1 1 
1 0 0 0 
1 0 1 1 
1 1 0 0 
1 1 1 1 
(d) 
A B Cc XxX 
0 0 0 0 
0 0 1 1 
0 1 0 0 
0 1 it 1 
1 0 0 1 
1 0 1 1 
1 1 0 0 
1 1 1 0 
4 See Figure $.15. 
“(of # )’ 
Cc 
Figure S.15 


5 (a) S={-7, —6, —S5, —4, —3, 3, 4,5, 6, 7} 
(b) S = {3,4,5, 6, 7} 


(c) S= {3} 


6 ANB = {16, 17, 18, 19, 20} 
AUB=({m:meéeZ,m => —10} 
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(a) (A-B)-C+(A-B)-C 11 (a) 1 (b) 0 (c) A+B (d) A 

(b) (A+B)-B (e) 0 (f) 0 (g) B-C 

(c) (A-B)+ (C+D) 12 (a) A+B-C 

(a) A+B (b) A+B (c) 1 (b) A-B+A-C+B-C 

(d) D-(C+B) (eC (f) A-B (c) A+ (B+C) 

(g) A+B (h) A‘-B (i) A-C 13 (a) A-B 

Gj) A (b) A-B 

A-B-G-D (c) A-B-C+A+B-C+A+B-C+A-B-C 
(d) A‘B-C+A+B-C+A+B-C+A:B-C 
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INTRODUCTION 


Much of the material in this chapter is of a fundamental nature and is applicable to many 
different areas of engineering. For example, if continuous signals or waveforms, such as 
those described in Chapter 2, are sampled at periodic intervals we obtain a sequence of 
measured values. Sequences also arise when we attempt to obtain approximate solutions 
of equations which model physical phenomena. Such approximations are necessary if 
a solution is to be obtained using a digital computer. For many problems of practical 
interest to engineers a computer solution is the only possibility. The z transform is an 
example of an infinite series which is particularly important in the field of digital signal 
processing. Signal processing is concerned with modifying signals in order to improve 
them in some way. For example, the signals received from space satellites have to un- 
dergo extensive processing in order to counteract the effects of noise, and to filter out 
unwanted frequencies, before they can provide, say, acceptable visual images. Digital 
signal processing is signal processing carried out using a computer. So, skill in manip- 
ulating sequences and series is crucial. Later chapters will develop these concepts and 
show examples of their use in solving real engineering problems. 
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SEQUENCES 


A sequence is a set of numbers or terms, not necessarily distinct, written down in a 
definite order. 


For example, 


1111 
1,3,5,7,9 and Las TE 
are both sequences. Sometimes we use the notation ‘...’ to indicate that the sequence 
continues. For example, the sequence 1, 2, 3,..., 20 is the sequence of integers from 1 


to 20 inclusive. These sequences have a finite number of terms but we shall frequently 
deal with ones involving an infinite number of terms. To indicate that a sequence might 
go on for ever we can use the .. . notation. Thus 


2,4, 6,8,... 
and 
1,-1,1,-1,... 


can be assumed to continue indefinitely. 
In general situations we shall write a sequence as 


x[1];x[2],.213),.0%. 
or more compactly, 
x[k] k= 1,233 5-05 
An alternative notation is 
His Hai Kayo 


The former notation is usually used in signal processing where the terms in the sequence 
represent the values of the signal. The latter notation arises in the numerical solution of 
equations. Hence both forms will be required. Often x[1] will be the first term of the 
sequence although this is not always the case. The sequence 


..,X[—3], x[—2], x[—1], x[0], x[1], x[2], x[3],... 
is usually written as 
x[k] kag $35 = 2,-1.0, 12S hax 
A complete sequence, as opposed to a specific term of a sequence, is often written using 
braces, for example 


(fk]} = x{1], x[2],... 


although it is common to write x[k] for both the complete sequence and a general term 
in the sequence when there is no confusion, and this is the convention we shall adopt in 
this book. 
A sequence can also be regarded as a function whose domain is a subset of the set of 
integers. For example, the function defined by 
3k 


x.N>R a alee 
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Example 6.1 


Solution 


Example 6.2 


Solution 


Example 6.3 


oe 


Figure 6.1 
The unit step sequence. 


is the sequence 


3 9 
x[0] = 0 x[1] = 5 x[2] = 3 x[3] = ge 
The values in the range of the function are the terms of the sequence. The independent 
variable is k. Functions of this sort differ from those described in Chapter 2 because the 
independent variable is not selected from a continuous interval but rather is discrete. It 
is, nevertheless, possible to represent x[k] graphically as illustrated in Examples 6.1-6.3, 
but instead of a piecewise continuous curve, we now have a collection of isolated points. 


Graph the sequences given by 


0 k<0O ; 
(a) wai={) be 0 k=...,—3, —2,—-1,0,1,2,...,thatisk € Z 
1 keven 
(b) wal { re 2 


(a) From the definition of this sequence, the term x[k] is zeroifk <Oand 1lifk > 0.The 
graph is obtained by plotting the terms of the sequence against k (see Figure 6.1). 
This sequence is known as the unit step sequence. We shall denote this by u[k]. 


(b) The sequence x[k] is shown in Figure 6.2. 


Graph the sequence defined by 


kl = |p ee | ee. meee | A ee 


From the definition, if k = 0 then x[k] = 1. If k is not equal to zero the corresponding 
term in the sequence equals zero. Figure 6.3 shows the graph of this sequence which is 
commonly called the Kronecker delta sequence. 


The sequence x[k] is obtained by measuring or sampling the continuous function f(t) = 
sint,t € R, att = —2n, —37/2, —n, —71/2, 0, 1/2, 7, 370/2 and 27. Write down the 
terms of this sequence and show them on a graph. 


afk] 


Figure 6.2 1 
The sequence x[k] = | 
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4 3 2 
Figure 6.4 
Figure 6.3 The function f(t) = sint with sampled points 
The Kronecker delta sequence. shown. 
Figure 6.5 
Sequence formed from sampling 
f(t) = sint. 

Solution The function f(t) = sint, for —2m < t < 27, is shown in Figure 6.4. We sample the 
continuous function at the required points. The sample values are shown as e. From the 
graph we see that 

x[k] = 0, 1,0, —1, 0, 1,0, —1,0 k= —4,-3,...,3,4 
The graph of x[k] is shown in Figure 6.5. 
Sometimes it is possible to describe a sequence by a rule giving the kth term. For ex- 
ample, the sequence for which x[k] = 2* k =0,1,2,..., is given by 1,2,4,8....On 
occasions, a rule gives x[k] in terms of earlier members of the sequence. For example, the 
previous sequence could have been defined by x[k] = 2x[k — 1], x[0] = 1. The sequence 
is then said to be defined recursively and the defining formula is called a recurrence re- 
lation or difference equation. Difference equations are particularly important in digital 
signal processing and are dealt with in Chapter 22. 
Example 6.4 Write down the terms x[k] fork = 0,..., 7 of the sequence defined recursively as 
x[k] = x[k — 2] + x[k — 1] 
where x[0] = 1 and x[1] = 1. 
Solution The values of x[0] and x[1] are given. Using the given recurrence relation we find 


x[2] = x[0] + x[1] = 2 
x[3] = x[1] + x[2] = 3 


Continuing in this fashion we find the first eight terms of the sequence are 
V1.2 359482 13,21 


This sequence is known as the Fibonacci sequence. 
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6.2.1 


Example 6.5 


Solution 


Example 6.6 


Solution 


6.2.2 


Example 6.7 


Solution 


Arithmetic progressions 


An arithmetic progression is a sequence where each term is found by adding a fixed 
quantity, called the common difference, to the previous term. 


Write down the first five terms of the arithmetic progression where the first term is | and 
the common difference is 3. 


The second term is found by adding the common difference, 3, to the first term, 1, and 
so the second term is 4. Continuing in this way we can construct the sequence 


1,4,7, 10, 13,... 


A more general arithmetic progression has first term a and common difference d, 
that is 


d,a+d,a+2d,a+3d,... 
It is easy to see that the kth term is 
a+ (k—I1)d 


All arithmetic progressions can be written recursively as x[k] = x[k — 1]+d. 


Arithmetic progression: a,a+d,a+2d,... 


a= first term, d = common difference, kth term = a + (k — 1)d 


Find the 10th and 20th terms of the arithmetic progression with a first term 5 and common 
difference —4. 


Here a = 5 andd = —4. The kth term is 5 — 4(k — 1). Therefore the 10th term is 
5 — 4(9) = —31 and the 20th term is 5 — 4(19) = —71. 


Geometric progressions 


A geometric progression is a sequence where each term is found by multiplying the 
previous term by a fixed quantity called the common ratio. 


Write down the geometric progression whose first term is | and whose common ratio 
1 


is —. 


2 
: Bical _ di 1 
The second term is found by multiplying the first by the common ratio, > that is 5 xl= 


1 
5° Continuing in this way we obtain the sequence 


1 
1 +. Ses 
8 


Ale 


1 
> 9’ 
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A general geometric progression has first term a and common ratio r and can therefore 
be written as 


a, ar,ar’,ar°,... 


and it is easy to see that the kth term is ar*~!. All geometric progressions can be written 
recursively as x[k] = rx[k — 1]. 
Geometric progression: a, ar, OP oo. 


a= first term, r = common ratio, kth term = ar‘! 


More general sequences 
We have already met a number of infinite sequences. For example, 
(1) x[k] = 2,4,6,8,... 
1 
2) x[k] =1,=,-,... 
(2) x{k] 4 
In case (1) the terms of the sequence go on increasing without bound. We say the se- 
quence is unbounded. On the other hand, in case (2) it is clear that successive terms get 
smaller and smaller and as k — oo, x[k] — 0. The notion of getting closer and closer to 
a fixed value is very important in mathematics and gives rise to the concept of a limit. In 
case (2) we say ‘the limit of x[k] as k tends to infinity is 0’ and we write this concisely as 
lim x[k] = 0 
k>0oo 
We say that the sequence converges to 0, and because its terms do not increase without 
bound we say it is bounded. 

More formally, we say that a sequence x[k] converges to a limit / if, by proceeding 
far enough along the sequence, all subsequent terms can be made to lie as close to / as 
we wish. Whenever a sequence is not convergent it is said to be divergent. 

It is possible to have sequences which are bounded but nevertheless do not converge 
to a limit. The sequence 


xk) S15 eI, yg = Ups 


clearly fails to have a limit as k — oo although it is bounded, that is its values all lie 
within a given range. This particular sequence is said to oscillate. 

It is possible to evaluate the limit of a sequence, when such a limit exists, from knowl- 
edge of its general term. To be able to do this we can make use of certain rules, the proofs 
of which are beyond the scope of this book, but which we now state: 


If x[k] and y[k] are two sequences such that lim, _, ,, x[k] = /,, andlim,_, ., y[k] = 1, 
where /, and /, are finite, then: 

(1) The sequence given by x[k] + y[k] has limit /, + /,. 

(2) The sequence given by cx[k], where c is a constant, has limit cl,. 

(3) The sequence x[k]y[k] has limit /,/,. 


k l 
(4) The sequence Ale has limit + provided (5 20, 
yIk] l, 
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Furthermore, we can always assume that 


lim — =0 for any constant m > 0 


Example 6.8 Find, if possible, the limit of each of the following sequences, x[k]. 


1 

(a) xk] = 7 pS Ase, 

(b) x[k] =5 ke 1.2.34. 
1 

(©) IK = 345 1, 23:4, 523 

1 

d) x{k] = —— k= 1,254. 
(d) x[k] pa 

(ec) x{k] =k P2199 4S, 


Solution (a) 


(b) 
(c) 


(d) 


(e) 


Example 6.9 Given a sequence with general term x[k] = 


Solution It is meaningless simply to write k = oo to obtain lim,_, ,, x[k] = 


The sequence x[k] is given by 


111 
it, Ao a Ape? 
23 4 
Successive terms get smaller and smaller, and as k — oo, x[k] — 0. By proceeding 
far enough along the sequence we can get as close to the limit 0 as we wish. Hence 


i 
jim x[k] = jim i 0 


The sequence x[k] is given by 5,5,5,5,.... This cnn has limit 5. 
1 
The sequence 3, 3, 3, 3, ... has limit 3. The sequence 1, 5 7 . has limit 0. There- 
fore, using rule (1) we have 
1 
eet =3+0=3 
1 : 1 1 
The terms of the sequence x[k] = 3 + k are given by 4, 35. 33, ..., and by 


proceeding far enough along we can make all subsequent terms lie as close to the 
limit 3 as we wish. 


1 
The sequence x[k] = kad k=1,2,3,4,...,1s given by 


tot @ 
23°40" 
and has limit 0. 


The sequence x[k] = hk = 1,2,3,4,..., is given by 1,4,9, 16,..., and 
increases without bound. This sequence has no limit — it is divergent. 


—| 
kal’ find lim,_, ,, X[k]. 


-1 |. 
1” since such 


a quantity is undefined. What we should do is try to rewrite x[k] in a form in which we 
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can sensibly let k — oo. Dividing both numerator and denominator by k, we write 
k-1 1-—(/k) 
k+1. 1+(/k) 

Then, as k — oo, 1/k — 0 so that 


um ( - a = Mn sel SAI) baie 
k>oo\ 1 + (1/k) lim, ,.,(1 + (1/A)) 


eS eye 


Example 6.10 Given a sequence with general term 


3k? — 5k+6 
x{k] = on Sera 
R+2k+1 

find lim,_,. x{kl. 


Solution Dividing the numerator and denominator by k? introduces terms which tend to zero as 
k > o, that is 


3° —5k+6 — 3—(5/k) + (6/k’) 
R+2k+1 14 (2/k) + (1/k) 
Then as k — ov, we find 


lim x{k] = : =3 


k->oo 


2, 


Example 6.11 Examine the behaviour of < 7 38 k > o. 
Kk? k 
3k+1. 3+4(/k) 
As k — oo, 1/k — 0 so that the denominator approaches 3. On the other hand, as 
k —> oo the numerator tends to infinity so that this sequence diverges to infinity. 


Solution 


EXERCISES 6.2 


1. Graph the sequences given by 3 A sequence, x[k], is defined by 
(a) x[k] = k,k =0, 1, 2,3,... 2 
= x[k] = — +k,k =0,1,2,3,... 
(b)x1k] = i‘ =. 2 


0 otherwise k=0,1,2,3,... 
(c) x[k] =e*, k =0, 1, 2, 3,... 


State the first five terms of the sequence. 
4 Write down the first five terms, and plot graphs, of the 


2 The sequence x[k] is obtained by sampling sequences given recursively by 


f(t) = cos(t + 2),¢t € R. The sampling begins at 
t = O and thereafter att = 1, 2,3,.... Write down 
the first six terms of the sequence. 


x{k —1] 
(a) x[k] = a x{0] = 1 
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(b) x{k] = 3x[k — 1] — 2x[k — 2], 


x[0] = 2, x[1] = 1 
5 A recurrence relation is defined by 
x[n + 1] = x[n] + 10, x[0] = 1, 
n= 05-1,.2;. 35252 
Find x[1], x[2], x[3] and x[4]. 
6 A sequence is defined by means of the recurrence 
relation 
x[n + 1] = x[n] +n’, x(0] = 1, 
n= 05-1;2;3,40. 
Write down the first five terms. 
7 Consider the difference equation 
x[n + 2] — x[n + 1] = 3x[n], 
n= O051,2,3).2% 
If x[0] = 1 and x[1] = 2, find the terms 
x[2], x[3],..., x[6]. 
8 Write down the 10th and 19th terms of the arithmetic 
progressions 
(a) 8,11, 14,... 
(b) 8,5,2,... 
9 An arithmetic progression is given by 
2b b 
b, —, =,0,... 
3° 3 
(a) State the sixth term. 
(b) State the kth term. 
(c) Ifthe 20th term has a value of 15, find b. 
10 Write down the 5th and 10th terms of the geometric 
progression 8, 4, 2,.... 
11 Find the 10th and 20th terms of the geometric 
progression with first term 3 and common ratio 2. 
12 A geometric progression is given by 
a, ar, ar’, ar’, 
Solutions 
2 cos2, cos 3, cos 4, cos 5, cos 6, cos 7 
1 
3 (0, e 4,— 2 12 
2 2° 
OS wi... (b) 2, 1, -1, —5, —13 
C2 ae a ars pal =ap= 
2 4 8 16 
5 11, 21,31,4 
Oy 1,1,2,6, 15 
Me 5, 11, 26, 59, 137 


13 


14 


15 


16 


17 


18 


11 
12 


If |(k + 1)th term | > |Ath term | and (k + 1)th term x 
kth term < 0, which of the following, if any, must be 
true? 

(a) r>1 

(c) r<—-l 

(e) -l<r<l 


(b) a>1 
(d) ais negative 


A geometric progression has first term a = 1. The 
ninth term exceeds the fifth term by 240. Find 
possible values for the eighth term. 


3k +2 


If x[k] = —— find lim... x1K1. 
Find li Si a 
1n IM, _, 99 Fa 5} 

R47 


Find the limits as & tends to infinity, if they exist, of 
the following sequences: 


(a) x[{klak? 
(b) x[k] = = 
(c) x[k] = oa 
4 
6k +7 
Find i (5.75) 


Find lim,_, ,, X[k], if it exists, when 


(a) x[k] = (-1)* 


k 
(b) xk] =2— 55 
k 
(c) x[k] = : 
~ 13 
4 ee +4 
0 MO SBOE 4 


I 2k 
(e) x{kl = (;) 


(a) 10th term = 35, 19th term = 62 
(b) 10th term = —19, 19th term = —46 


2b b(4—k) 45 
(a) a (b) 3. (c) 6 
1 1 
2° 64 


1536, 1572 864 


Only (c) must be true 


13 +128 
14 3 
15 0 


1 
16 (a) Limit does not exist (b) 5 (c) 1 
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17 16 


18 (a) Limit does not exist 
(b) Limit does not exist 


(c) 0 (d) 5 (e) 0 


SERIES 


Whenever the terms of a sequence are added together we obtain what is known as a 


series. For example, if we add the terms of the sequence 1, wit we obtain the series 


S, where 


sap bela 
~~ "2° 4° 8 


This series ends after the fourth term and is said to be a finite series. Other series we 
shall meet continue indefinitely and are said to be infinite series. 
Given an arbitrary sequence x[k], we use the sigma notation 


n 


S.= >: iF 


k=1 
to mean the sum x[1] + x[2] + --- + x[n], the first and last values of k being shown 
below and above the Greek letter &, which is pronounced ‘sigma’. If the first term of 
the sequence is x[0] rather than x[1] we would write > x[k]. 


Sum of a finite arithmetic series 
An arithmetic series is the sum of an arithmetic progression. Consider the sum 
S=14+2+34+4+5 


Clearly this sums to 15. When there are many more terms it is necessary to find a more 
efficient way of adding them up. The equation for S can be written in two ways: 


S=14+2+3+4+5 

and 
S=5354+4+34+2+1 

If we add these two equations together we get 
2S =6+6+6+6+6 


There are five terms so that 


25=5 x 6= 30 
that is 
S=15 


Now a general arithmetic series with k terms can be written as 
S, =a+(a+d)+ (a+2d) +---+ (a+ (k— 1)d) 
but rewriting this back to front, we have 


S, = (a+ (k— 1d) + (a+ (k—2)d) +--+ (at) +a 
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Example 6.12 


Solution 


Example 6.13 


Solution 


Adding together the first term in each series produces 2a + (k — 1)d. Adding the second 
terms together produces 2a + (k — 1)d. Indeed adding together the ith terms yields 
2a + (k — 1)d. Hence, 


2S, = (2a 4+ (k — 1)d) + (Qa 4 (k — 1)d) + +--+ Qa+t (k — 1)d) 


k times 
that is 


2S, = k(2a + (k — 1)d) 
so that 
k 
S, = 5 (2a + (k—1)d) 
This formula tells us the sum to k terms of the arithmetic series with first term a and 


common difference d. 


k 
Sum of an arithmetic series: S$, = 5 (2a+ (k — 1)d) 


Find the sum of the arithmetic series containing 30 terms, with first term 1 and common 
difference 4. 


We wish to find S,: 
S,=14+54+9+.-:: 
—— 


30 terms 


k 30 
Using S, = 5 (2a + (k — 1)d) we find Sy) = (2 + 29 x 4) = 1770. 


Find the sum of the arithmetic series with first term 1, common difference 3 and with 
last term 100. 


We already know that the kth term of an arithmetic progression is given by a+ (k—1)d. 
In this case the last term is 100. We can use this fact to find the number of terms. Thus, 


100 = 1+3(k—1) 


that is 
3(k-—1) = 99 
k—1= 33 
k = 34 


So there are 34 terms in this series. Therefore the sum, S,,, is given by 


34 
S34 = F 20) + B3)G)} 
= 170101) 
S717 
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Sum of a finite geometric series 


A geometric series is the sum of the terms of a geometric progression. If we sum the 


; ; 1 
geometric progression 1, = 


, =.=, — We find 
24 8 16 
ae ee eee (6.1) 
7 oe es a : 


If there had been a large number of terms it would have been impractical to add them all 
1 


directly. However, let us multiply Equation (6.1) by the common ratio, 5 


We ee (6.2) 
2° 2 4 °8° 16° 32 , 


so that, subtracting Equation (6.2) from Equation (6.1), we find 


S Ly : 

a 32 
tt 1 out. Theref Ee d Gat 
since most terms Cancel out. ere oa = 32 and SO = 16 16° 


We can apply this approach more generally: when we have a geometric progression 
with first term a and common ratio r, the sum to k terms is 


S; =atar+ar?+ar>+---+ar™ 
Multiplying by r gives 

rs, = ar tar* tar? +---+ar! 4ar* 
Subtraction gives S, — rS, = a —ar*, so that 


a(1—r*) 


l-r 


S,= provided r + 1 
This formula gives the sum to k terms of the geometric series with first term a and 
common ratio r. 


al = 7) 


Sum of a geometric series: S, = i 
—r 


Pp se il 


Sum of an infinite series 


When dealing with infinite series the situation is more complicated. Nevertheless, it is 
frequently the case that the answer to many problems can be expressed as an infinite 
series. In certain circumstances, the sum of a series tends to a finite answer as more and 


more terms are included and we say the series has converged. To illustrate this idea, 


1 1 
consider the graphical interpretation of the series 1 + 5 +—+—+---, as given in 


, 4 8 
Figure 6.6. 


Start at 0 and move a length 1: total distance moved = 1 


il 1 
Move further, a length e total distance moved = a 


1 3 
Move further, a length re total distance moved = 7 
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6.3.4 


acer ar 7. Se Figure 6.6 
0 1 Iz 141g 2 Graphical interpretation of the series 
1 1 1 
fo ae te 
Pon Ge 
At each stage the extra distance moved is half the distance remaining up to the point 
x = 2. It is obvious that the total distance we move cannot exceed 2 although we 


can get as close to 2 as we like by adding on more and more terms. We say that the 


series | + ~ + — +--- converges to 2. The sequence of total distances moved, given 


; 4 
previously, 
1.3 27 
1,1=,1-,1-,... 
2 4 8 


is called the sequence of partial sums of the series. 
For any given infinite series °° , x[k], we can form the sequence of partial sums, 


1 

S, =) xk] = x1] 
k= 

2 


S, = ) x(k] = x[1] + x12] 


3 


3 = > afk] = afl] + x(2] + x13] 


If the sequence {S,,} converges to a limit S, we say that the infinite series has a sum S 
or that it has converged to S. Clearly not all infinite series will converge. For example, 
consider the series 


1424344454: 


The sequence of partial sums is 1, 3, 6, 10, 15, .... This sequence diverges to infinity 
and so the series 1+ 2+3+4-+5+--- is divergent. 

It is possible to establish tests or convergence criteria to help us to decide whether 
or not a given series converges or diverges, but for these you must refer to a more 
advanced text. 


Sum of an infinite geometric series 


In the case of an infinite geometric series, it is possible to derive a simple formula for 
its sum when convergence takes place. We have already seen that the sum to k terms is 
given by 

ad —r*) 


S,= rAzl 


l-r 

What we must do is allow k to become large so that more and more terms are included 

in the sum. Provided that —1 < r < 1, then ré + 0 ask > oo. Then Ss, > 7 
-—r 

When this happens we write 
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where S,, is known as the ‘sum to infinity’. If r > 1 or r < —1, r* fails to approach a 
finite limit as k —> oo and the geometric series diverges. 


Sum of an infinite geometric series: S,, = 


a 


—ll<pell 
—r 


Find the sum to k terms of the following series and deduce their sums to infinity: 


1 
B) 1264 S415 ts: 


(a) This is a geometric series with first term 1 and common ratio 1/3. Therefore, 


ai—r*) 10-—(1/3)5 :( (:)) 
S;, — — — 1 
l-r 2 3 


1 
(b) This is a geometric series with first term 12 and common ratio 2 Therefore, 


As k — oo, (1/2)* > 00 that S,. = 24. This could, of course, have been obtained 


Example 6.14 
(a) 1+ : oF ; 2s ! =F 
a el een eee Dee 
3. 9 27 
Solution 
2/3 
As k + 00, (1/3) > 0 so that S$, = 3/2. 
S, = 241 — (1/2)*) 
directly from the formula for the sum to infinity. 
EXERCISES 6.3 


An arithmetic series has a first term of 4 and its 30th 
term is 1000. Find the sum to 30 terms. 


Find the sum to 20 terms of the arithmetic series with 
first term a, and common difference d, given by 
(a) a=4,d=3 (b) a=4,d = —-3 


If the sum to 10 terms of an arithmetic series is 100 


and its common difference, d, is —3, find its first term. 


The sum to 20 terms of an arithmetic series is 
identical to the sum to 22 terms. If the common 
difference is —2, find the first term. 


Find the sum to five terms of the geometric series 
with first term 1 and common ratio 1/3. Find the sum 
to infinity. 


Find the sum of the first 20 terms of the geometric 
series with first term 3 and common ratio 1.5. 


Solutions 


15 060 


(a) 650 (b) —490 


11 
12 


Find the sum of the arithmetic series with first term 2, 
common difference 2, and last term 50. 


The sum to infinity of a geometric series is four times 
the first term. Find the common ratio. 


The sum to infinity of a geometric series is twice the 
sum of the first two terms. Find possible values of the 
common ratio. 
Express the alternating harmonic series 

1 1 


ie ; + ae +--+ in sigma notation. 
k 


Write down the first six terms of the series )772.9 z*. 


Explain why 72; x[k] is the same as 77°, x[7]. 
Further, explain why both can be written as 
eo xk + 1. 


23.5 
41 
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5 1.494, S,, = 3 


6 19946 
7 650 
i]; 


1 


9 +— 
J/2 
co (—])rtl 
10 —— 
a 
1 1 1 1 1 
171 14+-+ + t t 
z 2 Bp Ao 2 


Technical Computing Exercises 6.3 


Most technical computing languages have built-in 
functions for generating geometric series. MATLAB® 
has the function cumprod which calculates the 
cumulative product of the numbers passed to its 
input. It takes the first term and then successively 


1.000000 0.500000 0.250000 
0.125000 0.062500 


1. Calculate the sum of all of the elements in the finite 
series given above. 


multiplies the succeeding arguments in turn by the 2 Increase the number of elements in the series to 10 


result. 


and note the difference in your answer. 


For example, in MATLAB® we could type: 3 Compare the answers to the previous two exercises to 


S=cumprod([1 0.5 0.5 0.5 0.5]) 
to produce the geometric series and store it in a row 


vector S. 


the exact equation given at the end of Section 6.3.2 
with a = | andr = 0.5. What would you expect to 
happen if there were 100 elements in the series? 


THE BINOMIAL THEOREM 


It is straightforward to show that the expression (a + b)? can be written as a? + 
2ab + b’. It is slightly more complicated to expand the expression (a + b)? to a® + 
3a°b + 3ab* + b’. However, it is often necessary to expand quantities such as (a + b)°® 
or (a +b)", say, and then the algebra becomes extremely lengthy. A simple technique 
for expanding expressions of the form (a+ b)”, where n is a positive integer, is given by 
Pascal’s triangle. 

Pascal’s triangle is the triangle of numbers shown in Figure 6.7, where it is observed 
that every entry is obtained by adding the two entries on either side in the preceding 
row, always starting and finishing a row with a ‘1’. You will note that the third row 
down, 1 2 1, gives the coefficients in the expansion of (a + b)* = la? + 2ab + 1b’, 
while the fourth row, 1 3 3 1, gives the coefficients in the expansion of (a+b)? = 
la? + 3a°b + 3ab* + 1b3. Furthermore, the terms in these expansions are composed of 
decreasing powers of a and increasing powers of b. When we come to expand the quan- 
tity (a+))‘ the row beginning ‘1 4’ in the triangle will provide us with the necessary 
coefficients in the expansion and we must simply take care to put in place the appropriate 
powers of a and b. Thus (a+ b)* = lat + 4a°b + 607’ + 4ab? + 1b. 


1 1 
1 2 1 
1 3 3 1 
i (Ot)  * 2 
1 5 Coy 10 5 1 Figure 6.7 
1 6 Is 7 20 15 6 1 


Pascal’s triangle. 


Example 6.15 


Solution 


Example 6.16 


Solution 


Example 6.17 


Solution 
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Use Pascal’s triangle to expand (a + b)°. 
We look to the row commencing ‘1 6’, thatis1 6 15 20 15 6. 1, because 
a-+ bis raised to the power 6. This row provides the necessary coefficients. Thus, 


(a+b)® =a° + 6ab + 15a‘b’ + 20a*b® + 15a°b* + 6ab> + b° 


Expand (1 +x)’ using Pascal’s triangle. 


Forming the row commencing ‘1 7’ we select the coefficients 
1 7 21 35 35 21 7 1 
In this example, a = 1 and b = x so that 


(1 +x)? = 14+ 7x + 21a? + 352° + 35x47 + 210° + 7x° + x7 


When it is necessary to expand the quantity (a + b)” for large n, it is clearly in- 
appropriate to use Pascal’s triangle since an extremely large triangle would have to be 
constructed. However, it is frequently the case that in such situations only the first few 
terms in the expansion are required. This is where the binomial theorem is useful. 


The binomial theorem states that when n is a positive integer 


—1 —1 —2 
(a+ b)" =a" +na"™'b+ ip 


+...+ p" 


It is also frequently quoted for the case when a = | and b = x, so that 


CS oe ies a) ae 


7 31 op ae (6.3) 


n 
+x)" =1+nx+ 


Expand (1 +.x)!° up to the term in x’. 


We could use Pascal’s triangle to answer this question and look to the row commencing 
‘1 10° but to find this row considerable calculations would be needed. We shall use the 
binomial theorem in the form of Equation (6.3). Taking n = 10, we find 


10 
) 2 (10) 9) (8) 3 a 
2! 3! 
=14+10x+ 45.7 + 120° +--- 


(1+x)'°=1410x+ 


so that, up to and including x°, the expansion is 


1+ 10x + 45x? + 120x3 
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Example 6.18 


Solution 


Example 6.19 


Solution 


We have assumed in the foregoing discussion that 7 is a positive integer in which case 
the expansion given by Equation (6.3) will eventually terminate. In Example 6.17 this 
would occur when we reached the term in x!°. It can be shown, however, that when n is 
not a positive integer the function (1 + x)” and the expansion given by 


—1 —1 —2 
“ eae a Soe 


(l4+x)"=1ltnm+- (6.4) 
have the same value provided —1 < x < 1. However, when n is not a positive integer the 
series does not terminate and we must deal with an infinite series. This series converges 
when —1 < x < | and the expansion is then said to be valid. When x lies outside this 
interval the infinite series diverges and so bears no relation to the value of (1 +x)”. The 
expansion is then said to be invalid. 


The binomial theorem: 
-—1 —2 
na—Wn=2) 9 


31 —-l<x<l 


=I 
(Cee Sle ees ao oe 


Use the binomial theorem to expand i+ in ascending powers of x up to and including 
x 


the term in x’. 


can be written as (1 + x)~!. Using the binomial theorem given by Equation (6.4) 
x 
with n = —1, we find 


(—1)(—2)(-3) , 
3! - 


(+x)? =14(-1)x+ CO + foe 


=l—-x4+x7-xv4+... 


provided —1 < x < 1. Consequently, if in future applications we come across the series 
1—x+2x — x3 +..., we shall be able to write it in the form (1 + x)~!. This closed 
form avoids the use of an infinite series and so it is easier to handle. We shall make use 
of this technique in Chapter 22 when we meet the z transform. 


Obtain a quadratic approximation to (1 — 2x)!/* using the binomial theorem. 


1 
Using Equation (6.4) with x replaced by —2x andn = 5 we have 


(1/2)(-1/2) 


a ee es 


1 
(= 29)!'? =14(5)¢ 2x) + 
1 
= Pax a? ees 
x oT 


1 1 
provided that —1 < —2x < 1, that is —5 <x< 7 A quadratic approximation is 


therefore 1 — x — =x’. 


1 


a F WwW 


EXERCISES 6.4 


Use the binomial theorem to expand 


4 
(a) d+x)3— (b) (1 +x)4 © (449) 


5 5 4 
(d) (: - 5) (e) (2+ ;) (f) ( " ‘) 
2 2 4 


Use Pascal’s triangle to expand (a + b)8. 
Use Pascal’s triangle to expand (2x + 3y)*. 
Expand (a — 2b). 


Use the binomial theorem to find the expansion of 


(3 — 2x)® up to and including the term in x, 


Obtain the first four terms in the expansion of 


igs : 
=x}. 
2 


Obtain the first five terms in the expansion of 

(1 + 2x)!/2. State the range of values of x for which 
the expansion is valid. Choose a value of x within the 
range of validity and compute values of your 
expansion for comparison with the true function 
values. 


-4 
1 
Expand (1 + 5*) in ascending powers of x up to 


the term in x4, stating the range of values of x for 
which the expansion is valid. 


-1/2 
Expand (1 + :) in descending powers up to the 
x 


fourth term. 


Solutions 


(a) 1+3x4+3x24+x3 
(b) 1+ 4x + 6x? + 4x3 + x4 


4x 2x2 
(c) 1 + 


5x4 5 
te) 32+ 40x-+ 20x? + 5x3 + = + 


x4 2x2 3x3 4 x4 
x 
8 16 256 


a’ + 8a’b + 28a°b? + 56a°b? + 70a*h* + 56a2b> + 
28a2b® + 8ab’ + b8 


(f) 81 
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(a) Expand (1 +x)*. 
(b) Expand (1 + 1/x2)4. 


A function, f(x), is given by 


i 1/2 
foo= (144) 
x 


(a) Obtain the first four terms in the expansion of 
J (x) in descending powers of x. State the range 
of values of x for which the expansion is valid. 


(b) By writing f(x) in the form 
foes P04" 


obtain the first four terms in the expansion of 
f (x) in ascending powers of x. State the range of 
values of x for which the expansion is valid. 


The function, g(x), is defined by 


1 
~ (L4x2)4 
(a) Obtain the first four terms in the expansion of 


g(x) in ascending powers of x. State the range of 
values of x for which the expansion is valid. 


g(x) 


(b) By rewriting g(x) in an appropriate form, obtain 
the first four terms in the expansion of g(x) in 
descending powers of x. State the range of values 
of x for which the expansion is valid. 


16x* + 96x3 y + 216x2y2 + 216xy? + 81y4 


@ — 10a*b + 40a°b? — 80a2b? + 80ab* — 32b° 


729 — 2916x + 4860x2 — 4320x3 
45x2 
L+5x-+ + 152° 
1+ aaa sx valid for — 1 : 
——+— — — valid for -—= <x < = 
a BR 2~**39 
2 3 4 
(ame 2 ie Dede 
2° 2 
t.3 5 
1 mr 


218 Chapter 6 Sequences and series 


2 4 6 8 2 3 
m ©) ~~ we we (b) wim(ia5- 3 +3] valid for |x| <1 
b) 1 
(b) +atatuts 


12 (a) 1— 4x? + 10x* — 20x valid for |x| <1 


1 1 1 
: 4 10 20 
11 (a) it Se ee Oe (b) (1-4 BB) vsti > 


x2 xt x6 
| 65 | POWER SERIES 


A particularly important class of series are known as power series and these are infinite 
series involving integer powers of the variable x. For example, 


Ltx+xr4xe4--- 


and 
2 3 


oe 
ee 
a0 G 


are both power series. Note that a power series can be regarded as an infinite polynomial. 
Many common functions can be expressed in terms of a power series, for example 


xr 


a ame Geo ane x in radians 
which converges for any value of x. For example, 
0,5)" 0.5)° 
(0.5) ri (0.5) 
6 120 
Taking just the first three terms, we find 


sin(0.5) = 0.5 


sin(0.5) © 0.5 — 0.020 833 3 + 0.000 2604 = 0.479 427 1 


as compared with the true value, sin0.5 = 0.479 425 5. 

More generally, a power series is only meaningful if the series converges for the 
particular value of x chosen. We define an important quantity known as the radius of 
convergence, R, as the largest value for which an x chosen in the interval -R<x<R 
causes the series to converge. 

The open interval (—R, R) is known as the interval of convergence. Tests for conver- 
gence of a power series are the subject of more advanced texts. Further consideration will 
be given to power series in Chapter 18, but for future reference we give some common 
expansions now: 


ee x in radians 
Ce ; ; 

cosx=l1——-+—-—.-.. x in radians 
2 A! 

ee 

= Tact ee ay qq © 


all of which converge for any value of x. 


1 


2 
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Each of these series converges rapidly when x is small, and so can be used to obtain 
useful approximations. In particular, we note that 


If x is small and measured in radians 


sinx + x and 


x 


cosx ¥ | - — 


2) 


These formulae are known as the small-angle approximations. 


EXERCISES 6.5 


The power series expansion of e* is given by 
2: x3 


cas a = ree 
e han yt ay oe 


and is valid for any x. Take four terms of the series 
when x = 0, 0.1, 0.5 and 1, to compare the sum to four 
terms with the value of e* obtained from your 
calculator. Comment upon the result. 


Using the power series expansion for cos x: 
(a) Write down the power series expansion for 
cos 2x. 
(b) Write down the power series expansion for 
cos(x/2). 
Solutions 
x e* Sum to 4 terms 
1 1 
0.1 1.1052 1.1052 
0.5 1.6487 1.6458 
1 2.7183 2.6667 
Values are in close agreement when x is small. 
2 2x4 x x4 
=o Sam Cb) ae reais 
(a) w+ (b) g Sea 


By considering the power series expansion for 
cos(—x) show that cos x = cos(—x). 


By considering the power series expansion of e* find 
o,e) 

Dix=o I /kt. 

Obtain a cubic approximation to e* sin x. 


(a) State the power series expansion for e*. 


(b) By using your solution to (a) and the expansion 
for e*, deduce the power series expansions of 
coshx and sinh x. 


2 


1 Xx 
ee a 


2 x4 


x 
(b) coshie = lap tay ts 
Pr 

sinha =X oo tee 


GT] SEQUENCES ARISING FROM THE ITERATIVE SOLUTION 


OF NON-LINEAR EQUATIONS 


It is often necessary to solve equations of the form f(x) = 0. For example, 


f(x) =x? —3x74+7=0, 


fake <0 
x 


To solve means to find values of x which satisfy the given equation. These values are 
known as roots. For example, the roots of x* — 3x + 2 = 0 are x = 1 and x = 2 because 
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Example 6.20 


Solution 


when these values are substituted into the equation both sides are equal. Equations where 
the unknown quantity, x, occurs only to the first power are called linear equations. 
Otherwise an equation is non-linear. A simple way of finding the roots of an equation 
J (x) = 0 is to sketch a graph of y = f(x) as shown in Figure 6.8. 

The roots are those values of x where the graph cuts or touches the x axis. 
Generally, there is no analytical way of solving the equation f(x) = 0 and so it is often 
necessary to resort to approximate or numerical techniques of solution. An iterative 
technique is one which produces a sequence of approximate solutions which may con- 
verge to a root. Iterative techniques can fail in that the sequence produced can diverge. 
Whether or not this happens depends upon the equation to be solved and the availability 
of a good estimate of the root. Such an estimate could be obtained by sketching a graph. 
The technique we shall describe here is known as simple iteration. It requires that the 
equation be rewritten in the form x = g(x). An estimate of the root is made, say x), and 
this value is substituted into the r.h.s. of x = g(x). This yields another estimate, x,. The 
process is then repeated. Formally we express this as 


X41 = g(x,) 


This is a recurrence relation which produces a sequence of estimates x, X,,X,,.... 
Under certain circumstances the sequence will converge to a root of the equation. It 
is particularly simple to program this technique on a computer. A check would be built 
into the program to test whether or not successive estimates are converging. 


Solve the equation f(x) = e* — x = 0 by simple iteration. 


x 


The equation must first be arranged into the form x = g(x), and so we write e* — x = 0 


as 


In this example we see that g(x) = e™*. The recurrence relation which will produce 
estimates of the root is 


Xt = en 
Table 6.1 
Iterative solution of 
e*—x=0. 
n X 
0 0 
F(x) 1 1 
2 0.368 
3 0.692 
4 0.501 
5 0.606 
x 6 0.546 
Figure 6.8 
A root of f(x) = 0 occurs where the : 0.567 


graph touches or crosses the x axis. 
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Suppose we estimate x, = 0. Then 


xy =e =e! = 0.368 


The process is continued. The calculation is shown in Table 6.1 from which we see that 
the sequence eventually converges to 0.567 (3 d.p.). We conclude that x = 0.567 is a 
root ofe* —x = 0. 


Note that if the equation to be solved involves trigonometric functions, angles will usu- 
ally be measured in radians and not degrees. 

It is possible to devise a test to check whether any given rearrangement will converge. 
For details of this you should refer to a textbook on numerical analysis. There are other 
more sophisticated iterative methods for the solution of non-linear equations. One such 
method, the Newton—Raphson method, is discussed in Chapter 12. 


EXERCISES 6.6 
1. Show that the quadratic equation x? —5x—7=Ocan is 
be written in the form x = /7 + 5x. With x) = 6 3—e% 
locate a root of this equation. a4 = 10 


2 For the quadratic equation of Question 1 show that an 
alternative rearrangement is 


7 . 5 (a) Show that the cubic equation x +4+3x—-5=0 
x = —.—. With xp = 0.6 find the second . 
5 can be written as 


With x9 = 0 locate a root of the given equation. 


solution of this equation. : 


3 For the quadratic equation of Question 1 show that (i) x= a 
7 
another rearrangement is x = — +5. Try to >, 5 
x (ii) x= 5 . 
solve the equation using various initial estimates. x +3 
Investigate further alternative arrangements of the (b) By sketching a graph for values of x between 0 


original equation. 


Show that one recurrence relation for the solution of 
the equation 


e*+ 10x -3=0 


and 3 obtain a rough estimate of the root of the 
equation given in part (a). 


(c) Determine which, if either, of the arrangements 
in part (a) converges more rapidly to the root. 


Solutions 
1 6.14 5 (b) 1.15 
mM 144 (c) Arrangement (11) converges more rapidly 


4 0.18 
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Technical Computing Exercises 6.6 


Write a computer program to implement the simple 
iteration method. By comparing successive estimates 


the program should check whether convergence is 
taking place. 


REVIEW EXERCISES 6 


Write down and graph the first five terms of the 
sequences x[k] defined by 


(a) x{k] = (-1)*, k=0,1,2,3,... 
b ya k=-0,1,2,3 
(D) SI =e =0,1,2,3,... 


Find expressions for the kth terms of the sequences 
whose first five terms are 


(a) 1,9, 17, 25, 33 
(b) —I,1,—-1,1,-1 


For the Fibonacci sequence 


V9, 2,353 853.0 
show that 
x[k + 1] 1 
= (1 5 
k>oo [Kk] At +¥5) 


[Hint: write x[k + 1] = x[k] + x[k — 1], form 
x[k + 1]/x[k] and take limits.] 


Use the binomial theorem to expand (1 +x + x25 as 


far as the term in x°. 


1 
Use the binomial theorem to expand -—— 
(3 +x) 


in ascending powers of x as far as the term in x. 


The power series expansion for In(1 + x) is given by 


5 Ae ee 


In ee ee ay 
n(l+x)=x za a7 


and is valid for —1 <x < 1. Take a number of values 
of x in this interval and obtain an approximate value 
of In(1 + x) by means of this series. Compare your 
answers with the values obtained from your calculator. 


By multiplying both numerator and denominator of 


fhe Lae 
_ Oe nie 
Vk+1— Sk 


lim 
k-00 2, 


3 
: : 3k-—1 
Find lim | ——— ] . 
k-00\ 2k +7 


2k — 3k? 
Find lim —-——_. 
k>oo Tk! + 2k 


Write down the first eight terms of the series )“7_, k. 
By noting that this is an arithmetic series show that 


ye are 
k=1 


Write down the first six terms of the sequence defined 
by the recurrence relation 


x[n + 3] = x[n + 2] — 2x[n] 
x[0] = 0 x{1] = 2 x[2] = 3 
Find the limit, if it exists, as k > oo of the geometric 
progression 


a,ar,ar’,...,ar*—|,... 


when 


(a) -l<r<l 

(b) r>1 

(c) r<-l 

(d) r=1 

(e) r=-l 

An arithmetic series has a first term of 4 and the 10th 
term is 0. 

(a) Find S59. 

(b) If S,, = 0, find n. 


A geometric series has 
S= 37 _ 3367 
aa: ame) 


Find the first term and the common ratio. 


= 


Write down the series given by 7 re. 


(b) The sum of the squares of the first n whole 
numbers can be found from the formula 


% >  nn+1)(Qn+1) 
i SS 
6 
r=1 
Use this formula to find 
Oyo, Vr, Gee. 


16 (a) Write down the series given by =o I r 


(b) The sum of the cubes of the first n whole 
numbers can be found from the formula 


3 ; - (9 a) 


Use hae — to find 
OL: Oars @ Dar 


17 The third term of an arithmetic progression is 18. The 
fifth term is 28. Find the sum of 20 terms. 


18 (a) Find an expression for the general term in the 


sequence 
2,5,10,17,... 
Solutions 
(a) 1,—-1,1,—-1,1 (b) 1 ae 
a) 1,—1,1,—1, po aieh Sie 
3°5° 7°9 
(a) 8-7 k>1l (b) (-1)k k>1 


1+ 5x + 15x? + 30° 


aq 0, 2,3, 3, —1, -7 


12 (a) 0 
(d) a 


(b) no limit 
(e) no limit 


(c) no limit 


13 (a) -5 (b) 19 


ID) 


20 


21 


14 


15 


16 


17 
18 


19 


20 


21 
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(b) Define the sequence in terms of a recurrence 
relation. 
(a) Show that the equation x? + 2x — 14 = 0 can be 


rearranged into the form x = /14 — 2x. With 
Xg = 2 use simple iteration to find a root of the 
equation. 


(b) Rearrange the equation 0.8 sin x — 0.5x = 0 into 
the form x = g(x). With x) = 2 use simple 
iteration to find a root of the equation. 


(c) Rearrange the equation x? = 2e~* into the form 
x = g(x). With x) = 0 use simple iteration to find 
a root of the equation. 


Write out explicitly the first four terms of the series 
3 (-1)" 2m 
2m (yy1)2 * 
m=0 ae) 
Write out explicitly the first four terms of the series 
oo (= l yn 2m-+1 


2 22m+lm\(m + 1)! 


m=0 


5 

(a) 14449416425 

(b) @) 55. ~— ii): 385—SSs(tt).: 3300 
(a) 14+8+4+274 64 + 1254216 

(b) (i) 441 ~— ii) 6084_— (iii) 5643 
1110 


(a) x[kKJ =k? +1,k =1,2,3,... 
(b) x[k + 1] = x[k] +2k+1 


(a) 2.13 
(b) x = 1.6sinx, 1.6 
(c) x = V2e-*, 0.93 
1 1 1 
1 at + g* GP teas 
1 

Tee C7 

2 16 384 
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INTRODUCTION 


Certain physical quantities are fully described by a single number: for example, the mass 
of a stone, the speed of a car. Such quantities are called scalars. On the other hand, 
some quantities are not fully described until a direction is specified in addition to the 
number. For example, a velocity of 30 metres per second due east is different from a 
velocity of 30 metres per second due north. These quantities are called vectors and it is 
important to distinguish them from scalars. There are many engineering applications in 
which vector and scalar quantities play important roles. For example, speed, potential, 
work and energy are scalar quantities, while velocity, electric and magnetic forces, the 
position of a robot and the state-space representation of a system can all be described 
by vectors. A variety of mathematical techniques have been developed to enable useful 
calculations to be carried out using vectors and in this chapter these will be discussed. 


VECTORS AND SCALARS: BASIC CONCEPTS 


Scalars are the simplest quantities with which to deal; the specification of a single num- 
ber is all that is required. Vectors also have a direction and it is useful to consider a graph- 
ical representation. Thus the line segment AB of length 4 in Figure 7.1 can represent 


7.2.1 


7.2.2 


7.2.3 
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ee ae 
er B AB 
a A ee ae D 
CD 


A Cc 
Figure 7.1 Figure 7.2 
A vector, AB, of length 4. Two equal vectors. 


= 
a vector in the direction shown by the arrow on AB. This vector is denoted by AB. 
> — a — 


Note that AB + BA. The vector AB is directed from A to B, but BA is directed from 
BtoA. a 

An alternative notation is frequently used: we denote AB by a. This bold notation 
is commonly used in textbooks but the notation a is preferable for handwritten work. 
We now need to refer to the diagram to appreciate the intended direction. The length of 
the line segment represents the magnitude, or modulus, of the vector and we use the 


notation |AB|, |a| or simply a to denote this. Note that whereas a is a vector, |a| is a 
scalar. 


Negative vectors 


The vector —a is a vector in the opposite direction to, but with the same magnitude as, 
> — 


a. Geometrically it will be BA. Thus —a is the same as BA. 


Equal vectors 


Two vectors are equal if they have the same magnitude and direction. In Figure 7.2 vec- 
=> a 


tors CD and AB are equal even though their locations differ. This is a useful property of 
vectors: a vector can be translated, maintaining its length and direction without changing 
the vector itself. There are exceptions to this property. For example, we shall soon meet 
position vectors which are used to locate specific fixed points in space. They clearly 
cannot be moved around freely. Nevertheless most of the vectors we shall meet can be 
translated freely, and as such are often called free vectors. 


Vector addition 


It is frequently useful to add two or more vectors together and the addition of vectors is 


defined by the triangle law. Referring to Figure 7.3, if we wish to add AB to CD, CD is 
translated until C and B coincide. As mentioned earlier, this translation does not change 


the vector CD. Then the sum, AB + CD, is defined by the vector represented by the third 
side of the completed triangle, that is AD (Figure 7.4). Thus, 


AB + CD = AD 
Similarly, if AB is denoted by a, CD is denoted by b, AD is denoted by c, then we have 
c=a+b 


We note that to add a and b a triangle is formed using a and b as two of the sides in such 
a way that the head of one vector touches the tail of the other as shown in Figure 7.4. 
The sum a + b is then represented by the third side. 


226 


Chapter 7 Vectors 


BC 
Cc AB 
B — 
CD 
A 
a 
b 
- AD 
D 
D 
Figure 7.4 
Figure 7.3 - os Addition of the two vectors of Figure 7.3 using 
Two vectors, AB and CD. the triangle law. 
It is possible to prove the following rules: 
a+b=b+a vector addition is commutative 


a+(b+c)=(a+b)+c _ vector addition is associative 


To see why it is appropriate to add vectors using the triangle law consider Examples 7.1 
and 7.2. 


Engineering application 7.1 


Routing of an automated vehicle 


Automated vehicles are a common feature of the modern warehouse. Sometimes they 
are guided by tracks set into the factory floor. Consider the case of an automated 
vehicle carrying electrical components from stores at A to workers at C as illustrated 
in Figure 7.5. 

The vehicle may arrive at C either directly, or via point B. The movement from A 


to B can be represented by a vector AB known as a displacement vector, whose mag- 
nitude is the distance between points A and B. Similarly, movement from B to C is 


represented by BC, and movement directly from A to C is represented by AC. Clearly, 
since A, B and C are fixed points, these displacement vectors are fixed too. Since the 


head of the vector AB touches the tail of the vector BC we are ready to use the triangle 
law of vector addition to find the combined effect of the two displacements: 


AB+BC=A 


This means that the combined effect of displacements AB and BC is the displacement 
AC. We say that AC is the resultant of AB and BC. 
€ 


Figure 7.5 
Routing of an automated vehicle from stores at 
o AB A to workers at C. 
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zi 
In considering motion from point A to point B, the vector AB is called a displace- 
ment vector. 


Engineering application 7.2 


Resultant of two forces acting on a body 


Sometimes it is useful to be able to calculate the resultant force when several different 
forces act on a body. Consider the simplest case when there are two forces. 

A force F, of 2 N acts vertically downwards, and a force F, of 3 N acts horizontally 
to the right, upon the body shown in Figure 7.6. Translating F, until its tail touches 
the head of F, we can apply the triangle law to find the combined effect of the two 
forces. This is a single force R known as the resultant of F, and F,. We write 


R=F,+F, 


and say that R is the vector sum of F, and F,. The resultant force, R, acts at an angle 
of @ to the vertical, where tan@ = 3/2, and has a magnitude given by Pythagoras’s 


theorem as ./2? + 32 = /13N. 


Figure 7.6 
Resultant force, R, produced by a vertical force, F,, and a 
horizontal force, F 5. 


Engineering application 7.3 


Resolving a force into two perpendicular directions 


In the previous example we saw that two forces acting upon a body can be replaced by 
a single force which has the same effect. Equivalently we can consider a single force 
as two forces acting at right angles to each other. Consider the force F in Figure 7.7. 
It can be replaced by two forces, one of magnitude |F| cos 6 and one of magnitude 
|F| sin 6 as shown. We say that the force F has been resolved into two perpendicular 
components. 

For example, Figure 7.8 shows a force of 5 N acting at an angle of 30° to the x 
axis. It can be resolved into two components. The first component is directed along 
the x axis and has magnitude 5 cos 30° N. The second component is perpendicular 
to the first and has magnitude 5 sin30° N. These two components combine to have 
the same effect as the original 5 N force. 
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Example 7.1 


Solution 


7.2.4 


Example 7.2 


ene 5 sin 30° 
F cos @ 5 cos 30° 
Figure 7.7 Figure 7.8 
A force F can be resolved into two The 5 N force can be resolved into two 
perpendicular components. perpendicular components. 


Vectors p, q, r and s form the sides of the square shown in Figure 7.9. Express 


(a) pin terms of r 
(b) sin terms of q 


(c) diagonal BD in terms of q andr 


(a) The vector p has the same length as r but has the opposite direction. Therefore 
p=-r. 
(b) Vector s has the same length as q but has the opposite direction. Therefore s = —q. 
(c) The head of BC coincides with the tail of CD. Therefore, by the triangle law of 
addition, the third side of triangle BCD represents the sum of BC and CD, that is 
BD =BC+CD 
= q + r 


Vector subtraction 


Subtraction of one vector from another is performed by adding the corresponding neg- 
ative vector; that is, if we seek a — b, we form a+ (—b). 


Consider the rectangle illustrated in Figure 7.10 with a and b as shown. Express in terms 
of A, B, C or D the vectors a — b and b — a. 


B a C 

B Cc 
p r 

a 
A s D 

A Db D 
Figure 7.9 
The vectors p, q, r and s form the sides Figure 7.10 


of a square. The rectangle ABCD. 


Solution 


Example 7.3 


Solution 
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Figure 7.11 - = Figure 7.1 2. = 
The vectors AB and DA. The vectors BA and AD. 
We have 
b=AD 
hence 
—b=DA 
Then 
a—b=a+(-b) 
= AB+DA 


= DA+AB by commutativity 


DA and AB are shown in Figure 7.11. Their sum is given by the triangle law, that is 
a—b=DA+AB = DB 
Similarly, 
a= AB 
hence 
—a=BA 
Then 
b—a=b+ (—a) 
= AD +BA 
= BA +AD 
BA and AD are shown in Figure 7.12. Again their sum is given by the triangle law, that is 
b-—a= BA +AD = BD 


Referring to Figure 7.13, ifr, and r, are as shown, find the vector b = QP represented 
by the third side of the triangle OPQ. 


From Figure 7.13 we note from the triangle law that 


OP = QO + OP = OP + QO by commutativity. 
But QO = —r, and QP = b, and so 


b=r,—-r, 
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Figure 7.13 
The two vectors r, and r5. 


Example 7.4 


Solution 


20 


Q a 
PQ Q 
E P 
Pp ) q 
R 
O . O 
Figure 7.14 Figure 7.15 
The tetrahedron OPQR. The triangle OPQ. 


Vectors do not necessarily lie in a two-dimensional plane. Three-dimensional vectors 
are commonly used as is illustrated in the following example. 


OPQR is the tetrahedron shown in Figure 7.14. Let OP =p, 00 = q and OR = r. 
Express PQ, QR and RP in terms of p, q andr. 


Consider the triangle OPQ shown in Figure 7.15. We note that OQ represents the third 


side of the triangle formed when p and P@ are placed head to tail. Using the triangle law 
we find 


OP + PQ=0Q 
Therefore, 
PQ = OQ — OP 
=q-—Pp 


Similarly, QR = r —q and RP = p—r. 


Multiplication of a vector by a scalar 


If k is any positive scalar and a is a vector then ka is a vector in the same direction as a 
but k times as long. If k is any negative scalar, ka is a vector in the opposite direction to 
a, and k times as long. By way of example, study the vectors in Figure 7.16. Clearly 2a 


is twice as long as a but has the same direction. The vector rg is half as long as b but 


has the same direction as b. It is possible to prove the following rules. 


For any scalars k and /, and any vectors a and b: 
k(a+b) =ka+kb 
(k+)Da=ka+la 
k(la) = (kl)a 


The vector ka is said to be a scalar multiple of a. 


Example 7.5 


Solution 


7.2.6 
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A 


\ 
\ 
o 
erie 
> 
> 
: = 


B 
Figure 7.17 
Figure 7.16 The triangle ABC, with midpoint M 
Scalar multiplication of a vector. of side AB. 


In a triangle ABC, M is the midpoint of AB. Let AB be denoted by a, and BC by b. 
Express AC, CA and CM in terms of a and b. 


The situation is sketched in Figure 7.17. Using the triangle rule for addition we find 
AB + BC = AC 

Therefore, 
AC =a+b 


It follows that CA = —AC = —(a+b). 
Again by the triangle rule applied to triangle CMB we find 


CM = CB + BM 


=> 1= 1 
Now BM = i = =a0 and so 


Unit vectors 
Vectors which have length 1 are called unit vectors. If a has length 3, for example, 
then a unit vector in the direction of a is clearly at More generally we denote the unit 


vector in the direction a by a. Recall that the length or modulus of a is |a| and so we can 
write 


1 
Note that |a| and hence jal are scalars. 
a 
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1 
2 
4 
5 


7.2.7 Orthogonal vectors 


If the angle between two vectors a and b is 90°, that is a and b are perpendicular, then a 


and b are said to be orthogonal. 


EXERCISES 7.2 


For the arbitrary points A, B, C, D and E, find a single 
vector which is equivalent to 
> => => 


(a) DC+ CB (b) CE+ DC 
=> > 
Figure 7.18 shows a cube. Let p = AB, q = AD and 


=> > OS 
r = AE. Express the vectors representing BD, AC and 
= 


AG in terms of p, q and r. 


Figure 7.18 


Solutions 


(a) DB (b) DE 
q—-P.4+P.4+rt+p 
q,—-P.4—P,-P—-4q 
See Figure S.16. 


13.9 N at an angle of 59.7° to the negative x axis 


6.34 N, 13.59 N 


In a triangle ABC, M is the midpoint of BC, and N is 
=> > 
the midpoint of AC. Show that NM = 3AB. 


Consider a rectangle with vertices at E, F, G and H. 
> => 
Suppose EF = p and FG = q. Express each of the 
> — — => 
vectors EH, GH, FH and GE in terms of p and q. 


If a is an arbitrary vector, represent on a diagram the 


vectors a, ty - a 3a, —3a and a. 

2 +4 
A particle is positioned at the origin. Two forces act 
on the particle. The first force has magnitude 7 N 
and acts in the negative x direction. The second force 
has magnitude 12 N and acts in the y direction. 
Calculate the magnitude and direction of the resultant 
force. 


A force of 15 N acts at an angle of 65° to the x axis. 
Resolve this force into two forces, one directed along 
the x axis and the other directed along the 

y axis. 


— > 
a 
1 
7a 
—ub 
_a 
4 
eee 
3a 
es 
3a 
Figure S.16 


CARTESIAN COMPONENTS 


Consider the x—y plane in Figure 7.19. The general point P has coordinates (x, y). We 


can join the origin to P by a vector OP, which is called the position vector of P, which 


we often denote by r. The modulus of r is |r| = r, and is the length of OP. It is possible 


Figure 


7.19 


The x-y plane with 
point P. 


Example 7.6 


Solution 
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to express r in terms of the numbers of x and y. If we denote a unit vector along the x 
axis by i, and a unit vector along the y axis by j (we usually omit the * here), then it is 
> — 


clear from the definition of scalar multiplication that OM = xi, and MP = yj. It follows 
from the triangle law of addition that 


r = OP = OM + MP = xi+ yj 


Clearly the vectors i and j are orthogonal. The numbers x and y are the i and j compo- 
nents of r. Furthermore, using Pythagoras’s theorem we can deduce that 


r= (249 


Alternative notations which are sometimes useful are 


r= or =(*) 
y 


and 
r= OP = (x,y) 


When written in these forms (*) is called a column vector and (x, y) is called a row 
y 


vector. To avoid confusion with the coordinates (x, y) we shall not use row vectors here 
but they will be needed in Chapter 26. We will also use the column vector notation for 
more general vectors, thus, 


a 
ai+ bj= ( ) 
IN 
We said earlier that a vector can be translated, maintaining its length and direction with- 
out changing the vector itself. While this is true generally, position vectors form an im- 


portant exception. Position vectors are constrained to their specific position and must 
always remain tied to the origin. 


If A is the point with coordinates (5, 4) and B is the point with coordinates (—3, 2) find 
the position vectors of A and B, and the vector AB. Further, find \AB\. 


The position vector of A is 5i+ 4j = i which we shall denote by a. The position 
vector of B is —3i + 2j = (>). which we shall denote by b. Application of the 
triangle law to triangle OAB (Figure 7.20) gives 

OA +AB = OB 
that is 


a+AB=b 
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; 114 tt Figure 7.20 
6-5 4-3 -2-1 0 1 2 3 4 5 6 x Points A and B in the x-y plane. 


Therefore, 
AB=b-a 


(—3i + 2j) — (5i+ 4j) 
= —8i — 2j 


Alternatively, in terms of column vectors 


We note that subtraction (and likewise addition) of column vectors is carried out compo- 


nent by component. To find |AB | we must obtain the length of the vector AB. Referring 
to Figure 7.20, we note that this quantity is the length of the hypotenuse of a right-angled 


triangle with perpendicular sides 8 and 2. That is, |AB| = /8? + 27 = /68 = 8.25. 
More generally we have the following result: 


Given vectors a = OA =ai+a,jandb= OB = b,i+ b,j (Figure 7.21), then 
AB= po aah Sail =a) 
and 
|AB| = |b —al = |(b, — 4,)i + @, —4,)jl 
= /,- 4,2 +, - 4? 


Example 7.7 Ifa= ({) and b = ee 


(a) find a+ b, a — b, b + a and b — a, commenting upon the results 
(b) find 2a — 3b 
(c) find Ja — bj. 
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Figure 7.21 
The quantity Figure 7.22 
|b-—al = {@ - a)? + (by — ay). The quantity |r| = x? + y?2 4 22. 


Solution (2) a+b= ({) + (5) - (3) 
(CD 
C00 
Seria) 2) 


We note that addition is commutative whereas subtraction is not. 


w¥e2(0)-9(3)-(9)-C)-2) 


The previous development readily generalizes to the three-dimensional case. Taking 
Cartesian axes x, y and z, any point in three-dimensional space can be represented by 
giving x, y and z coordinates (Figure 7.22). Denoting unit vectors along these axes by i, 
jand k, respectively, we can write the vector from O to P(x, y, z) as 


Be % 
OP =r=xi+yj+zk= | y 


& 


The vectors i, j and k are orthogonal. 


Example 7.8 If r = xi + yj + zk show that the modulus of r is r = /x2 + y? + 2. 


Solution Recalling Figure 7.22 we first calculate the length of OB. Now OAB is a right-angled 
triangle with perpendicular sides OA = x and AB = y. Therefore by Pythagoras’s 
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theorem OB has length ,/x? + y?. Then, applying Pythagoras’s theorem to right-angled 
triangle OBP which has perpendicular sides OB and BP = z, we find 


|r| = OP = VOB’ + BP” 
H=V@+Y)+2 
HV¥P+ty+7 


as required. 


Ifr=xi+ yj + zk then rl = VP +y+2 


In three dimensions we have the following general result: 


Given vectors a = OA = a,i+ a,j + a,k and b = OB = b,i+ b,j + bk, then 
AB = b—a= (b, —a,)i+ (b) — a,j + @, —a,)k 
and 
|AB| = |b — al = |(b, —a,)i + (b, —@,)j + (0; — a,)k| 
= / ©, - 4)? +, -—4P + 6-45)? 


Example 7.9 Ifa =3i—2j+kandb = —2i+j —5k, find 
(a) [al (b) a (c) |b| (d) b (e)b—a (f) |b — al 


Solution (a) ja| = //3? + (—2)? + 2= /14 


pie ee pas i 
la} = 14 J/14 J14 14 

(c) |b] = /(—2)? + 12 + (—5)? = V30 

Ope = sie ae 
Ib} /30 J30 =—V30—S( 30 


(ce) b—a=—Si + 3j — 6k 
(f) |b —al = (5)? +3? + (—6)? = V70 


7.3.1 The zero vector 


A vector, all the components of which are zero, is called a zero vector and is denoted 
by 0 to distinguish it from the scalar 0. Clearly the zero vector has a length of 0; it is 
unusual in that it has arbitrary direction. 
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Engineering application 7.4 


Robot positions 


Position vectors provide a useful means of determining the position of a robot. There 
are many different types of robot but a common type uses a series of rigid links con- 
nected together by flexible joints. Usually the mechanism is anchored at one point. 
A typical example is illustrated in Figure 7.23. 


Figure 7.23 
A typical robot configuration with vectors representing 
the robot links. 


The anchor point is X and the tip of the robot is situated at point Y. The final link 
is sometimes called the hand of the robot. The hand often has rotating and gripping 
facilities and its size relative to the rest of the robot is usually quite small. Each of the 
robot links can be represented by a vector (see Figure 7.23). The vector d corresponds 
to the hand. A common requirement in robotics is to be able to calculate the position 
of the tip of the hand to ensure it does not collide with other objects. This can be 
achieved by defining a set of Cartesian coordinates with origin at the anchor point 
of the robot, X. Each of the link vectors can then be represented in terms of these 
coordinates. For example, in the case of the robot in Figure 7.23: 


a=ai+aj+a,k c=ci+oj+c,k 
b=),i+ b,j +),k d=d,i+ d,j+d,k 


The position of the tip of the hand can be calculated by adding together these vectors. 
So, 


p=a+bt+c+d 
= @ shy HEE FP GA IE (] AE Dy AP © FE CHM ae (GFE Dg Pe EGAN 


7.3.2 Linear combinations, dependence and independence 


Suppose we have two vectors a and b. If we form arbitrary scalar multiples of these, that 
is k,a and k,b, and add these together, we obtain a new vector c where c = k,a+k,b. 
The vector c is said to be a linear combination of a and b. Note that scalar multipli- 
cation and addition of vectors are the only operations allowed when forming a linear 
combination. Vector ¢ is said to depend linearly on a and b. Of course we could also 
write 


k 
a=—c-— 7 provided k, 4 0 
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so that a depends linearly on c and b. Provided k, + 0, then 


b 1 ky 
=—c-——a 
k, k, 
so that b depends linearly on c and a. The set of vectors {a, b, c} is said to be linearly 
dependent and any one of the set can be written as a linear combination of the other 


two. In general, we have the following definition: 


A set of n vectors {a,, a,,..., a,} is linearly dependent if the expression 
ka, +k,a,+---+k,a, =0 


can be satisfied by finding scalar constants k,,k,,...,k,,, not all of which are zero. 


If the only way we can make the combination zero is by choosing all the k,s to be 
zero, then the given set of vectors is said to be linearly independent. 


Example 7.10 Show that the vectors i and j are linearly independent. 


Solution We form the expression k,i+ k,j = 0 and try to choose k, and k, so that the equation is 
satisfied. Using column vectors we have 


_()e40)-0) 
DRORORG 


The only way we can satisfy the equation is by choosing k, = 0 and k, = 0 and hence 
we conclude that the vectors i and j are linearly independent. Geometrically, we note 
that since they are perpendicular, no scalar multiple of i can give j and vice versa. 


Example 7.11 The vectors 


1 5 13 
2 1 —1 
3 9 21 
are linearly dependent because, for example 
5 1 13 0 
37, 1)7-—272]—-—1]—-1]= {0 
9 3 21 0 
EXERCISES 7.3 
1 Pand Q lie in the x-y plane. Find PQ, where P is the 2 AandB lie in the x-y plane. If A is the point (3, 4) 
point with coordinates (5, 1) and Q is the point with and B i B is the point (1, —5) write down the vectors 
> 
coordinates (—1, 4). Find |PQ|. OA, OB and AB. Find a unit vector in the direction 


of AB. 


If a = 4i — j + 3k and b = —2i + 2j — k, find unit 
vectors in the directions of a, b and b — a. 


Ifa = 5i — 2j, b = 3i — 7j and c = —3i-+ 4j, express, 


in terms of i and j, 


a+b a+ec c—b 3c — 4b 


Draw diagrams to illustrate your results. Repeat the 
calculations using column vector notation. 


Write down a unit vector which is parallel to the line 
y=7x—-3. 


Find PQ where P is the point in three-dimensional 
space with coordinates (4, 1, 3) and Q is the point 
with coordinates (1, 2, 4). Find the distance between 
P and Q. Further, find the position vector of the point 
dividing PQ in the ratio 1:3. 


Solutions 


> > 
PQ = —6i+ 3j |PQ| = 6.71 
3i + 4j, i — 5j, —2i — 9j 

—1 
unit vector: —— (2i+ 9j) 

V85 , 

1 (4i 
i- 
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1 
—= (—6i + 3j — 4k) 
V6l ? 


1 
j+ 3k), eo + 2j —k), 


Technical Computing Exercises 7.3 


To plot a displacement vector between points (1,2) and 
(3,4) in MATLAB® we could type: 


X=[1 3]; 
y=[2 4]; 
plot (X,Y) 


which would result in a plot containing a single line 
between the two points. Notice the order in which the 
coordinates are passed to the plot function. The first 
entry in X and the first entry in Y are for the first 
location and so on. If we wanted to plot a second 
displacement vector from (3, 4) to (5, 6) we would 
type: 


X=[1 3 5]; 
y=[2 4 6]; 
plot (X,Y) 
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If P, Q and R have coordinates (3, 2, 1), (2, 1, 2) and 
(1, 3, 3), respectively, use vectors to determine which 
pair of points are closest to each other. 


Consider the robot of Example 7.4. The link vectors 
have the following values: 


a = 121+ 18j+k 
b = 6i— 3j + 8k 
c = 3i+ 2j — 4k 
d = 0.5i — 0.2j + 0.6k 


Calculate the length of each of the links and the 
position vector of the tip of the robot. 


Show that the vectors a = i+ j and b = —i+j are 
linearly independent. 


Show that the vectors i, j and k are linearly 
independent. 


8i — 9j, 21 + 2j, —6i + 11j .—21i + 40j 

1 
50 
PO = —3i+j+k, distance from P to Q = 3.32, 
513i + 5j + 13k) 


(i+ 7j) 


PandQ 


21.66, 10.44, 5.39, 0.81, 21.5i+ 16.8j + 5.6k 


Now consider the problem of routing an automated 
vehicle between the following positions on a factory 


floor: 

Doorway: A = (0,0) 
Resistor bin: B=(1, 1) 
Capacitor bin: C= (1, -0.5) 
PCB rack: D = (2,0) 
Assembly line: E = (2.5, 0) 


For simplicity you may assume that the robot will 
not collide with any objects if it travels directly 
between any of those two points. 


Plot the displacement vectors for the robot if it 
enters through the doorway and visits the resistor 
bin, the PCB rack and then the assembly line. How 
would this action be written in vector notation? 
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2 Calculate the distance travelled by the robot. aut throgeh ihe doarway and finish at the . 
assembly line. By trial and error produce a solution 
3 Now consider the problem of visiting all of the to the problem, using the computer to plot each 
positions A, B, C, D and E one time only, with the trial. Express your final calculated solution in 
robot travelling the least distance. The robot must vector notation. 
ye/— SCALAR FIELDS AND VECTOR FIELDS 


Imagine a large room filled with air. At any point, P, we can measure the temperature, ¢, 
say. The temperature will depend upon whereabouts in the room we take the measure- 
ment. Perhaps, close to a radiator the temperature will be higher than near to an open 
window. Clearly the temperature ¢ is a function of the position of the point. If we label 
the point by its Cartesian coordinates (x, y, z), then @ will be a function of x, y and z, 
that is 


b = (%, y, 2) 


Additionally, @ may be a function of time but for now we will leave this additional 
complication aside. Since temperature is a scalar what we have done is define a scalar 
at each point P(x, y, z) ina region. This is an example of a scalar field. 

Alternatively, suppose we consider the motion of a large body of fluid. At each point, 
fluid will be moving with a certain speed in a certain direction; that is, each small fluid 
element has a particular velocity, v, depending upon whereabouts in the fluid it is. Since 
velocity is a vector, what we have done is define a vector at each point P(x, y, z). We 
now have a vector function of x, y and z, known as a vector field. Let us write 


V= (v,, U,, U,) 
so that v,, v, and v, are the i, j and k components respectively of v, that is 
v=v,i+ v,j+ou,k 


We note that v,, v, and v, will each be scalar functions of x, y and z. 


Engineering application 7.5 


Electric field strength E and electric displacement D 


Electrostatics is the study of the forces which stationary positive and negative electric 
charges exert upon one another. Consider Figure 7.24 which shows a single positive 
charge placed at O. The presence of this charge gives rise to an electric force field 
around it. Faraday introduced the idea of lines of force to help visualize the field. 
At any point, P, there exists a vector which gives the direction and magnitude of 
the electrostatic force at P. Because all the lines of force emerge radially from O, 
the direction of the electrostatic force is radially outwards. It can be shown that the 
magnitude of the force is inversely proportional to the square of the distance from O. 


b 
i 
a 


Figure 7.25 


Two vectors a and b 
separated by 
angle @. 
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Figure 7.24 
A charge at O gives rise to an electric field E. 


If a second charge is placed in this field it experiences a force. An important quan- 
tity is the electric field strength, E. This is a vector field which describes the force 
experienced by a unit charge. 

A related quantity is the electric displacement, D, also called the electric flux 
density, defined as D = cE, where « is called the permittivity of the medium in 
which the field is located. Note that D is a scalar multiple of E. 


Engineering application 7.6 


Electrostatic potential V 


An important electrostatic field is the electrostatic potential V. This is an example 
of a scalar field. The difference between the potential measured at any two points in 
the field is equal to the work which needs to be done to move a unit charge from one 
point to the other. Later, in Chapter 26, we will see that the scalar field V is closely 
related to the vector field E. 


THE SCALAR PRODUCT 


Given any two vectors a and b, there are two ways in which we can define their product. 
These are known as the scalar product and the vector product. As the names suggest, the 
result of finding a scalar product is a scalar whereas the result of finding a vector product 
is a vector. The scalar product of a and b is written as 


a-b 


This notation gives rise to the alternative name dot product. It is defined by the formula 


a-b = |al|b| cos 0 


where 6 is the angle between the two vectors as shown in Figure 7.25. 
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Example 7.12 


Solution 


Example 7.13 


Solution 


From the definition of the scalar product, it is possible to show that the following 
rules hold: 


a-b=b-a the scalar product is commutative 
k(a-b) = (ka-b) where k is a scalar 
(a+b)-c=(a-c)+(b-c) _ the distributive rule 


It is important at this stage to realize that notation is very important in vector work. You 
should not use a x to denote the scalar product because this is the symbol we shall use 
for the vector product. 


If a and b are parallel vectors, show that a- b = |a||b|. If a and b are orthogonal show 
that their scalar product is zero. 


If a and b are parallel then the angle between them is zero. Therefore a-b = 
|a||b| cos O° = |al|b|. If a and b are orthogonal, then the angle between them is 90° 
and a+b = |a||b| cos 90° = 0. 

Similarly we can show that if a and b are two non-zero vectors for which a- b = 0, 
then a and b must be orthogonal. 


If a and b are parallel vectors, a- b = |a||b]. 
If a and b are orthogonal vectors, a- b = 0. 


An immediate consequence of the previous result is the following useful set of formulae: 


Ifa = a,i+ a,j +a,k and b = 5,i+ b,j + b,k show that a- b = a,b, +. a,b, + ayb,. 


We have 


a-b = (a,i+ a,j + a,k)- (b)i+ b,j + bk) 
= a,i- (b)i+ b,j + b3k) + a,j- (b)i+ b,j + bk) 
+a,k- (b,i+ b,j + 63k) 
=a,bi-i+a,b,i-j+a,b,i-k+a,b,j-i+a4,b,j-j+ab,j-k 
+a,b,k-i+a,b,k-j+a,b,k-k 
= a,b, + ab, + azb, 
as required. Thus, given two vectors in component form their scalar product is the sum 


of the products of corresponding components. 


The result developed in Example 7.13 is important and should be memorized: 


Ifa =a,i+a,j+a,k and b = b,i+ b,j + ),k, 
then a-b = a,b, + a,b, + azb3. 


Example 7.14 


Solution 


Example 7.15 


Solution 


Example 7.16 


Solution 
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If a = 5i — 3j + 2k and b = —2i + 4j + k find the scalar product a-b. 


Using the previous result we find 


a-b = (Si — 3j + 2k) - (-2i+ 4j +k) 
(5)(—2) + (—3)(4) + (2)d) 
=—-10—124+2 

—20 


Ifa = a,i+ a,j + ak, find 
(a) ava (b) Jal? 


(a) Using the previous result we find 


a-a= (a,i+ a,j + a,k) + (a,i+ a,j + a;k) 


Ao 54 3 
=a +4, + 43 


(b) From Example 7.8 we know that the modulus of r = xi + yj + zk is /x? +y? +2 
and therefore 


lal = ,/aj +45 +4; 


so that 


la’ =ai+a+a3 


We note the general result that 


a-a= |al 


If a = 3i+- j — k and b = 2i+ j + 2k find a- b and the angle between a and b. 


We have 
a-b = (3)(2) + (I) (1) + (-1)(2) 
=6+1-2 
=> 


Furthermore, from the definition of the scalar product a+b = |a||b| cos 8. Now, 
laj=J9+14+1=V11 and [bl =V44+14+4=3 
Therefore, 
a-b 2) 
~ falibl ~ 3VT1 


from which we deduce that 6 = 59.8° or 1.04 radians. 


cos 0 


244 


Chapter 7 Vectors 


Engineering application 7.7 


Work done by a force 


If a force is applied to an object and the object moves, then work is done by the force. 
It is possible to use vectors to calculate the work done. Suppose a constant force F 
is applied and as a consequence the object moves from A to B, represented by the 
displacement vector s, as shown in Figure 7.26. 


one perpendicular to s. The work done by each component is equal to the product of 
its magnitude and the distance moved in its direction. The component perpendicular 
to s will not do any work because there is no movement in this direction. For the 
component along s, that is |F| cos 6, we find 


From the definition of the scalar product we see that the r.h.s. of this expression is 
the scalar product of F and s. 


We can resolve the force into two perpendicular components, one parallel to s and 


work done = |F| cos @|s| 


Figure 7.26 
The component of F in the direction of s 
is F cos@. 


The work done by a constant force F which moves an object along the vector s is 
equal to the scalar product Fs. 


Example 7.17 A force F = 3i + 2j — k is applied to an object which moves through a displacement 


s = 2i+ 2j +k. Find the work done by the force. 


Solution The work done is equal to 


F-s = (3i+ 2j —k)- (i+ 2) +k) 
=6+4-1 
= 9 joules 


Engineering application 7.8 


Movement of a charged particle in an electric field 


Figure 7.27 shows two charged plates situated in a vacuum. Plate A has an excess of 
positive charge, while plate B has an excess of negative charge. Such an arrangement 
gives rise to an electric field. An electric field is an example of a vector field. 
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x 


t++eee tlt +444 


- Figure 7.27 
i E Two charged plates situated in a vacuum. 

In Figure 7.27 the electric field vector E in the region of space between the charged 
plates has a direction perpendicular to the plates pointing from A to B. The magnitude 
of the electric field vector is constant in this region if end effects are ignored. If 
a charged particle is required to move against an electric field, then work must be 
done to achieve this. For example, to transport a positively charged particle from the 
surface of plate B to the surface of plate A would require work to be done. This would 
lead to an increase in potential of the charged particle. 

If s represents the displacement and V the potential it is conventional to write ds 
to represent a very small change in displacement, and dV to represent a very small 
change in potential. 

If a unit positive charge is moved a small displacement 6s in an electric field (Fig- 
ure 7.27) then the change in potential 6V is given by 

6V = —-E- 5s (7.1) 
This is an example of the use of a scalar product. Notice that although E and és are 
vector quantities the change in potential, dV, is a scalar. 

Consider again the charged plates of Figure 7.27. If a unit charge is moved a small 
displacement along the plane X, then ds is perpendicular to E. So, 

6V = —E- ds = —|B||6s| cos 0 
With 6 = 90°, we find 5V = 0. The surface X is known as an equipotential surface 
because movement of a charged particle along this surface does not give rise to a 
change in its potential. Movement of a charge in a direction parallel to the electric 
field gives rise to the maximum change in potential, as for this case 6 = 0°. 


EXERCISES 7.5 
If a = 3i — 7j and b = 2i+ 4j finda-b, b-a,a-a 5 Find the angle between the vectors 7i + j and 4j — k. 
and b- b. 


6 Find the angle between the vectors 4i — 2j and 3i — 3j. 


Ifa = 4i + 2j —k, b = 31 — 3j + 3k and 


c = 2i—j-—k, find 


7 Ifa=7i+ 8jand b = Sifinda-b. 


(a) a-a_ ss (b) a-b 3 5 
(c) a-e (d) b-c 8 Ifr,; =| 1] andr, =| 1 | findr,-r,,r,-r, 
Evaluate (—13i — 5j) - (—3i+ 4j). 

and r, +P. 


Find the angle between the vectors p = 7i+ 3j + 2k 
and q = 2i-—j+k. 


Given that p = 2q simplify p-q, (p + 5q) - q and 
(q—P):P. 
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10 Find the modulus of a = i—j —k, the modulus of 


b = 2i+j + 2k and the scalar product a - b. Deduce 
the angle between a and b. 


11 Ifa =2i+2j —k and b = 3i— 6j + 2k, find 


|a|, |b], a- b and the angle between a and b. 


12 Use a vector method to show that the diagonals of the 


rF WwW 


rhombus shown in Figure 7.28 intersect at 90°. 


13 


14 


15 


B 
16 
A Cc 
17 
D 
Figure 7.28 
The rhombus ABCD. 
Solutions 
—22, —22, 58, 20 9 
(a) 21 (b) 3 (c) 7 (d) 6 10 
19 11 
47.62° 13 
82.11° 15 
18.4° 16 
7 17 
14, 16, 26 
y¥A-m@ THE VECTOR PRODUCT 


Use the scalar product to find a two-dimensional 
vector a = a,i+ a,j perpendicular to the vector 
4i — 2j. 


Ifa = 3i — 2j, b = 7i+ Sj and c = 9i — j, show that 
a-(b—c) = (a-b) — (a-c). 


Find the work done by the force F = 3i—-j +k 
in moving an object through a displacement 
s= 3i+ 5j. 


A force of magnitude 14 N acts in the direction 

i+ j-+k upon an object. It causes the object to move 
from point A(2, 1, 0) to point B(3, 3, 3). Find the 
work done by the force. 


(a) Use the scalar product to show that the 


component of a in the direction of b is a- b, 
where b is a unit vector in the direction of b. 


(b) Find the component of 2i + 3j in the direction of 
i+ 5j. 


2Iql?, 7lql?, —2iq/? 
3,3, =1, 101.1" 
3,7, =8, 119 4° 

c(i + 2j), c constant 
4J 

48.5 J 

17/26 


The result of finding the vector product of a and b is a vector of length |a||b| sind, 
where @ is the angle between a and b. The direction of this vector is such that it is 
perpendicular to a and to b, and so it is perpendicular to the plane containing a and 
b (Figure 7.29). There are, however, two possible directions for this vector, but it is 
conventional to choose the one associated with the application of the right-handed screw 
rule. Imagine turning a right-handed screw in the sense from a towards b as shown. A 
right-handed screw is one which, when turned clockwise, enters the material into which 
it is being screwed. The direction in which the screw advances is the direction of the 
required vector product. The symbol we shall use to denote the vector product is x. 


Formally, we write 


ax b= |allb| sindé 
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b 
Plane containing 
xb 
7 g aandb 
a b 
Figure 7.29 ‘ 
a x b is perpendicular to the plane containing Figure 7.30 
a and b. The right-handed screw rule allows Right-handed screw rule allows the 
the direction of a x b to be found. direction of b x a to be found. 


where é is the unit vector required to define the appropriate direction, that is é is a unit 
vector perpendicular to a and to b in a sense defined by the right-handed screw rule. To 
evaluate b x a we must imagine turning the screw from the direction of b towards that 
of a. The screw will advance as shown in Figure 7.30. 

We notice immediately that a x b # b x a since their directions are different. From 
the definition of the vector product, it is possible to show that the following rules hold: 


ax b=—(b x a) the vector product is not commutative 
ax (b+c) =(axb)+(axc) _ the distributive rule 
k(a x b) = (ka) x b=ax (kb) where kis a scalar 


Example 7.18 If aand b are parallel show that a x b = 0. 


Solution If a and b are parallel then the angle between a and b is zero. Therefore, a x b = 
|a||b| sin 0 é = 0. Note that the answer is still a vector, and that we denote the zero vector 
Oi + Oj + Ok by 0, to distinguish it from the scalar 0. In particular, we note that 


ixi=jxj=kxk=0 


If a and b are parallel, then a x b= 0. 


In particular: 


ixi=jxjokxk=0 


Example 7.19 Show that i x j = k and find expressions for j x k andk x i. 


Solution We note that i and j are perpendicular so that |i x j| = |i||j| sin90° = 1. Furthermore, 
the unit vector perpendicular to i and to j in the sense defined by the right-handed screw 
tule is k. Therefore, i x j = k as required. Similarly you should be able to show that 


jx k=iandk xi=j. 


ixj=k jxk=i kxi=j 
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Example 7.20 


Solution 


Example 7.21 


Solution 


7.6.1 


Simplify (a x b) — (b x a). 


Use the result a x b = —(b x a) to obtain 


(a x b) — (b x a) = (a x b) — (—(a x b)) 
= (a x b) + (ax b) 
= 2(a x b) 


(a) Ifa =a,i+a,j+a,k and b = b,i+ b,j + b,k, show that 
a x b = (a,b, — a,b,)i — (a,b, — a,b,)j + (a,b, — a,b, )k 


(b) Ifa = 2i+j + 3k and b = 3i+ 2j +k finda x b. 


(a) ax b= (a,i+ a,j + a,k) x (b,1+ b,j + 65k) 
=ai x (bhi+ b,j + bk) + aj x (b)i+ b,j + bk) 
+a,k x (b,i+ b,j + b,k) 
=a,b,ixi)+a,b,0 x j) +4), x k) +4,),9 x i) +4a,b,4 x j) 
+ a,b, x k) + a,b, (k x i) + a,b, (k x j) +. a,b, (k x k) 
Using the results of Examples 7.18 and 7.19, we find that the expression for a x b 
simplifies to 


ax b = (a,b, — a,b,)i — (a,b, — a,b,)j + (a,b, — a,b, )k 


(b) Using the result of part (a) we have 


ax b = ((1)(1) — 3)(2))i — (2) CZ) — 3)B3))j + (2) (2) — (1) GB))k 
= —5i+7j+k 


Verify for yourself that b x a = 5i— 7j —k. 


Using determinants to evaluate vector products 


Evaluation of a vector product using the previous formula is very cumbersome. A more 
convenient and easily remembered method is now described. The vectors a and b are 
written in the following pattern: 


ij k 
a, a, a; 
b, b, dy 


This quantity is called a determinant. A more thorough treatment of determinants is 
given in Section 8.7. To find the i component of the vector product, imagine crossing 
out the row and column containing i and performing the following calculation on what 
is left, that is 


a,b, — axb, 
The resulting number is the i component of the vector product. The j component is found 


by crossing out the row and column containing j, performing a similar calculation, but 
now changing the sign of the result. Thus the j component equals 


—(a,b; — ayb,) 
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The k component is found by crossing out the row and column containing k and per- 
forming the calculation 
a,b, — ayb, 
We have 
Ifa = a,i+ a,j + a,k and b = b,i+ b,j + bk, then 


ij k 
axb=|a, a a, 
b, b, b; 


(a,b, — a,b,)i — (a,b, — a3b,)j + (a,b, — ayb,)k 


Example 7.22 Find the vector product of a = 2i + 3j + 7k and b =i+ 2j +k. 


Solution The two given vectors are represented in the following determinant: 


i k 
2 7 
1 1 


bh UO Seme 


Evaluating this determinant we obtain 

ax b= 3- 14)i- Q@-7)j+ 4-3)k=—-11i+5j+k 
You will find that, with practice, this method of evaluating a vector product is simple to 
apply. 


7.6.2 Applications of the vector product 


The following three examples illustrate applications of the vector product. 


Engineering application 7.9 


Magnetic flux density B and magnetic field strength H 


It is possible to model the effect of magnetism by means of a vector field. A magnetic 
field with magnetic flux density B is a vector field which is defined in relation to the 
force it exerts on a moving charged particle placed in the field. Consider Figure 7.31. 
If a charge g moves with velocity v in a magnetic field B it experiences a force F 
given by 
F=qvxB 

Note that this force is defined using a vector product. The unit of magnetic flux den- 
sity is the weber per square metre (Wb m7”), or tesla (T). The direction of this force 


is at right angles to both v and B, in a sense defined by the right-handed screw rule. 
Its magnitude, or modulus, is 


F = qvBsin@ 
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q Figure 7.31 
Force, F, exerted on a particle with charge, g, when moving with 
Vi velocity, v, in a magnetic field, B. 


where @ is the angle between v and B, v is the modulus of v and B is the modulus 
of B. 
: : la 
Note that if 9 = 90°, sind = 1, then B = ie 

These formulae are useful because they can be used to calculate the forces exerted 
on a conductor in an electric motor. They are also used to analyse electricity gener- 
ators in which the motion of a conductor in a magnetic field leads to movement of 
charges within the conductor, thus generating electricity. 

A related quantity is the magnetic field strength, or the magnetic field intensity, 
H, defined from 


B= wH 


jz is called the permeability of a material and has units of webers per ampere per 
metre (Wb A-'m~!). The units of H are amperes per metre (A m™'). Confirm for 
yourself that the units match on both sides of the equation. 


Engineering application 7.10 


Magnetic field due to a moving charge 


A charge g moving with velocity v gives rise to a magnetic field with magnetic flux 
density B in its vicinity. As a result of this, another moving charge placed in this field 
will experience a magnetic force. The magnetic flux density is given by 
pa fo 
Arr? 
where r is a vector from the charge to the point at which B is measured, and ju, is a 
constant called the permeability of free space. 

This equation can be used to find the magnetic field due to a current in a wire. Sup- 
pose a small portion of wire has length 6s and contains a current J. By writing ds as a 
vector of length 6s in the direction of the wire, it can be shown that the corresponding 
contribution to the magnetic flux density is given by 


(Vv x Pf) 


= Hol ‘A 
6B = ae x r) 


This is the Biot—Savart law. Techniques of integration are required in order to com- 
plete the calculation, but using this it is possible to show, for example, that the mag- 
Mol 


netic flux density a distance r from a long straight wire has magnitude B = rae 
Tr 
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Engineering application 7.11 


The Hall effect in a semiconductor 


A frequent requirement in the semiconductor industry is to be able to measure the 
density of holes in a p-type semiconductor and the density of electrons in an n-type 
semiconductor. This can be achieved by using the Hall effect. We will consider the 
case of a p-type semiconductor but the derivation for an n-type semiconductor is 
similar. 


Figure 7.32 
Hall effect in a p-type semiconductor. 


Consider the piece of semiconductor shown in Figure 7.32. A d.c. voltage, V, is 
applied to the ends of the semiconductor. This gives rise to a flow of current com- 
posed mainly of holes as they are the majority carriers for a p-type semiconductor. 
This current can be represented by a vector pointing in the x direction and denoted 
by I. A magnetic field, B, is applied to the semiconductor in the y direction. The 
moving holes experience a force, F,, per unit volume, caused by the magnetic field 
given by 


1 
Fk, = —I1xB 
Bia 


where A is the cross-sectional area of the semiconductor. This causes the holes to 
drift in the z direction and so causes an excess of positive charge to appear on one 
side of the semiconductor. This gives rise to a voltage known as the Hall voltage, 
V,,. As this excess charge builds up it creates an electric field, E,,, in the negative z 
direction, which in turn exerts an opposing force on the holes. This force is given by 
F, = 9py)Ey where gq = elementary charge = 1.60 x 107" C, and py = density of 
holes (holes per cubic metre). Equilibrium is reached when the two forces are equal 
in magnitude, that is |F,| = |F,|. Now, 

\Ix B| JB 


| Fal qPol HI |F3| me A 
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In equilibrium the magnitude of the electric field, |E,,|, is constant and so we can 


; Vv, ; i i 
write |E,,| = a where L is the width of the semiconductor. Hence, 


IB Vu 
ee IPoT 
so that 
_ BIL 
ky = VagA 


So, by measuring the value of the Hall voltage, it is possible to calculate the density 
of the holes, po, in the semiconductor. 


EXERCISES 7.6 
1. Evaluate 
ij k ij k 
(a) | 3 1 2 (b) | -1 2 -3 
21 4 —4 0 1 
ij k i j k 
(c) | O 1 O (d) 3 2 
10 4 —-3 -1 4 

2 Ifa=i-2j+3kandb = 2i—j-—k, find 
(a) axb 
(b) bxa 

3 Ifa =i-— 2j and b = 5i+5k finda x b. 

4 Ifa=i+j—k,b=i-—jandc = 2i+k find 
(a) (ax b) xe 
(b) ax (bx c) 

5 Ifp=6i+ 7j — 2k and q = 3i —j + 4k find |p|, |q| 
and |p x q|. Deduce the sine of the angle between p 
and q. 

6 For arbitrary vectors p and q simplify 
(a) (p+q) xp 
(b) (p+q) x (pP—q) 

7 Ife=i+jandd = 2i+k, find a unit vector 
perpendicular to both ¢ and d. Further, find the sine of 
the angle between ¢ and d. 

8 A,B, Care points with coordinates (1, 2,3), (3, 2, 1) 
and (—1, 1, 0), respectively. Find a unit vector 
perpendicular to the plane containing A, B and C. 

9 Ifa=7i—2j —5k and b = 5i+j + 3k, find a vector 


perpendicular to a and b. 


10 


11 


12 


13 


14 


Ifa =7i—j+k, b = 3i—j —2kand 
c = 91+ j — 3k, show that 


ax (b+c) = (ax b)+(axc) 


(a) The area, A, of a parallelogram with base b 
and perpendicular height h is given by A = bh. 
Show that if the two non-parallel sides of the 
parallelogram are represented by the vectors a 
and b, then the area is also given by 
A=|ax bj. 


(b) Find the area of the parallelogram with sides 
represented by 2i + 3j + k and 3i+ j —k. 


The volume, V, of a parallelepiped with sides a, b and 
c is given by V = |a- (b x c)|. Find the volume of the 
parallelepiped with sides 3i + 2j + k, 2i+-j+k and 
i+ 2j+ 4k. 


Suppose a force F acts through the point P with 
position vector r. The moment about the origin, M, 
of the force is a measure of the turning effect of the 
force and is given by M = r x F. A force of 4 N acts 
in the direction i+ j + k, and through the point with 
coordinates (7, 1, 3). Find the moment of the force 
about the origin. 


In the theory of electromagnetic waves an important 
quantity associated with the flow of electromagnetic 
energy is the Poynting vector S. This is defined as 

S =E x H where E is the electric field strength and 
H the magnetic field strength. Suppose that in a plane 
electromagnetic wave 


27z . 
E = E)cos = ee j 


co a fF | w& 


| 
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Poynting vector and confirm that the direction of 
energy flow is the z direction. 


and | where A, @, jt, and E, are constants. Find the 


Solutions 
(a) 21—-8j+k  (b) 214+ 137+ 8k mg | G_sj4hm 
(c) 4i-k (d) 22i— 18j + 12k /27 
(a) 5i+7j + 3k (b) —5i—7j — 3k 9 —-i-46j + 17k 
—10i — 5j + 10k 11 (b) /90 = 9.49 
(a) -i-3j+2k  (b) i-j 12 5 
V89, /26, 48.01, 0.9980 (21 — 4) + ok) 
V3 
(a) qxp (b) 2q x p 2 
2nE, 3 
l 14 = 9 Cos? — — wo k, 
—(i—j — 2k), 0.775 Og 
ies 


which is a vector in the z direction. 


Technical Computing Exercises 7.6 


Vector and scalar products are readily calculated in most producing the answer: 
technical computing languages. For example in 
MATLAB® we could calculate the scalar product of 


a=i-—2j+3kandb=2i-—j-—k by typing: . % 4 


=(1 -2 3]; : o 
6 - -1 a m 1 Confirm using MATLAB®, or a similar language, that 
dot (a,b) ix j=k. 


2 (a) Select two different vectors a and b and 
calculate |a x b|* + (a.b)?. 
ae: AD (b) Calculate |a|2|b|? 
and the vector product by then typing: (c) What are your observations about the results 
from (a) and (b)? Verify your conclusions by 


changing the vectors a and b and repeating 
the calculations. 


producing the answer: 


cross(a,b) 


¥im NVECTORS OF n DIMENSIONS 


The examples we have discussed have all concerned two- and three-dimensional vec- 
tors. Our understanding has been helped by the fact that two-dimensional vectors can 
be drawn in the plane of the paper and three-dimensional vectors can be visualized in 
the three-dimensional space in which we live. However, there are some situations when 
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the generalization to higher dimensions is appropriate, but no convenient geometrical 
interpretation is available. Nevertheless, many of the concepts we have discussed are 
still applicable. For example, we can introduce the four-dimensional vectors 


and b= 


BNF WwW 
ePWoOrF 


It is natural to define the magnitude, or norm, of a as /3? + 1? + 2? + 42 = /30 and 
the scalar product of a and b as a-b = (3)(1) + 1) (0) + (2)8) + (4)C) = 13. An 
n-dimensional vector will have n components. Operations such as addition, subtraction 
and scalar multiplication are defined in an obvious way. 

It is also possible to define a set of variables as a vector. This turns out to be a use- 
ful way of modelling a physical system. The system is described by means of a vector 
which consists of an ordered set of variables sufficient to describe the state of the sys- 
tem. Such a vector is called a state vector. This concept is explored in more detail in 
Chapter 20. 


Engineering application 7.12 


Mesh current vector 


When analysing a complex circuit it can be convenient to assign a current to each 
small independent loop within the circuit. Each of these currents is known as a mesh 
current. It is possible to collect these individual currents together to form a vector 
quantity. Consider the following example. 

A circuit as shown in Figure 7.33 has a set of mesh currents {J,, J,, /,, 1,} from 
which we can form a current vector 


No geometrical interpretation is possible but nevertheless this vector provides a use- 
ful mathematical way of handling the mesh currents. We shall see how vectors such 
as these can be manipulated in Section 8.12. 


Figure 7.33 
A circuit with mesh currents 
shown. 
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EXERCISES 7.7 
1 following are mutually orthogonal. 
1 3 
T 9) 1 2 3 
a=] 0 and b=] 1 a=| 2 b=|! ea? 
1 0 4 0 1 
iT l —l 0 0 
find the norm of a, the norm of b and a- b. Further, 0 3 
find the norm of a — b. d 0 0 
— e=> 
2 Two non-zero vectors are mutually orthogonal if their 7 —2 
scalar product is zero. Determine which of the 2 —3 
Solutions 
M2, /15,6, 77 | (2 aande,bandd 
REVIEW EXERCISES 7 
1 Finda-banda x b when 8 Show that the vectors p = 3i — 2j +k, 
(a) a= 7i—j+k,b = 31+ 2j+5k gq = 2i+-j — 4k and r =i — 3j + 5k form the three 
(b) a= 6i — 6j — 6k, b =i—j—k. sides of a right-angled triangle. 
2 Fora triangle ABC, express as simply as possible the 9 Find a unit vector parallel to the line y = 7x — 3. Find 
seeaeAR 4 BC 4 CA a unit vector parallel to y = 2x + 7. Use the scalar 


product to find the angle between these two lines. 
3 Ifa=7i—j+2k and b = 8i+ j+k, find |al, |b] and 


a-b. Deduce the cosine of the angle between a and b. 10 Anelectric charge g which moves with a velocity v 


produces a magnetic field B given by 
bqvxt 

~ 4n Ir[2 
Find B if r = 3i+ j — 2k and v = i — 2j + 3k. 


4 Ifa=6i—j+2k andb = 3i —j + 3k, find 
jal, |b], |a x b|. Deduce the sine of the angle between 
a and b. 


5 Ifa =7i+9j — 3k and b = 2i— 4j, find a, b, a x b. 


where 44 = constant 


> = => 
11 Ina triangle ABC, denote AB by c, AC by b and CB 
6 By combining the scalar and vector products other by a. Use the scalar product to prove the cosine rule: 
types of products can be defined. The triple scalar a =—b? +c? —2becosA. 
product for three vectors is defined as (a x b)-¢ 


which is a scalar. Ifa = 3i1-j+2k,b —2i—2j—k, | MA Evaluate 


c = 3i+j, find a x b and (a x b) -¢. Show that i i: ij k 
(ax b)-c=a-(bxc). (a) | -—4 0 -3 (b) /8 2 5 
qT Al 4 1 0 0 
7 The triple vector product is defined by (a x b) x ¢. 
Find the triple vector product of the vectors given in 13 Find the area of the parallelogram with sides 
Question 6. Also find a-c, b+ c¢ and verify that represented by 3i + 5j — k andi+ 3j —k. 
(a-c)b — (b-c)a= (ax b) xe 14 Find the angle between the vectors 7i + 2j and i — 3j. 


15 Find a unit vector in the direction of the line joining 


Further, find d confirm that 
GE eh ne ae De) Aad oman a the points (2, 3, 4) and (7, 17, 1). 


ax (bx c) #(axb) xc. 
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Show that the vectors i — j and —3i — 3j are 
perpendicular. 


Find the norm of each of the vectors 


7 2 

2 1 
4 and 0 
—4 


(a) Use the scalar product to find the value of the 
scalar yz so that i+ j + jk is perpendicular to the 
vectori+j+k. 


(b) Use the vector product and the results from part (a) 
to find a mutually perpendicular set of unit vectors 
¥,, V5 and V3, where ¥, is inclined equally to the 
vectors i, j and k. 


The points A, B and C have coordinates (2, —1, —2), 
(4, —1, —3) and (1,3, —1). 


> > 
(a) Write down the vectors AB and AC. 


(b) Using the vector product find a unit vector 
which is perpendicular to the plane containing 
A, B and C. 


Solutions 


10 


12 


(a) 24,-7i- 32+ 17k — (b) 18,0 


0 
V54, V 66, 57, 0.9548 


J41, /19, /154, 0.4446 
1 1 1 
Vi39 20” J2296 

5i + 7j — 4k, 22 


(—12i — 6j — 46k) 


(ax b) x c= 4i — 12j — 16k 

a-c=8, b-c=4 

a x (b x c) = —2i — 22j — 8k 
1 1 

/50 Jf5 


(i+ 11j + 7k) 


(i+ 7j), (it 2j), 18.4° 
AG a, 
56/141 
(a) 3i—5j —4k 
(b) 5j — 2k 


20 


21 


22 


23 


13 
14 


15 


17 
18 


19 


20 


23 


(c) If Dis the point with coordinates (3, 0, 1), use 
the scalar product to find the perpendicular 
distance from D to the plane ABC. 


The condition for vectors a, b and ¢ to be coplanar 
(i.e. they lie in the same plane) is a- (b x c) = 0. 


(a) Show that the vectors a = 4i + 5j + 6k, 
b = 6i — 3j — 3k andc = —i+ 2j + 2k are 
not coplanar. 

(b) Given d = —i+ 2j +k, find the value of 4 so 
that a, b and d are coplanar. 


Points A and B have position vectors a and b 
respectively. Show that the position vector of an 
arbitrary point on the line AB is given by 
r=a+A (b-—a) for some scalar i. This is the 
vector equation of the line. 


Use vector methods to show that the three medians of 
any triangle intersect at a common point (called the 
centroid). 


Use the vector product to find the area of a triangle 
with vertices at the points with coordinates (1, 2, 3), 
(4, —3, 2), and (8, 1, 1). 


V24 = 4.90 
87.5° 


1 
——= (5i+ 14j — 3k) 
230 : 


V58, V21 


(a) w=—2 
i 1,1), ¥ meer 1, -2) 
ES ig 

1 


V2 
(a) (2,0, —1), (-1, 4, 1) 


(b) #) = 


a 


V3 (-1, 1,0) 


1 
(b) 9 -1,8) © 3 
(a) a+ (b x c) = 9 ~ 0 and hence the vectors are not 


coplanar 
(b) A = 31/14 


1 
al 1106 
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INTRODUCTION 


Matrices provide a means of storing large quantities of information in such a way that 
each piece can be easily identified and manipulated. They permit the solution of large 
systems of linear equations to be carried out in a logical and formal way so that computer 
implementation follows naturally. Applications of matrices extend over many areas of 
engineering including electrical network analysis and robotics. 
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Example 8.1 


Solution 


An example of an extremely large electrical network is the national grid in Britain. 
The equations governing this network are expressed in matrix form for analysis by com- 
puter because solutions are required at regular intervals throughout the day and night in 
order to make decisions such as whether or not a power station should be connected to, 
or removed from, the grid. 

To obtain the trajectory of a robot it is necessary to perform matrix calculations to find 
the speed at which various motors within the robot should operate. This is a complicated 
problem as it is necessary to ensure that a robot reaches its required destination and does 
not collide with another object during its movement. 


BASIC DEFINITIONS 
A matrix is a rectangular pattern or array of numbers. 


For example, 


123 
12-2 
A=| 456 B= ( ) C=(1 -1 1) 
<iga 3 4 0.5 


are all matrices. Note that we usually use a capital letter to denote a matrix, and enclose 
the array of numbers in brackets. To describe the size of a matrix we quote its number 
of rows and columns in that order so, for example, an r x s matrix has r rows and s 
columns. We say the matrix has order r x s. 


Anr X s matrix has r rows and s columns. 


Describe the sizes of the matrices A, B and C at the start of this section, and give examples 
of matrices of order 3 x 1,3 x 2 and 4 x 2. 


A has order 3 x 3, B has order 2 x 3 and C has order | x 3. 


—1 12 
—2] isa3 x | matrix 3 4] isa3 x 2 matrix 
3 5 6 
and 
-1 -tl 
—1 2 ; . 
> _05 isa4 x 2 matrix 


1 0 


More generally, if the matrix A has m rows and n columns we can write 


Gi, Ayo «+s A 


a a geeree SA 
21 22 2, 
A= 
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Example 8.2 


Solution 


Example 8.3 


Solution 
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where a; ; represents the number or element in the ith row and jth column. A matrix with 
a single column can also be regarded as a column vector. 

The operations of addition, subtraction and multiplication are defined upon matrices 
and these are explained in Section 8.3. 


ADDITION, SUBTRACTION AND MULTIPLICATION 


Matrix addition and subtraction 


Two matrices can be added (or subtracted) if they have the same shape and size, that is the 
same order. Their sum (or difference) is found by adding (or subtracting) corresponding 
elements as the following example shows. 


If 


£5. 23 L 2.0 
A= (5 1 i) = a=(j 1 :) 


findA+ BandA — B. 


1S i 2D) 4k ee 
A+B=(; 1 +( i 1 =( 2 ) 
oe { 2 0) fo 3. 2 
A-B=(; I = (4 I a= (4 0 =) 


On the other hand, the matrices (( ) and (7) cannot be added or subtracted because 


they have different orders. 


ve=(? ?\andb=l" 2 \ owt Capea ed 
cd gh 
_ f(a b e f\ fate b+f 
ee = (: “)+(5 es ea a) 
_ fe f a b\_ feta f+b 
aks (: i)+(2 = ees 


Now a + e is exactly the same as e + a because addition of numbers is commutative. 
The same observation can be made of b+ f,c + gandd+h.HenceC+D=D+C. 
The addition of these matrices is therefore commutative. This may seem an obvious 
statement but we shall shortly meet matrix multiplication which is not commutative, so 
in general commutativity should not be simply assumed. 
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The result obtained in Example 8.3 is true more generally: 


Matrix addition is commutative, that is 
AC Bie eA\ 


It is also easy to show that 


Matrix addition is associative, that is 
A+ (B+C)=(A+B)+C 


8.3.2 Scalar multiplication 


Given any matrix A, we can multiply it by a number, that is a scalar, to form a new matrix 
of the same order as A. This multiplication is performed by multiplying every element 
of A by the number. 


Example 8.4 If 


1 3 2 6 
Solution JA=9 |) =2° 1 )-= 4 2 
0 1 QO. 2 
1 3 —3 -9 
—3A=-3]/-2 1l]J= 6 -3 
0 1 QO -3 
and 
1 3 
i ee ° i 
s4=5(-2 1} =| -1 5 
0 1 1 
@) 7 
2 


8.3.3 


Example 8.5 


Solution 


Example 8.6 


Solution 
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In general we have 


Cy Chg Gi. ig Midis, ccs Bhi 
a a Eee CL ka ka cos 1G 
21 22 2n 21 D 2n 
1fAe— : ; r : then kA = ; 
Gl (hy ne (he gig Ukiln cao bt 


Matrix multiplication 


Matrix multiplication is defined in a special way which at first seems strange but is in 
fact very useful. If A is a p x g matrix and B is an rr x s matrix we can form the product 
AB only if g = r; that is, only if the number of columns in A is the same as the number 
of rows in B. The product is then a p x s matrix C, that is 


C=AB where Aispxq 
Bisqxs 
Cispxs 


3 7 


Given A = (4 2) and B= (5 > 


5) can the product AB be formed? 


A has size | x 2 
Bhas size 2 x 3 


Because the number of columns in A is the same as the number of rows in B, we can 
form the product AB. The resulting matrix will have size | x 3 because there is one row 
in A and three columns in B. 


Suppose we wish to find AB when A = (4 2) and B = (; 


size 2 x 1 and so we can form the product AB. The result will be a | x | matrix, that is 
a single number. We perform the calculation as follows: 


) Anas siz 1 x 2 and B has 


AB = (4 2) (7) =4x 342% 7= 12+ 14=26 


Note that we have multiplied elements in the row of A with corresponding elements in 
the column of B, and added the results together. 


2 
Find CD when C = (1 9 2) andD= {6 
8 


2 
CD=(1 9 2)|6] =1x24+9x64+2x8=2+4+54+16=72 
8 


Let us now extend this idea to general matrices A and B. Suppose we wish to find C 
where C = AB. The element c,, is found by pairing each element in row | of A with the 
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Example 8.7 


Solution 


corresponding element in column | of B. The pairs are multiplied together and then the 
results are added to give c,,. Similarly, to find the element c,,, each element in row | of 
A is paired with the corresponding element in column 2 of B. Again, the paired elements 
are multiplied together and the results are added to form c,,. Other elements of C are 
found in a similar way. In general the element c;; is found by pairing elements in the ith 
row of A with those in the jth column of B. These are multiplied together and the results 
are added to give c;;. Consider the following example. 


IfA = ¢ :) and B = is) find, if possible, the matrix C where C = AB. 


We can form the product 


coe) 


t 
2x22x1 


because the number of columns in A, that is 2, is the same as the number of rows in B. 
The size of the product is found by inspecting the number of rows in the first matrix, 
which is 2, and the number of columns in the second, which is 1. These numbers give 
the number of rows and columns respectively in C. Therefore C will be a2 x | matrix. 

To find the element c,, we pair the elements in the first row of A with those in the 
first column of B, multiply and then add these together. Thus 


cy, =1x54+2x-3=5-6=-1 


Similarly, to find the element c,, we pair the elements in the second row of A with those 
in the first column of B, multiply and then add these together. Thus 


6 =4x543x-3=20-9=11 


The complete calculation is written as 
_ (12 5 1x5+2x—3 
ap= (i ayes) fee) 
5-6 
20 —9 
_ f-il 
~ A dl 


If A is a p x q matrix and B is aq x s matrix, then the product C = AB will be a 
p x s matrix. To find c;; we take the ith row of A and pair its elements with the jth 
column of B. The paired elements are multiplied together and added to form c¢;,. 


Example 8.8 


Solution 


Example 8.9 


Solution 


Example 8.10 


Solution 
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1 


IfB= P23 and C = | 2 | find BC. 
456 4 


B has order 2 x 3 and C has order 3 x 1 so clearly the product BC exists and will have 
order 2 x 1. BC is formed as follows: 


got 23 : _ (ixl+2x243x4\_ (17 
~ 445 6) \A] 4x 1t5x246x4) ~ (38 


Note that the order of the product, 2 x 1, can be determined at the start by considering 
the orders of B and C. 


Find AB where 
1, 2 —1 
A= 3 4 and B= 2 
-1 0 —l1 


A and B have orders 3 x 2 and 3 x 1, respectively, and consequently the product, AB, 
cannot be formed. 


Given 
1 11 0 31 
A={2 10 and B= 1|4 -10 
3 -1 2 2 21 


find, if possible, AB and BA, and comment upon the result. 


A and B both have order 3 x 3 and the products AB and BA can both be formed. Both 
will have order 3 x 3. 


1 11 0 31 6 42 
AB = |2 10]]4 -1 OJ =]4 52 
3 =1 2 2 21 0 14 5 
0 31 1 141 922 
BA = {4 -1 OJ 2 1 O0]}]=]2 3 4 
25 ede I 3 -1 2 3 4 


Clearly AB and BA are not the same. Matrix multiplication is not usually commutative 
and we must pay particular attention to this detail when we are working with matrices. 


In general AB ¥ BA and so matrix multiplication is not commutative. 


In the product AB we say that B has been premultiplied by A, or alternatively A has 
been postmultiplied by B. 
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Example 8.11 Given 
1-1 2 
A= 3 02 
-1 35 


find BC, A(BC), AB and (AB)C, commenting upon the result. 


—l 2 9 
Solution BC= 1 0 0 
3 —2 1 
1 -1l 2 
A(BC)=| 3 2 
_ 3 5 
1 -l 2 
AB= 3 0 2 
—l 3 5 
4 —2 11 


(AB)C=| 3. 2. 29 
io 213 <4 


1 5 0 14 42 
3 1}]=]4 ly 3 
1.5 8 —2 18 
14 42 12 9 73 
S| = | 16 38 162 
18 52 —21 63 
2 9 4 -—2 11 
0 OJ =] 3 2 29 
2 I 19 -12 —-4 
4 15 12 9 73 
2 3 I] =] 16 38 162 
0 15 52 —21 63 


We note that A(BC) = (AB)C so that in this case matrix multiplication is associative. 


The result obtained in Example 8.11 is also true in general: 


Matrix multiplication is associative: 


(AB)C = A(BC) 


EXERCISES 8.3 


Evaluate 


esa())2-Q)eo(29 
) 


23: & : ; 
and E = (; > 4) find, if possible, 


(a) A+ D,C—AandD-E 
(b) AB, BA, CA, AC, DA, DB, BD, EB, BE and AE 
(c) 7C, —3D and kE, where k is a scalar. 
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3 Plot the points A, B, C with position vectors given by (A+B)? = A? + AB + BA+ B* 
(:) (2) (2) Why is (A + B) not equal to A* + 2AB + B?? 
v= WYW= i— 
1 \0 ame ae 7 Find, if possible, 
respectively. Treating these vectors as matrices of 10 OO iT 
order 2 x | find the products Mv,, Mv,, Mv3 when 00-10 1 
; & ™ lo1 ooff2 
= 0 01 1 
(a) M ¢ a :) 0 
001 0\ /2 
w m=(} ) w | 9190/45 
Le -1000]]5 
— 0001 ii 
if ite 0-1 
1 0 
1 2 
’ . ; 3 6 
In each case draw a diagram to illustrate the effect 8 Find 2-57 3: 3 
upon the vectors of multiplication by the matrix. 6 7 
4 Find AB and BA where 1 
9 Given the vector v = | 2 | calculate the vectors 
13 2 3 
A= 1|/-10 4 obtained when v is premultiplied by the following 
5 1-1 matrices: 
5 2 1 62 9 -1 03 
B= 1034 (a) 13 2 (b) 719 
135 —1 2 -3 134 
: 2 . : 13 1 
5 Given that A~ means the product of a matrix A with 3.12 
42 oe (eee Ole sa 
itself, find A? when A = ¢ ): Find A’, 280 
1.3 21 oe? 
6 IfA= B= find AB, BA,A+B 96 4 
—2 4 —4 5 (e) 539 
and (A + B)*. Show that 252 
Solutions 
1 (a) 15 (b) —19 —4 —3 -—7 5 
12 16)’ \—21 19)’ 
47 —7 1 
(c) é ;) (d) (_ 14 i. DA does not exist, DB does not exist, 


Cee 
(ec) ¢ 4 () 15 321)" 


EB does not exist, BE does not exist, 


(g) (25 12) (h) (19 19 19) % #9 
las 17 :) 


17 1 09 
O(a) Go) 
1 © (- —49 4) (-9 -6—3), G 3k ad 


0 28 k 2k —k 
: -8 0 
BI (2) A+ D does not exist, & ) B « ) e 3 " (°).(°).@) 
D — E does not exist 
0 
1 


(b) 6) BA does not exist, (c) 
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7 17 23 8 16 17 


—1 10 19],]17 4 8 
24 10 4 23 8 9 


5 18 14 86 78 
7 11 39 47 
6 —10 16 0 10 
—20 18 —-14 8)’ 
4 —15 48 
9]? \-72 57 


Technical Computing Exercises 8.3 


Technical computing languages such as MATLAB® are 
usually designed to perform operations on scalars, 

vector and matrices. However, as has been demonstrated 
in this chapter, different mathematical rules apply when 


: i : ; 1. Find the product of the scalars a and b. Is this a 
working with vectors and matrices. Consider the valid calculation? 
following quantities: 
2. Find the product of the scalar a and the matrix c. Is 
this calculation valid? Similarly find the product of 
a=-2 (ascalar quantity) the matrix c and scalar a. Do these calculations 
: produce the same result? 
b=7 (ascalar quantity) 
3. Find the products of the matrices c and d, that is cd 
c= € “) (a 2 by 2 matrix) and dc. Are these calculations valid and do they 
produce the same result? 
d= € ) (a2 by 2 matrix) 4. Find the products of the matrices e and f, that is ef 
6 8 and fe. Are these calculations valid and do they 
14 produce the same result? 
e= ( 5], (a3 by 2 matrix) 5. Find the products of the matrices d and e, that is de 
3 6 and ed. Are these calculations valid and do they 
produce the same result? 
1.3.5 ; 
i= é 4 a (a2 by 3 matrix) 6. Find c?. Is this calculation valid? 
7. Find e?. Is this calculation valid? 
Solutions 


1. Yes, the calculation is valid. It is a scalar multiplied 


by a scalar. In MATLAB®: 


axb 


1 5 
2 5 
@} a} ©] 5 
1 1 
46 29 
29 78 
a7 8 10 
(a) {| 13 (b) | 36 (c) {31 
6 19 18 
31 
ll 33 
(d) (3s) (©) | 38 
18 


Load these as variables in a technical computing 
language and then attempt to perform the following 
calculations and answer the questions. 


Yes, the calculations are valid and produce the same 
result. 
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c*a 


=2..==6 
-4 -8 


Multiplication of a scalar by a matrix is 
commutative. It does not matter in which order the 
calculation is performed. Each element in the matrix 
is multiplied by the scalar value. 


The calculations are valid but they do not produce the 
same result because matrix multiplication is not 
commutative. 


cxd 
ans = 
23 31 
34 46 
>> d*c 
ans = 
19 43 
22 50 


This result emphasizes the need for care when 
manipulating matrix equations. 


Both calculations are valid but the results are not the 
same. 


exf 
ans = 
9 19 29 
12 26 40 
15 33 51 
fxe 
ans = 
22 49 
28 64 


de is not valid as there are 2 columns in d and 3 rows 
in e. It should generate an error in a technical 
computing language. It is worth observing the error 
message and understanding why this happens for 
future reference. 


ed is valid, 

exd 

ans = 
29 39 
40 54 
51 69 


It is valid. We take this to mean the matrix 
multiplication of c by itself: 


10 22 


Notice that this is different to taking the square of 
each element within c. If we wanted to do that it 
would be necessary to use the ‘dot’ notation: 


16 


The command c.*2 means ‘take each element of c and 
raise it to the power of 2. There is a set of operators 
within MATLAB® which allow multiplication of 
matrices on an element by element basis. Care has to 
be taken as (for example) c * d and c. * d are both 
valid operations which generate a 3 by 3 result but the 
elements of the matrix are different. In other words, it 
could give an incorrect result but MATLAB® would 
not give any error messages. 


It is not valid. It is only possible to multiply a matrix 
by itself if there are the same number of columns as 
rows. 


USING MATRICES IN THE TRANSLATION AND ROTATION 


OF VECTORS 


There are many applications in engineering that require vectors to be manipulated into 
new configurations. One mechanism for achieving this is by the use of matrices. We will 
introduce this topic through the example of robotics. However, it is important to note 
that the ideas are more generally applicable in engineering. 
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Engineering application 8.1 


Robot coordinate frames 


In Chapter 7 we saw that vectors provide a useful tool for the analysis of the position 
of robots. By assigning a vector to each of the links the position vector corresponding 
to the tip of the robot can be calculated (Figure 8.1). In practice the inverse problem 
is more likely: calculate the link vectors to achieve a particular position vector. Usu- 
ally a desired position for the tip of the robot is known and link positions to achieve 
this are required. The problem is made more complicated because the position of a 
link depends upon the movements of all the joints between it and the anchor point. 
The solution of this problem can be quite complicated, especially when the robot has 
several links. One way forward is to define the position of a link by its own local 
set of coordinates. This is usually termed a coordinate frame because it provides a 
frame of reference for the link. Matrix operations can then be used to relate the coor- 
dinate frames, thus allowing a link position to be defined with respect to a convenient 
coordinate frame. A common requirement is to be able to relate the link positions to 
a world coordinate frame. If a robot is being used in conjunction with other ma- 
chines then the world coordinate frame may have an origin some distance away from 
the robot. The advantage of defining link coordinate frames is that the position of a 
link is easily defined within its own coordinate frame and the movement of coordinate 
frames relative to each other can be expressed by means of matrix equations. 


p=atb+cr+d 


Figure 8.1 
A robot with links represented by vectors 
a, b, c and d. 


8.4.1 Translation and rotation of vectors 


An introduction to the mathematics involved in analysing the movement of robots can 
be obtained by examining the way in which vectors can be translated and rotated using 
matrix operations. 

Consider the point P with position vector given by 


In order to translate and rotate this vector it is useful to introduce an augmented vector 


V given by 


V= (8.1) 


PN SS & 
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It is then possible to define several matrices: 


1 O 0 
0 cos@ —sin@ 
0 sind cos 0 
0 O 0 


Rot(x, 6) (8.2) 


ee OCS 


cos@ 0 sin@ 
0 1 O 
—sind O cosd@ 
0 0 O 


Rot(y, 0) (8.3) 


- OOO 


cos@é —sind 0 


sin0 cos@ 0 


Rot(z,@) = (8.4) 


re COO °O 


Trans(a, b,c) = (8.5) 


Matrices (8.2) to (8.4) allow vectors to be rotated by an angle 6 around axes x, y and z, 
respectively. For example, the product Rot(x, 0)V has the effect of rotating r through 
an angle @ about the x axis. Matrix (8.5) allows a vector to be translated a units in the x 
direction, b units in the y direction and c units in the z direction. 

It is possible to combine these matrices to calculate the effect of several operations 
on a vector. In doing so, it is important to maintain the correct order of operations as 
matrix multiplication is non-commutative. 

For example, the position of a vector that has first been translated and then rotated 
about the x axis can be defined by 


View = Rot(x, 6) Trans(a, b, c) Voi 


A few examples will help to clarify these ideas. 


Example 8.12 Rotate the vector 


II 
= 


through 90° about the x axis. 


I 
fl 
Solution Toa = | 1 Vou = | 2 

2 

fl 

1 0 0 0 10 00 

s [0 cos90° —sind0° 0} [00-10 

RotG.20)) = 19 sind? cos’ 01 10 1 0 0 

0 0 0 1 00 01 
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Solution 
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10 00 1 1 
00-10 1 —2 
View 01 OO0;]2] — 1 
00 O11 1 1 
So, 
1 
Trew = —2 
1 
Translate the vector 
1 
r= ]{3 
2 
by 
1 
2 
3 
and then rotate by 90° about the y axis. 
! : 
Toa = | 3 Vo6 = 2 
: 1 
To translate r,j,, we form 
1001 
0102 
Trans(1, 2,3) = 0013 
0001 
Then 
1001 1 2 
010 2)43 a) 
Trans(1, 2, 3)Vij4 = 0013 r,=15 
0001 1 1 
To rotate by 90° about the y axis we require 
cos 90° 0 sin90° 0 0010 
ee 0 1 O OF; _ 0100 
Rot(y, 90") = | _ singar 0 cos90° 0| =] -1 0 0 0 
0 0 0 1 0001 


The vector 


2 
a]. ee : ; ‘ 
5 | is premultiplied by this matrix to give 
1 


8.5.1 


8.5.2 


8.5.3 


Example 8.14 


Solution 
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View = Rot(y, 90°)Trans(1, 2, 3) Voi 
0010 2 5 
_ 0100 5] _ 5 
~ |-1000 5] | -—2 
0001 1 i 
Hence 
a) 
Thew = 5 
—2 
SOME SPECIAL MATRICES 


Square matrices 


A matrix which has the same number of rows as columns is called a square matrix. Thus 


12 3 


—1 0 1] is asquare matrix, while ea aes is not 
324 241 


Diagonal matrices 


Some square matrices have elements which are zero everywhere except on the leading 
diagonal (top-left to bottom-right). Such matrices are said to be diagonal. Thus 


3000 
i ; 10 0200 
00-1 0 b 0010 
0000 
are all diagonal matrices, whereas 
124 
010 
3.0 1 


is not. 


Identity matrices 


Diagonal matrices which have only ones on their leading diagonals, for example 


100 


Ce, and 010 
001 


are called identity matrices and are denoted by the letter /. 


01 3 °=1..0 


10\(2 44\_(2 44 
a=(6 WG -1 = (5 -1 5) 


Find JA where [ = (( 4 and A = € i ) and comment upon the result. 
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8.5.4 


Example 8.15 


Solution 


Example 8.16 


Solution 


8.5.5 


Example 8.17 


Solution 


8.5.6 


The effect of premultiplying A by / has been to leave A unaltered. The product is identical 
to the original matrix A, and this is why / is called an identity matrix. 


In general, if A is an arbitrary matrix and / is an identity matrix of the appropriate 
size, then 


IA=A 
If A is a square matrix then JA = AJ = A. 


The transpose of a matrix 


If A is an arbitrary m x n matrix, a related matrix is the transpose of A, written A’, found 
by interchanging the rows and columns of A. Thus the first row of A becomes the first 
column of A? and so on. A? is ann x m matrix. 


IfA = i =) find A’. 


2 4 
I 2 
To 
(4) 
IfA = (; : *) find A’ and evaluate AA’. 
41 4 1 
, (426 _ (56 62 
122 Me 2 81=\6 114 
67 67 


Symmetric matrices 


If a square matrix A and its transpose A’ are identical, then A is said to be a symmetric 
matrix. 


5-4 2 
IfA=|-4 6 9) find A’. 
2 9 13 

5 -4 2 
A’=|-4 6 9 
2 9 13 


which is clearly equal to A. Hence A is a symmetric matrix. Note that a symmetric matrix 
is symmetrical about its leading diagonal. 


Skew symmetric matrices 


If a square matrix A is such that A?’ = —A then A is said to be skew symmetric. 
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Example 8.18 IfA = e a): find A’ and deduce that A is skew symmetric. 


QO -— 


Solution Wehave A? = é 


EXERCISES 8.5 


3 1 -14 
1 a= (; 5) and B= ( 3 :) 


(a) find A’, 
(b) find BY , 
(c) find AB, 
(d) find (AB)?, 
(e) deduce that (AB)? = BTA’. 
x 


Treating the column vector x = | y | asa3 x 1 
Zz 


matrix, find /x where J is the 3 x 3 identity matrix. 


IfA= (: ‘) show that AA’ is a symmetric 


matrix. 
213 1 -—7 0 
IfA= 42 1]andB=|0 25 
-1 32 3 45 


find A?, B?, AB and (AB)’. 
Deduce that (AB)? = BTA’. 


Determine the type of matrix obtained when two 
diagonal matrices are multiplied together. 


Solutions 


@oGS Cs) 


0 20 0 16 
(©) le a @ é 7) 


x 
y 


_ 
ePnN 


) which is clearly equal to —A. Hence A is skew symmetric. 


IfA= (: ‘) is skew symmetric, show that 


a = d = 0, that is the diagonal elements are zero. 


1 13 
ta=( 8) 


(a) find A’, 
(b) find (AT)’, 
(c) deduce that (AT)T is equal to A. 


94 
IfA = c ) 


(a) find A + A’ and show that this is a symmetric 
matrix, 


(b) find A — A? and show that this is a skew 
symmetric matrix. 


The sum of the elements on the leading diagonal 
of a square matrix is known as its trace. Find the 
trace of 


© 9(39 


100 7 2 #1 
(c) |O0 1 0 (d) }8 2 3 
001 9 -1 -4 


11 0 20 11 7 5 
7 —20 15],{ 0 —20 21 
> 21 25 20 15 25 


Diagonal matrix 


1 15 1 13 
(5 ;) (45 |) 
18 7 01 
o ( 7 i) (b) & 


(a) 12 (b) 0 (c) 3 (d) 5 
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| 8.6 | THE INVERSE OF A 2 x 2 MATRIX 


When we are dealing with ordinary numbers it is often necessary to carry out the opera- 
tion of division. Thus, for example, if we know that 3x = 4, then clearly x = 4/3. If we 
are given matrices A and C and know that 


AB=C 


how do we find B? It might be tempting to write 


B=— 
A 


Unfortunately, this would be entirely wrong since division of matrices is not defined. 
However, given expressions like AB = C it is often necessary to be able to find the 
appropriate expression for B. This is where we need to introduce the concept of an inverse 
matrix. 

If A is a square matrix and we can find another matrix B with the property that 


AB=BA=I 
then B is said to be the inverse of A and is written A~!, that is 
AA =A'A=I 


If B is the inverse of A, then A is also the inverse of B. Note that A~! does not mean a 
reciprocal; there is no such thing as matrix division. A~! is the notation we use for the 
inverse of A. 


Multiplying a matrix by its inverse yields the identity matrix /, that is 


AA'=A'A=I1 


Since A is a square matrix, A~! is also square and of the same order, so that the products 
AA~! and A~!A can be formed. The term ‘inverse’ cannot be applied to a matrix which 
is not square. 


Example 8.19 IfA = (; 7 show that the matrix & =) is the inverse of A. 


Solution Forming the products 
24I\f 2-41. (10 
3 2/\-3 2) ~~ \ol 
221) 7/23)... fiw 
—3 2)/\3 2) ~~ \Ol 


we see that e ~ ») is the inverse of A. 


8.6.1 


Example 8.20 


Solution 


Example 8.21 


Solution 


Example 8.22 


Solution 


8.6 The inverse of a 2 x 2 matrix 


Finding the inverse of a matrix 


For 2 x 2 matrices a simple formula exists to find the inverse of 


(2) 


This formula states 


_ [ab Spee 1 d —b 
a= (2 2) then A -aaE (3 “) 


_(3> -1 
ra=(; ) find A~’. 


Clearly ad — bc = 6 — 5 = 1, so that 


1 2-5 2-5 
= ares _ 
a sel 


The solution should always be checked by forming AA~!. 


_(15 -1 
a= (} _) find A~’. 


Here we have ad — bc = 4 — 10 = —6. Therefore 


1 
This time, ad — bc = 4 — 4 = 0, so when we come to form q 
ad — bc 


is not defined. We cannot form the inverse of A in this case; it does not exist. 
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we find 1/0 which 


Clearly not all square matrices have inverses. The quantity ad — bc is obviously the 
important determining factor since only if ad — bc 4 0 can we find A~!. This quan- 
tity is therefore given a special name: the determinant of A, denoted by |A|, or det A. 
Given any 2 x 2 matrix A, its determinant, |A|, is the scalar ad — bc. This is easily 


remembered as 


[product of \, diagonal] — [product of /“ diagonal] 
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: . (ab we : a b 
If A is the matrix (: i‘) we write its determinant as en 


lines || indicate that we are discussing the determinant, which is a scalar, rather than 
the matrix itself. If the matrix A is such that |A| = 0, then it has no inverse and is said 
to be singular. If |A| 4 0 then A! exists and A is said to be non-singular. 


| Note that the straight 


A singular matrix A has |A| = 0. 
A non-singular matrix A has |A| + 0. 


Example 8.23 If A = € _) and B = & ') find |A], |B| and |AB]. 
12 
Solution |A| = ; I = (1)() — @)G) = —10 
[B| = ie |=Cbw-@Ca=s 
12/12 7 4 
ar (5 °) e 4 ~ ‘e 7. 
[AB] = (—7)(10) — 4)(—5) = —50 


We note that |A||B| = |AB]. 


The result obtained in Example 8.23 is true more generally: 


If A and B are square matrices of the same order, |A||B| = |AB]. 


8.6.2 Orthogonal matrices 


A non-singular square matrix A such that A? = A7! is said to be orthogonal. Conse- 
quently, if A is orthogonal AA? = ATA =I. 


0 -l 


Example 8.24 Find the inverse of A = ¢ 0 


) Deduce that A is an orthogonal matrix. 


Solution From the formula for the inverse of a2 x 2 matrix we find 


1 01 0 1 
Sf 35 = 
“=7(40)=Cr 9) 


This is clearly equal to the transpose of A. Hence A is an orthogonal matrix. 


To find the inverses of larger matrices we shall need to study determinants further. This 
is done in Section 8.7. 
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EXERCISES 8.6 
5 6 = Find values of the constants a and b 
1 IfA=(_4 g) finda”. snchithatA +aA—! = BI, 
11 2-4] 
2 Find the inverse, if it exists, of each of the following 6 IfA= ¢ ,) and B = (_ 1 :) 
matrices: ; ; ot 
= find AB, (AB)~*, B~*, A~* and B~*A~*. 
(a) (; o (b) ( ' ) ©) i; ) 
- Deduce that (AB)~! = B-!A7!. 
(d) & _) (e) € ;) (f) (Cs | 7 Given that the matrix 
coswt —sinwt 0 
1 1 M= | sinat coswt O 
(g) . : 0 0 1 
0 5 is orthogonal, find M~!. 
3 IfA= (| i) oe ¢ ;) 8 (a) IfA= [ ‘) and k is a scalar constant, 
show that the inverse of the matrix kA 
find |AB|, |BA|. 1 
; is os 
4 waa (“ asl, 1) = 
e 8 (b) Find the inverse of and hence write 
find AB, |Al, |BI, |ABI. 10 
1 1 
Verify that |AB| = |A||B]. a 
12 down the inverse of ; 3 
= -1 
5 a= (5 ;) find 4 : , 0 
Solutions 
1 (8 -6 —2 1 
64\4 5 5 3 1 a=-—2,b=5 
1 3 2 2 
10 -1 0 ~ 10 10 _{ 14 LL es 
a (; 1 () ( 0 : Ol) a 4 S ap=(_4 3) 4B) at Vas ay? 
7 : pt=2(i ea =) 
=] 0 "Tf Is : 
(d) 1 1 (e) Noinverse (f) coswt sinwt 0 
“a 7 lot 7 —sinwt cosat 0 
— 4 6 0 0 1 
(g) (; 3) may \(% 3 
1 -1)’>\3 -3 
3 —132, —132 


b ae+bg af+bh 
ce+dg cf+dh)’ 
ad — bc, eh — fg, 


(ad — bc)(eh — fg) 
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DETERMINANTS 


Gy, Ayn 43 
IfA = | a5, Gy) dy, J, the value of its determinant, |A], is given by 
G3; Az. a3 
bbe @ Ga. a, a 
|A| =a,, (02 93) —a@,, | ® ; 21 422 
Az A133 a3; A, a3; Az. 


If we choose an element of A, a; j Say, and cross out its row and column and form the 
determinant of the four remaining elements, this determinant is known as the minor of 
the element a; . 


A moment’s study will therefore reveal that the determinant of A is given by 
|A| = (a,, X its minor) — (a,, x its minor) + (a,3 x its minor) 


This method of evaluating a determinant is known as expansion along the first row. 


Example 8.25 Find the determinant of the matrix 


121 
A= |-134 
5 12 


Solution The determinant of A, written as 


al =f af-215 otal 5 a 
= 12)=3-227 4 (E16) 
= 2+44-16 
= 30 


Example 8.26 Find the minors of the elements | and 4 in the matrix 


72.3 
B= 1|1 03 
042 
Solution To find the minor of | delete its row and column to form the determinant : ah The 
required minor is therefore 4 — 12 = —8. 
7 3 


Similarly, the minor of 4 is 13 


[=21-3=18 


Example 8.27 


Solution 


8.7.1 
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In addition to finding the minor of each element in a matrix, it is often useful to find a 
related quantity — the cofactor of each element. The cofactor is found by imposing on 
the minor a positive or negative sign depending upon its position, that is a place sign, 
according to the following rule: 
an = =e 
= + — 
a Err 


If 
3 27 
A=/9 10 
3 -1 2 
find the cofactors of 9 and 7. 


The minor of 9 is = 4— (—7) = 11, but since its place sign is negative, the 


27 
-12 
required cofactor is —11. 


The minor of 7 is = —9 —3 = —12. Its place sign is positive, so that the 


5 = 
required cofactor is simply —12. 


Using determinants to find vector products 
Determinants can also be used to evaluate the vector product of two vectors. If a = 
a,i+a,j+a,k and b = b,i+ b,j + b,k, we showed in Section 7.6 that a x b is the 
vector defined by 

a x b = (a,b, — a,b,)i+ (a,b, — a,b,)j + (a,b, — a,b, )k 


If we consider the expansion of the determinant given by 


ij k 
a; a, a, 
b, by by 


we find the same result. This definition is therefore a convenient mechanism for evalu- 
ating a vector product. 


If a = a,i+ a,j + a,k and b = b,i+ b,j + bjk, then 
Dei ok 
axb=|a, a, a, 
b, by b, 
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Example 8.28 Ifa=3i+j-— 2k and b = 4i + 5k finda x b. 


Solution 


8.7.2 


Example 8.29 


Solution 
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We have 
i 
axb = j3 
4 


Si — 23j — 4k 


Cramer's rule 


A useful application of determinants is to the solution of simultaneous equations. Con- 


sider the case of three simultaneous equations in three unknowns: 


ay X + apy +ayz = d, 
Ay)X + Ayy + Ay,zZ = by 


3X + Ayy + dy3z = b, 


Cramer’s rule states that x, y and z are given by the following ratios of determinants. 


Cramer’s rule: 
by ay 
by ay 
bs a3 


ay 
a3 
a33 
a13 


— 

op) 
Ay 
Ey) 


Ay, 
a33 


Note that in all cases the determinant in the denominator is identical and its elements 
are the coefficients on the I.h.s. of the simultaneous equations. When this determinant is 
zero, Cramer’s method will clearly fail. 


Solve 
3x+2y-—z=4 
2x—y+2z = 10 
x—3y—4z =5 
We find 
4 2-1 
10 -1 2 
5 -—3 —4 165 
x= 4 =3 
3 2-1 55 
2-1 2 
1 -—3 —4 


Verify for yourself that y = —2 and z= 1. 


a3 ay 

a3 a) 

Az a3; 
a — 

a3 ay, 

Ay3 Ay, 

a33 a3) 


ay d, 
Ay by 
ax, b; 
Qn %3 
Ay) 23 
G39, A33 
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EXERCISES 8.7 
ag l(t 3 4 67 2 (d) 3x+2y=1 
1 Fin | ,|2 1 Ofand| 1 4 3 xt+y-z=1 
2 8 
3. =1 -1 14 2x+3z=-1 
a) Fin | Cos wt sin ot} 6 Given 
—sin@t cos at 
Sale “6 
5 00 900 A={-—2 1 0 
3 Evaluate |6 3 2] and|0 7 0 42 -5 
457 00 8 (a) find |A| 
2-17 (b) find the cofactors of the elements of row 2, that is 
4&4 IfA=|0_ 8 4}, find JA| and |A7|. —2,1,0 
3 64 


(c) calculate 
Comment upon your result. 


—2 x (cofactor of —2) 


5 Use Cramer’s rule to solve +1 x (cofactor of 1) 
(a) 2x-3y+z=0 +0 x (cofactor of 0). 
5x+4y+z= 10 
| What do you deduce? 
(b) 3xty=-l 7 Wfa=7i+ 11j — 2k and b = 6i — 3j + k finda x b. 


2x-—y+z=-l 8 
5x + 5y —7z = —16 
(c) 4x+y+z= 13 


Find a x b when 


(a) a=3i—j,b=i+j+k 
(b) a=2i+j+k,b=7k 


2x-y=4 
xty-z=-3 (c) a= —7j —k,b = —3i1+-j 
Solutions 
1 20, 33, 39 (c) x=2,y=0,2=5 
(d) x=l,y=-l,z=-1 
2 1 
6 (a) —133 = (b) 47, -39,22. (ce) —133 
3 55,504 
7 5i— 19j — 87k 
4 —-164, —164 Note |A| = |A"| 
8 (a) -i—3j+4k 
m @) x=y=z=1 (b) 7i— 14j 
(b) x=—l,y=2,z=3 (c) i+3j—21k 


THE INVERSE OF A 3 x 3 MATRIX 


Given a 3x3 matrix, A, its inverse is found as follows: 


(1) Find the transpose of A, by interchanging the rows and columns of A. 


(2) Replace each element of A” by its cofactor; by its minor together with its associated 
place sign. The resulting matrix is known as the adjoint of A, denoted adj(A). 
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(3) Finally, the inverse of A is given by 


Av = adj (A) 
|A| 


Example 8.30 Find the inverse of 


Solution 


Replacing each element of A’ by its cofactor, we find 


7 @ =i 
adj(A) = [-14 3. —5 
7h 4 


The determinant of A is given by 


15 3 5 3 1 
a= ah [|i 3+o| 
= (1)(-7) + (2)14) 
aA | 
Therefore, 
: —7 6 —10 
eee oe 
|A| 21 70 7 


Note that this solution should be checked by forming AA™! to give /. 


It is clear that should |A| = 0 then no inverse will exist since then the quantity 1/|A| is 
undefined. Recall that such a matrix is said to be singular. 


For any square matrix A, the following statements are equivalent: 
|A| = 0 
A is singular 


A has no inverse 


EXERCISES 8.8 


1. Find adj(A), |A| and, if it exists, Aa! if 


2-3 1 
(ay A=[5 4 1 
aoe 2) 4P z 
3 1 0 _ o 
(b) A=|2-1 1 
5 5 -7 Deduce P~!. 
Solutions 
1 (a) |A|=—43 (A= 0 


adj(A) 


AS 
(b) |A] = 25 


adj(A) 


AH! 


Example 8.31 


Solution 


8.9 Application to the solution of simultaneous equations 


tf 4 
(c) A=|5 —2 9 
a 


|P| = 230 


adj(P) 


adj(A) = ( 


A7! does not exist. 
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5 
10 —3 }, find adj(P) and |P}. 
= 3 


—16 7-1 
32 -14 2 


71 352: 55 
52 64 50 
55 50 75 


71 32. 55 
52 64 50 
55 50 75 


230 


APPLICATION TO THE SOLUTION OF SIMULTANEOUS 


EQUATIONS 


The matrix techniques we have developed allow the solution of simultaneous equations 


to be found in a systematic way. 


Use a matrix method to solve the simultaneous equations 


2x+4y = 14 
x—3y=-8 


(8.6) 


We first note that the system of equations can be written in matrix form as follows: 


(i 3) ()-) 


(8.7) 
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To understand this expression it is necessary that matrix multiplication has been fully 
mastered, for, by multiplying out the l.h.s., we find 


(i 3) 6) = (23) 


and the form (8.7) follows immediately. 
We can write Equation (8.7) as 


AX =B (8.8) 


where A is the matrix (; 2) , X is the matrix (;) and B is the matrix 3). 


In order to find X = (; 


) it is now necessary to make X the subject of the equation 


AX = B. We can premultiply Equation (8.8) by A~', the inverse of A, provided such an 
inverse exists, to give 


A 'AX =A7'B 


Then, noting that A~'!A = J, we find 


IX =A7'B 
that is 
X=A'B 


using the properties of the identity matrix. We have now made X the subject of the equa- 
tion as required and we see that to find X we must premultiply the r.h.s. of Equation (8.8) 
by the inverse of A. 

In this case 


2 i (2 
AT = (Fi ) 
_ (3/10 2/5 
= (ano -1/5 
ip _ (3/10 2/5 (14 
cay 4s) (‘4) 
fi 
= Ae 


that is, X = (;) = (3) so that x = | and y = 3 is the required solution. 


and 


If AX = B then X = A7'B provided A“! exists. 


This technique can be applied to three equations in three unknowns in an analogous way. 
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Example 8.32 Express the following equations in the form AX = B and hence solve them: 


3x+2y-z=4 
2x —y+2z = 10 
x—3y—-4z =5 


Solution Using the rules of matrix multiplication, we find 


3 2=1\ fx 4 
2-1 2}/]y|=]10 
19 4) Ve 5 


which is in the form AX = B. The matrix A is called the coefficient matrix and is simply 
the coefficients of x, y and z in the equations. As before, 


AX =B 
A 'AX = A7!B 
IX =X=A™'B 


We must therefore find the inverse of A in order to solve the equations. 
To invert A we use the adjoint. If 


3 2-1 
A= |2-1 2 
1 -—3 -4 
then 
3 2 1 
A= 2-1 -3 
—-1 2-4 


and you should verify that adj(A) is given by 


10 11 3 
adj(A) = | 10 —-11 —8 
==. it =7 


The determinant of A is found by expanding along the first row: 


—-1 2 2 2 2-1 
t= [3 )-2ft alt] 3 
= (3)(10) — (2)(—10) — (1)(-5) 
= 304+ 20+5 
= 55 
Therefore, 
: 10 Il 3 
14) || io -1 -8 
|A| 55 
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Finally, the solution X is given by 


x 10 11 3 4 
X= /y)] =A 'B=—] 10 -11 -8]] 10 
z hs. WN AT NS 
3 
= [| -2 
1 
that is, the solution is x = 3, y= —2 andz= 1. 
EXERCISES 8.9 
1 By expressing the following equations in matrix form (b) G ) (; = (i") 
and finding an inverse matrix, solve 1 3} \y 1 
2 -1)\ (x —4 
4x —2y = 14 = 
oats © 3 3)Q)-) 
2x+ y=5 
ek (d) {2 -1 4] | y}] =] 20 
el ia a lS) \g 20 


(c) 8x+3y=59 


4 13 
—2x+ y=—13 (c) ( 4 ‘ 
2 Solve the following equations AX = B by finding 09 15 
A7!, if it exists. ( ) 


» (5 2)G)=G) : 


Solutions 

1 (@) x=3,y=-1 2 (a) x=ly=2 (b) x=2,y=3 
(b) x=—2,y=-2 (c) x=—l,y=2 (d) x=2,y=0,z2=4 
() x=7,y=1 HQtahy=te=s Uta ea0 


| 8.10 | GAUSSIAN ELIMINATION 


An alternative technique for the solution of simultaneous equations is that of Gaussian 
elimination which we introduce by means of the following trivial example. 


Example 8.33 Use Gaussian elimination to solve 


2x+3y=1 
x+y=3 


Solution 
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First consider the equations with a step pattern imposed as follows: 


lax + 3y = 1 (1) 
xi+y =3 (2) 


Our aim will be to perform various operations on these equations to remove or eliminate 
all the values underneath the step. You will probably remember from your early work 
on simultaneous equations that in order to eliminate a variable from an equation, that 
equation can be multiplied by any suitable number and then added to or subtracted from 
another equation. In this example we can eliminate the x term from below the step by 
multiplying the second equation by 2 and subtracting the first equation. Since the first 
equation is entirely above the step we shall leave it as it stands. This whole process will 
be written as follows: 


R, [2x + 3y = 1 (8.9) 
R, > 2R,—R, Ox|- y =5 ; 


where the symbol R, means that Equation (1) is unaltered, and R, — 2R, — R, means 
that Equation (2) has been replaced by 2 x Equation (2) — Equation (1). All this may 
seem to be overcomplicating a simple problem but a moment’s study of Equation (8.9) 
will reveal why this ‘stepped’ form is useful. Because the value under the step is zero 
we can read off y from the last line, that is —y = 5, so that 


y=) 

Knowing y we can then move up to the first equation and substitute for y to find x. 
2x + 3(-5) =1 
x= 8 


This last stage is known as back substitution. 


Before we consider another example, let us note some important points: 
(1) It is necessary to write down the operations used as indicated previously. This aids 
checking and provides a record of the working used. 


(2) The operations allowed to eliminate unwanted variables are: 


(a) any equation can be multiplied by any non-zero constant; 

(b) any equation can be added to or subtracted from any other equation; 

(c) equations can be interchanged. 
It is often convenient to use matrices to carry out this method, in which case the opera- 
tions allowed are referred to as row operations. The advantage of using matrices is that 


it is unnecessary to write down x, y (and later z) each time. To do this, we first form the 
augmented matrix: 


231 
113 


so called because the coefficient matrix (; ') is augmented by the r.h.s. matrix CG) 


It is to be understood that this notation means 2x+3y = 1, and so on, so that we no longer 
write down x and y. Each row of the augmented matrix corresponds to one equation. 
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Example 8.34 


Solution 


Example 8.35 


Solution 


The aim, as before, is to carry out row operations on the stepped form 
231 
1}1 3 
in order to obtain values of zero under the step. Clearly to achieve the required form, the 
row operations we performed earlier are required, that is 


R, G iS, 


R, > 2R,—R, 0|=1 5 


The last line means Ox — ly = 5, that is y = —5, and finally back substitution yields x, 
as before. 

This technique has other advantages in that it allows us to observe other forms of 
behaviour. We shall see that some equations have a unique solution, some have no solu- 
tions, while others have an infinite number. 


Use Gaussian elimination to solve 


2x+3y=4 
4x+6y =7 


In augmented matrix form we have 


(67) 


We proceed to eliminate entries under the step: 


R, (G0 . 


R, > R, —2R, 0/0 -1 


Study of the last line seems to imply that 0x + Oy = —1, which is clearly nonsense. 
When this happens the equations have no solutions and we say that the simultaneous 
equations are inconsistent. 


Use Gaussian elimination to solve 


x+y=0 
2x + 2y =0 


In augmented matrix form we have 
110 
2|2 0 


Eliminating entries under the step we find 


R, }1 10 

R, > R, —2R, 0/0 0 
This last line implies that 0x + Oy = 0. This is not an inconsistency, but we are now 
observing a third type of behaviour. Whenever this happens we need to introduce what 


are called free variables. The first row starts off with a non-zero x. There is now no 
row which starts off with a non-zero y. We therefore say y is free and choose it to be 


Example 8.36 


Solution 
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anything we please, that is 
y= 4 is our free choice 
Then back substitution occurs as before: 
x+A=0 
x=-ik 


The solution is therefore x = —A, y = A, where A is any number. There are thus an 
infinite number of solutions, for example 


or 


and so on. 


Observation of the coefficient matrices in the last two examples shows that they have a 
determinant of zero. Whenever this happens we shall find the equations either are incon- 
sistent or have an infinite number of solutions. We shall next consider the generalization 
of this method to three equations in three unknowns. 


Solve by Gaussian elimination 
x —4y-—2z = 21 
2x+y+2z = 3 
3x+2y-—z = -—2 


We first form the augmented matrix and add the stepped pattern as indicated: 


[1-4 -2 21 
a| to 3 
3° 2|-1 -2 


The aim is to eliminate all numbers underneath the steps by carrying out appropriate 
row operations. This should be carried out by eliminating unwanted numbers in the first 
column first. We find 


R, [1-4-2 21 
R, > R, —2R, 0] 9 6 —39 
RR, = 3B, 0 14] 5 -65 


We have combined the elimination of unwanted numbers in the first column into one 
stage. We now remove unwanted numbers in the second column: 


R, jl—4 =—2 21 

R, 0) 9 6 39 

14 B 13 

ae ee a . t= 
se ae 


14 
and the elimination is complete. Although R, > R, + a would eliminate the 14, it 


would reintroduce a non-zero term into the first column. It is therefore essential to use 
the second row and not the first to eliminate this element. We can now read off z since 


290 Chapter 8 Matrix algebra 


13 13 
the last equation states 0x + Oy — 3° aes. that is z = 1. Back substitution of z = 1 


in the second equation gives y = —5 and, finally, substitution of z and y into the first 
equation gives x = 3. 


Example 8.37 Solve the following equations by Gaussian elimination: 


eH ytz=3 
x+5y—5z = 2 
xwx+y-z=1 


Solution Forming the augmented matrix, we find 


1-1 13 
Ls 3 3.2 
2 I1|-1 1 


Then, as before, we aim to eliminate all non-zero entries under the step. Starting with 
those in the first column, we find 


R, 1-1 1 3 

R, > R, —-R, 0; 6 -6 —-l 

R, > R, —2R, 0 3)/-3 —5 
Then, 

R, 1-1 1 3 

R, 0; 6 -6 -1 

R, > 2R,—R, 0 OO; O -9 


This last line implies that 0x-+ Oy+0z = —9, which is clearly inconsistent. We conclude 
that there are no solutions. 


You will see from Examples 8.36 and 8.37 that not only have all entries under the step 
been reduced to zero, but also each successive row contains more leading zeros than the 
previous one. We say the system has been reduced to echelon form. More generally the 
system has been reduced to echelon form if for i < j the number of leading zeros in row 
J is larger than the number in row i. Consider Example 8.38. 


Example 8.38 Solve the following equations by Gaussian elimination: 


2x-y+z=2 
—2x+y+z=4 
6x — 3y —2z = —9 


Solution Forming the augmented matrix, we have 


22>) L£ 2 
—2; 1 I 4 
6 —3)/-—2 —9 
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Eliminating the unwanted values in the first column, we find 


R, |2 -1 1 2 
R, > R,+R, 0; 0 2 6 
R, > R, —3R, 0 O;-—5 —15 


Entries under the step are now zero. To reduce the matrix to echelon form we must ensure 
each successive row has more leading zeros than the row before. We continue: 


R, |2 -112 
R, 0) 026 
R= 2h, +Sk, 0 0|0 0 


which is now in echelon form. 

In this form there is a row which starts off with a non-zero x value, that is the first 
row, there is a row which starts off with a non-zero z value, but no row which starts off 
with a non-zero y value. Therefore, we choose y to be the free variable, y = 4 say. From 
the second row we have z = 3 and from the first 2x — y+ z = 2, so that 2x = 4 — 1, that 
isx = (A—1)/2. 


Engineering application 8.2 


The Vandermonde Matrix 


Many data storage devices, for example the Blu-ray disc”, and data transmission 
standards, for example WiMAX®, have built-in techniques to reduce the effects of 
errors which occur during normal use. These errors originate from noise and inter- 
ference and they can result in the loss of data. One such technique is the use of an 
error-correcting code to provide a measure of protection against data errors. Error- 
correcting codes are used to encode the data when it is stored or transmitted. The 
data is then decoded when it is read or received. In the case of a transmission line, 
the error-correcting code makes it possible to detect errors in the received signal and 
to make corrections, so that the errors are not subsequently retransmitted. 

One important class of error-correcting codes are based on Reed-Solomon codes, 
details of which are beyond the scope of this text. However an important mathemat- 
ical concept used in Reed-Solomon codes is that of a Vandermonde matrix. 

The Vandermonde matrix can be illustrated by considering the problem of rep- 
resenting a signal by a polynomial. For example, to approximate a signal f(t) by a 
second-degree polynomial we write 


f@®) Ya tatt+a? 


where dy, a, and a, are coefficients of the polynomial which must be found. These 
are found by forcing the original signal f(t) and its polynomial approximation to 
agree at three different values of t, say fj, t, and t,. This gives rise to the following 
system of equations: 


2 

1 ty 1 dy f(t) 
1a] la)=|f@) 
1 t L ay f(b) 
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8.10.1 


Example 8.39 


Solution 


The coefficient matrix is known as a Vandermonde matrix. Suppose we wish to find 
TU 

a second-degree polynomial approximation to the signal f(t) = cos — for values of 

t between —1 and 1. We can do this by making the approximating polynomial and 


the original signal equal at three points, say t = —1, t = 0 andt = 1. The equations 
to be solved are then 

1-11 Ay fC1) 0 

LOR ON Sara — sie (O) eal 

itil ay fd) 0 
It is straightforward to solve this system of equations by Gaussian elimination and 
obtain a) =1, a, =0 and a, = —1. Therefore the second-degree polynomial which 


F Tt 2 
approximates f(t) = cos = is 1 —ft*. 


Finding the inverse matrix using row operations 


A similar technique can be used to find the inverse of a square matrix A where this exists. 
Suppose we are given the matrix A and wish to find its inverse B. Then we know 


AB=I 
that is, 


4, Ay. 443 by Dy Dy 
b 


oreo 
- OO 


1 
44, Ag. y3 by, by by, | = | 0 
43, 37 433 b3, bs) bs; 0 
We form the augmented matrix 

a, A a4, 100 

Ay, Any Ay, O10 

dy, Ax a3, 0 0 1 


Now carry out row operations on this matrix in such a way that the Lh.s. is reduced to a 
3 x 3 identity matrix. The matrix which then remains on the r.h.s. is the required inverse. 


Find the inverse of 


af B28 
A=|-6 49 -10 
a a 


by row reduction to the identity. 


We form the augmented matrix 


-l 8 —2 100 
—6 49 -10 010 
—4 34 -5 001 


We now carry out row operations on the whole matrix to reduce the |.h.s. to an identity 
matrix. This means we must eliminate all the elements off the diagonal. Work through 
the following calculation yourself: 


1 
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R, Aha? ioe 
R, +R, —6R,| 01 2-610 
RR —4R, 02 3 4-01 


This has removed all the off-diagonal entries in column 1. To remove those in column 2: 


R, > R, — 8R, 10-18 49 -8 0 
R, 0 1 2-6 10 
R, > R, — 2R, 00 -1l 8 —2 1 
To remove those in column 3: 
R, > R, — 18R, -10 0 —95 28 -—18 
R, > R, +2R, 01 0 10 -3 2 
R, 00 -1 8 —2 1 


We must now adjust the ‘—1’ entries to obtain the identity matrix: 


Reo, (Oo 65 38. 18 
R O10 16 —3 3 
RRA 0 OW 8 2 at 


Finally, the required inverse is the matrix remaining on the r.h.s.: 


95 —28 18 
10 -3 2 
—8 2-1 


You should check this result by evaluating AA~!. 


EXERCISES 8.10 
Solve the following equations by Gaussian 2 Use Gaussian elimination to solve 
elimination: 
x+y+zZ=7 
(a) 2x —3y = 32 3 9 
3x+7y = —21 Pee 
ax+y-z=1 


(b) 


(c 


Nar 


(d) 


(e) 


2x+y—3z=-5 


x-y+22= 12 3 Find the inverses of the following matrices using the 
71x — 2y + 3z = 37 technique of Example 8.39: 

x+y=—z=1 

3x -—y+5z=3 (a) € 

Tx + 2y +32 =7 a2 

Wx+y—-z7=-9 42 

3x —2y4+4z=5 (b) ( 0 3 

2x —y+72= 33 aun 


4x + Ty + 8z =2 
5x+ 8y+ 13z =0 
3x+5y+7z=1 


fo) 

~— 

| 

aNe 

Nr Oo 

Re MN WwW wee 
——" 
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Solutions 


1, (a) x=7,y=-6 


if 21 
a ee ‘) 


(b) x=3,y=-5,z=2 


,{5-5 5 
(c) x=l—p,y=2u,z=p (bt) =] -4 13 —16 
(d) x=-3,y=1,2=4 


(e) Inconsistent 


me x=2,y=1,z=4 


8.11.1 


S\ 3 6 12 


, {79 6-3 
@ — 1-37 2 4 
aN 4 ef 


EIGENVALUES AND EIGENVECTORS 


We are now in a position to examine the meaning and calculation of eigenvalues and 
their corresponding eigenvectors. 


Solution to systems of linear homogeneous equations 
Recall that an equation is linear when the variables occur only to the first power. For 
example, 
2x+3y = 1 (1) 
is a linear equation but 
2? +3y=1 (2) 


is a non-linear equation due to the term 2x”. 
Equation (1) is called inhomogeneous. When the r.h.s. of a linear equation is 0 then 
the equation is homogeneous. For example, 


2x + 3y =0 and 7x —3y=0 


are both homogeneous. This section looks at the solution of systems of linear homoge- 
neous equations. 
Consider the simultaneous linear homogeneous equations 

ax + by = 0 

cx+dy = 0 
where a, b, c and d are constants. Clearly x = 0, y = 0 is a solution. It is called the 
trivial solution. Non-trivial solutions are solutions other than x = 0, y = 0. We now 
study the system to find conditions on a, b, c and d under which non-trivial solutions 


exist. 
For definiteness we consider two cases with values of a, b, c and d given. 


Case 1 


3x —5y = 0 
6x —7y = 0 
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Solving Case 1, for example by Gaussian elimination, leads to x = 0, y = 0 as the only 
possible solution. Thus, the only solution is the trivial solution. 


Case 2 


xt+ty=0 
2x+2y = 0 


This system was solved in Example 8.35 using Gaussian elimination to yield 


where A is any number and y is a free variable. Thus there are an infinite number of 
solutions. Note that in this system the second equation, 2x + 2y = 0, is a multiple of the 
first equation, x + y = 0. The second equation is twice the first equation. 

We now return to the system 


ax + by = 0 
cx+dy = 0 


As seen, depending upon the values of a, b, c and d the system has either only the trivial 
solution or an infinite number of non-trivial solutions. For there to be non-trivial solu- 
tions the second equation must be a multiple of the first. When this is the case, then c is 
a multiple of a and d is the same multiple of 5, that is 


c= aa, d=ab for some value of a 
In this case, consider the quantity ad — bc: 


ad — bc = a(ab) — b(aa) 
aab — aab 


= 0 


Hence the condition for non-trivial solutions to exist is that ad — bc = 0. Writing the 
system in matrix form gives 


(eG) =(0) 


AX =0 


or 


where 


HC =O = 0) 


We note that ad — bc is the determinant of A, so non-trivial solutions exist when the 
determinant of A is zero; that is, when A is a singular matrix. 
In summary: 
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Consider the system 


AX =0 


If |A| = 0, the system has non-trivial solutions. 
If |A| 4 0, the system has only the trivial solution. 


Example 8.40 Decide which of the following systems of equations has non-trivial solutions: 


(a) 


(b) 


Solution (a) 


(b 


wm 


Example 8.41 
(a) 


(b) 


3x + 7y =0 

2x-—y=0 

2x+y=0 

6x+3y=0 

We write the system as 
(> -1)G)=() 
2-1) \y/~ \O 


Let 


Then 
|A| = 3(—1) —2(7) = -17 


Since the determinant of A is non-zero, the system has only the trivial solution. 


We write the system as 


(5 3)()=(¢) 
s=(;) 


Let 


from which |A| = 2(3) — 1(6) = 0. Since the determinant of A is 0, the system has 


non-trivial solutions. 


Determine which of the following systems of equations has non-trivial solutions: 


2x +y—3z=0 
x—3y+2z=0 
5x — 8y + 3z=0 
2x +y—3z=0 
x—3y+2z=0 


5x — 7y+3z=0 


1 


2 
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Solution (a) We have 
AX =0 
where 
1 —3 x 
A=]{1-3 2 X=|y 
-8 3 z 
Evaluation of |A| shows that |A| = 0 and so the system has non-trivial solutions. 
(b) Here 
2 1 -3 
A=|1-3 2 
5 -—7 3 
from which |A| = —7. Since |A| 4 0 the system has only the trivial solution. 
EXERCISES 8.11.1 
Explain what is meant by the trivial solution of a 3 Determine which of the following systems have 
system of linear equations and what is meant by a non-trivial solutions: 
non-trivial solution. 
2y— z=0 
Determine which of the following systems has ie aii 
non-trivial solutions: se+ yt2e=O 
(a) x-2y=0 age =0 
3x — 6y = 0 (b) 2x—3y—2z=0 
(b) 3x t+ y=0 a a a 
9x + 2y=0 x—Ty— z=0 
(c) 4x-—3y=0 (c) x+2y+3z=0 
—4x+ 3y =0 4x —3y-— z=0 
(d) 6x-—2y=0 6x+ y+3z=0 
dx - Fy =0 (d) x+ 3z=0 
(e) y= 2x r= ¥ =0 
x = 3y y+2z=0 
Solutions 
(a), (c) and (d) have non-trivial solutions. | 3 (a) and (b) have non-trivial solutions. 


8.11.2 


Eigenvalues 


We will explain the meaning of the term eigenvalue by means of an example. Consider 


the system 


2x+y =Ax 
3x + 4y = dy 
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where 1 is some unknown constant. Clearly these equations have the trivial solution 
x = 0, y = 0. The equations may be written in matrix form as 


21 x x 
(3.4) ()=#() 
or, using the usual notation, 
AX =iX 


We now seek values of A so that the system has non-trivial solutions. Although it is 
tempting to write (A — 4)X = 0 this would be incorrect since A — A is not defined: A 
is a matrix and A is constant. Hence to progress we need to write the r.h.s. in a slightly 


different way. To help us do this we use the 2 x 2 identity matrix, 7. Now A ') may be 


expressed as 


(5.1) (8 


since multiplying (;) by the identity matrix leaves it unaltered. So AX may be written 


as AIX. Hence we have 
AX = 2IX 
which can be written as 


AX — AIX =0 
(A—AaADX = 0 


Note that the expression (A — A/) is defined since both A and AJ are square matrices 
of the same size. 

We have seen in Section 8.11.1 that for AX = 0 to have non-trivial solutions requires 
|A| = 0. Hence for 


(A —ANX =0 
to have non-trivial solutions requires 


|A—Al| =0 


a= G)-269 


Now 


1 
0 
fz. fat 
~\3 4) loa 
(t= at 
~\ 3 4=) 


So the condition |A — AI| = 0 gives 


2-ir 1 
3 4-2 


It follows that 
(2—A)(4-A)-3 =0 
W-644+5=0 
(A—1)(A—5) =0 
so that 


A=1 or 5 
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These are the values of A which cause the system AX = AX to have non-trivial solutions. 


They are called eigenvalues. 
The equation 


|A—Al| =0 


which when written out explicitly is the quadratic equation in A, is called the charac- 


teristic equation. 


x+4y = Ax 
2x + 3y = hy 


has non-trivial solutions. 


Solution We write the system as 
AX = 1X 


where 


(9). 26 


To have non-trivial solutions we require 
|A —Al| = 0 


Now 


> 
| 
~ 
N 
ll 
a. 
— 
aN 


Il II 
i 
N | 
1oS) 
| = 
» 

Ser 


Hence 
|A—AZ| = (1-—A)G-A)-8 
= 47-4. -5 
To have non-trivial solutions we require 
W—-4,4-5 =0 
(A+1)A—5) =0 


Example 8.42 Find values of 4 for which 
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which yields 
A=-1 or 5 


The given system has non-trivial solutions when A = —1 and A = 5. These are the 
eigenvalues. 


If A is a 2 x 2 matrix, the characteristic equation will be a polynomial of degree 2, 
that is a quadratic equation in A, leading to two eigenvalues. If A is a3 x 3 matrix, the 
characteristic equation will be a polynomial of degree 3, that is a cubic, leading to three 
eigenvalues. In general ann x n matrix gives rise to a characteristic equation of degree 
n and hence to n eigenvalues. 


The characteristic equation of a square matrix A is given by 
|A —Al| = 0 


Solutions of this equation are the eigenvalues of A. These are the values of 4 for 
which AX = AX has non-trivial solutions. 


Example 8.43 Determine the characteristic equation and eigenvalues, A, in the system 
SN EX x 
(1 5)Q)=G) 


Solution In this example the equations have been written in matrix form with A = (; ) The 


characteristic equation is given by 


|A —Al| = 0 
31 10 
(13) i) =° 
Ga3: -i 
ta ea? 


G=1G=a)41=0 
7 —81+16 =0 


The characteristic equation is 47 — 8A + 16 = 0. Solving the characteristic equation 


gives 
7 — 81 +16 = 0 
(A-—4)A-—4) = 0 
dX = 4 (twice) 


There is one repeated eigenvalue, 7 = 4. 
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Example 8.44 Find the eigenvalues A in the system 
4 1)\ (x x 
(2) ()-"6) 


Solution We form the characteristic equation, |A — A/| = 0. Now 


4—-—xX il 
are 


Then 
|A —Al| = (4—A)(2—A) —3 =A?7- 6045 
Solving the characteristic equation, A7 — 6 + 5 = 0, gives 
A=1 or 3 


There are two eigenvalues, A = 1,4 =5. 
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The process of finding the characteristic equation and eigenvalues of a matrix has 
been illustrated using 2 x 2 matrices. This same process can be applied to a square 


matrix of any size. 


Example 8.45 Find (a) the characteristic equation (b) the eigenvalues of A where 


1 2 O 
A=]-l1-1 1 
3 2 2 


Solution (a) We need to calculate |A — AJ|. Now 


1-A 2 0 
A-AI={-1 -1-<A 1 
3 2 —2—-A 
and 
1-A 2 0 
—-1-A 1 —1 1 
-1 -1l-A 1 = 1-2) > 32s rs 


= (1-—A)[(-1 —A)(—2 — A) — 2] 
—2[—1(—2 — 4) — 3] 
Upon simplification this reduces to —4? — 24? + 4 + 2. Hence 
|A—Al| =0 
yields 
— -— 207 4+442=0 
which may be written as 
4217 -2-2=0 


The characteristic equation is 4° + 247-2 —2=0. 
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(b) The characteristic equation is solved to yield the eigenvalues: 


4207 -2-2=0 


Factorizing yields 


A+2)A+1I)A-1)=0 


from which 7 = —2, —1, 1. 
The eigenvalues are A = —2, 


EXERCISES 8.11.2 


Calculate (i) the characteristic equation (ii) the 
eigenvalues of the system AX = XX, 
where A is given by 


5 6 3 4 
(a) € ) (b) e :) 
fo 1 3 
o (4) © (4) 


Calculate (i) the characteristic equation (ii) the 
eigenvalues of the following 3 x 3 matrices: 
1-1 2 
(a) | -3 -—2 3 
2-11 


Solutions 


(a) (i) A27-6A-—7=0 
(ii) A= —1,7 

(b) Gi) a2 —22+1=0 
(ii) A = 1 (twice) 

(c) @) A27—114+30=0 
(ii) A=5,6 

(d) @) A27-13=0 
(ii) A = —V13, S13 


2 (a) Gi) -A3+714+6=0 


Technical Computing Exercises 8.11.2 


The calculation of eigenvalues and eigenvectors is 
usually performed by a built-in function. For example, 
in MATLAB® the function eig can used to calculate 
the eigenvalues. 


=1, 1. 


io -=1 
(b) [3 1 4 
02 2 
a an) 
(c) {-1 1 -1 
ee. 0 
ft 62 
(dd) | 0 34 
7 <3 5 
coe ae 
(e) | 2 -4 3 
16 -4 1 


(ii) A= —2, -1,3 
(b) G@) AX — 422-34 412 =0 
(ii) A= —V3, V3, 4 
(c) Gi) a3 —342-—101 +24 =0 


(ii) A = —3, 2,4 
(d) G@) A2— 622 4+52 =0 
(ii) A =0, 1,5 
(ec) @) a3—13A+12=0 
(ii) A= —4, 1,3 


To produce a solution to question 1. (a) in the previous 
exercise we would type: 


A= (56; 21]; 
eig(A) 


which produces: 


8.11.3 


Example 8.46 


Solution 
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which is a vector containing the eigenvalues. 


Use MATLAB® or a similar language to confirm the 
rest of the eigenvalues given in Exercises 8.11.2. 


Eigenvectors 
We have studied the system 
AX =AX 


and determined the values of 4 for which non-trivial solutions exist. These values of A are 
called eigenvalues of the system, or, more simply, eigenvalues of A. For each eigenvalue 
there is a non-trivial solution of the system. This solution is called an eigenvector. 


Find the eigenvectors of 
AX =AX 


where 


(2) m6) 


We seek solutions of AX = AX which may be written as 
(A —ANX =0 


The eigenvalues were found in Example 8.44 to be 1 = 1,5. 
Firstly we consider A = 1. The system equation becomes 


A311 = 
(A-D)X =0 


()-C10-0 
E00) 


Written as individual equations we have 


3x+y = 0 

3x+y =0 
Clearly there is only one equation which is repeated. As long as y = —3x the equation is 
satisfied. Thus there are an infinite number of solutions such as x = 1, y = —3;x = —5, 


y = 15; and so on. Generally we write 
x=t, y=-3t 


for any number f¢. Thus the eigenvector corresponding to A = | is 


OJ 
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Note that the eigenvector has been determined to within an arbitrary scalar, t. Thus there 
is an infinity of solutions corresponding to A = 1. 
We now consider 2 = 5 and seek solutions of the system equation: 


6) 910-0 
H N6)-(9 


Written as individual equations we have 
—x+y=0 
3x —3y = 0 
We note that the second equation is simply a multiple of the first so that in essence there 


is only one equation. Solving —x+y = 0 gives y = x for any x. So we writex =t,y =f. 


Hence the eigenvector corresponding to A = 5 is X =t¢ : . Again the eigenvector has 


been determined to within an arbitrary scaling constant. 
Sometimes the arbitrary scaling constants are not written down; it is understood that 
they are there. In such a case we say the eigenvectors of the system are 


<Q) =f) 


Determine the eigenvectors of 


(1 5)()=*6) 


In Example 8.43 we found that there is only one eigenvalue, A = 4. We seek the solution 
of (A —ANDX = 0. With A = 4 we have 


a= (4 5)-4(0 1) = (2 1) 
Hence (A — AJ)X = 0 is the same as 
(11) G)=() 
—1 1) \y 0 
Thus there is only one equation, namely 
—x+y=0 


which has an infinity of solutions: x = t, y = t. Hence there is one eigenvector: 


=() 


8.11 Eigenvalues and eigenvectors 


The concept of eigenvectors is easily extended to matrices of higher order. 


1 2 0O x x 
-1 -1 1 yJ=Aly 
3 2-2 Zz Zz 


The eigenvalues were found in Example 8.45. 
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Example 8.48 Determine the eigenvectors of 


Solution From Example 8.45 the eigenvalues are 4 = —2, —1, 1. We consider each eigenvalue in 


turn. 
A=-2 
1 2 0O x x 
-1 -1 1 y} =-2]y 
3 2-2 Zz Zz 
100 x 0 
—-1 -1 1)]42/010 y}] =] 0 
= 001 Zz 0 
3 2 0 x 0 
—-1 11 y| =] 0 
3 2 0 Z 0 
We note that the first and last rows are identical. So we have 
3x + 2y =0 
—x+y+z=0 


Solving these equations gives 


t _ >, 
Pe > =z, = — 
.. og 


Hence the corresponding eigenvector is 


1 
3 
X=r|] 2 
5 
2 
A=-1 
We have 
1 2 0O 100 x 0 
—-1 -1 1]4+1]7010 y}| = | 0 
2 —2 001 Zz 0 
22 0 x 0 
-10 1 y| =] 0 
32 -l z 0 
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Thus we have 


2x+ 2y =0 
—Xx + z=0 
3x+2y — z=0 


We note that the third equation can be derived from the first two equations, by subtracting 
the second equation from the first. If you cannot spot this the equations should be solved 
by Gaussian elimination. In effect we have only two equations: 


2x+2y =0 
—x +z =0 
Solving these gives x = t, y = —t, z = t. The eigenvector is 
X=t!|— 
A=1 
We have 
2 0 100 0 
-1 -1 I}]-|J010 y} = ]0 
2-2 001 Zz 0 
2 0 0 
-1 —2 1 = 10 
3: 23 Zz 0 
Thus we have 
2y — 
—x—2y+ z=0 


3x + 2y —3z=0 
From the first equation, y = 0; putting y = 0 into the other equations yields 


—x+z=0 
3x — 3z = 0 


Here the second equation can be derived from the first by multiplying the first by —3. 
Solving, we have x = t, z = t. So the eigenvector is 


1 
X=t]|0 
1 


EXERCISES 8.11.3 


1. Calculate the eigenvectors of the matrices given in 2 Calculate the eigenvectors of the matrices given in 
Question 1 of Exercises 8.11.2. Question 2 of Exercises 8.11.2. 


Solutions 


1 1 
e(eloye)(usiye) | 


1 1 
2 (ajt|5),¢[12],¢ 
1 5 


1 
(b)t | —5.0981 ],¢ 
2.7321 


8.12 Analysis of electrical networks 307 


Wile OlRm WIN Whee 


i —-D 


1 1 (e)t 6 
0.0981 },¢ | —3 2 
—0.7321 4 3 


ANALYSIS OF ELECTRICAL NETWORKS 


Matrix algebra is very useful for the analysis of certain types of electrical network. For 
such networks it is possible to produce a mathematical model consisting of simultaneous 
equations which can be solved using Gaussian elimination. We will consider the case 
when the network consists of resistors and voltage sources. The technique is similar for 
other types of network. 

In order to develop this approach, it is necessary to develop a systematic method 
for writing the circuit equations. The method adopted depends on what the unknown 
variables are. A common problem is that the voltage sources and the resistor values are 
known and it is desired to know the current values in each part of the network. This can 
be formulated as a matrix equation. Given 


V=RI' 

where 
V = voltage vector for the network 
I’ = current vector for the network 
R = matrix of resistor values 


the problem is to calculate /’ when V and R are known. /’ is used to avoid confusion with 
the identity matrix. 

Any size of electrical network can be analysed using this approach. We will limit the 
discussion to the case where /’ has three components, for simplicity. The extension to 
larger networks is straightforward. Consider the electrical network of Figure 8.2. Mesh 
currents have been drawn for each of the loops in the circuit. A mesh is defined as a loop 
that cannot contain a smaller closed current path. For convenience, each mesh current 
is drawn in a clockwise direction even though it may turn out to be in the opposite 
direction when the calculations have been performed. The net current in each branch of 
the circuit can be obtained by combining the mesh currents. These are termed the branch 
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Figure 8.2 
An electrical network with mesh currents 
shown. 


currents. The concept of a mesh current may appear slightly abstract but it does provide 
a convenient mechanism for analysing electrical networks. We will examine an approach 
that avoids the use of mesh currents later in this section. 

The next stage is to make use of Kirchhoff’s voltage law for each of the meshes in the 
network. This states that the algebraic sum of the voltages around any closed loop in an 
electrical network is zero. Therefore the sum of the voltage rises must equal the sum of 
voltage drops. When applying Kirchhoff’s voltage law it is important to use the correct 
sign for a voltage source depending on whether or not it is ‘aiding’ a mesh current. 

For mesh 1 

BE, = 1R,+1R,+ 1 -L)R,+ G —-1)R; 

E, = 1,(R, +R, + Ry + R3) + L(-R3) +3(-Ry) 
For mesh 2 

—E, — Ey = 1,Rs + Uh — 1) R3 + (hh — 1) Ro + HRs 

—E, — Ey = 1,(-R3) + (Rs + Rs + Re + Rg) + 13(—Ro) 
For mesh 3 

E, = U, —1,)R, + U, —1,)Ry + LR, 

Ex = I(-Ry) + L(-Re) + (Ro + Ry + Bz) 


These equations can be written in matrix form as 


E R, +R, +R, +R; -R, oR; I, 
—E,—E;} = ai Rs +R, +Re +Ry —R i 
E; -R, hte Ro t+ RyRy) \g 


Engineering application 8.3 


Calculation of mesh and branch currents in a network 


Consider the electrical network of Figure 8.3. It has the same structure as that of 
Figure 8.2 but with actual values for the voltage sources and resistors. Branch currents 
as well as mesh currents have been shown. Calculate the mesh currents and hence the 
branch currents for the network. 
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Figure 8.3 

The electrical network of Figure 8.2 
with values for the source voltages and 
resistors added. 


Solution 


We have already obtained the equations for this network. Substituting actual values 
for the resistors and voltage sources gives 


3 Sea ee 
S24 = |o3 14-2 Ee 
4 Sie 2 6) ar 


This is now in the form V = RI’. We shall solve these equations by Gaussian elimi- 
nation. Forming the augmented matrix, we have 


10 -3 -1 3 
—3 14 -2 -6 
=| =2 @ @ 
Then 
R, 10 -—3 -1l 3 
R, = 10K, + 3R, O 131 —23 —51 
R, > 10R,+R, QO -—23 59 43 
and 
R, 10 -3 -l 3 
I Q 13 =23) = oil 
R, — 131R, + 23R, 0 0 7200 4460 
Hence, 
4460 
Ih = =— S619 A 
7200 
Similarly, 
—51 + 23(0.619 
= eae ee age 
131 
Finally, 


7 = 340.619 + 3(-0.281) 


= 0.278 A 
i 10 
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The branch currents are then 
ih, S fh = 23 wv 
hs ih =i = =231l = 2s = —S9 TVA 
= 1h = 1h =O = 2S = syiil iaava\ 
ly = lh = 23il VA 
Sh =1h = —23l = O19 = CUO mv 
I, = L, =619mA 


An alternative approach to analysing an electrical network is to use the node voltage 
method which is often simply called nodal analysis. This technique is fundamental to 
many computer programs which are used to simulate electrical circuits, such as SPICE. 
We introduce this technique by means of an example. 


Engineering application 8.4 


Analysing an electrical network using the node voltage method 


The node voltage method utilises the notion that ‘islands’ of equal potential lie be- 
tween electrical components and sources. The procedure is as follows: 


(1) Pick a reference node. In order to simplify the equations this is usually chosen 
to be the node which is common to the largest number of voltage sources and/or 
the largest number of branches. 


(2 


wm 


Assign a node voltage variable to all of the other nodes. If two nodes are separated 
solely by a voltage source then only one of the nodes need be assigned a voltage 
variable. The node voltages are all measured with respect to the reference node. 


(3 


wm 


At each node, write Kirchhoff’s current law in terms of the node voltages. Note 
that once the node voltages have been calculated it is easy to obtain the branch 
currents. 


We will again examine the network of Figure 8.2, but this time use the node voltage 
method. The network is shown in Figure 8.4 with node voltages assigned and branch 
currents labelled. The reference node is indicated by using the earth symbol. Writing 
Kirchhoff’s current law for each node, we obtain: 


node a 
i, Sil 
Ue OF ei eae 
R, 7 R, 


Vio eh VR VS VaR 
V,(R, + Ry) — VR, — VaR; = E,R, 
node b 
hari srl =O 
Vcr lei Vin Vera 


R, R; Rs 
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Figure 8.4 
The network of Figure 8.2 with node 
voltages labelled. 


Rearrangement yields 


V,R3R; ar E\R3Rs a V,RRs a VR\Rs - VARS aly V,R\R; 
ae alka ite, — WIR Ike, 0 
that is, 
V,R3R, — V,(RR, oF RR; ar R3Rs) ar VR Rs + VR R; 
= ERR; + ERR, 


node c 
fh Ih, sell, 
Vie Gee 
Rie 6 oR) R, 
so that 


V,RyRe = VRARe = VR3Re a VyR3Re a VR3R, — ERR, 
that is, 

Vi Ryko — V(RyRe + R3Re + R3R,) oF ViR3 Re = —E3R,R, 
node d 

ap ll, = th 

aa hea = Va 

R, R, R, 
V,RAR, in ViR,R, AF VAR, a V,R,R, = ViR,Ry 
V,R,R, + V.RoR, — Va (R,R, s R,R, a R,R,) = (I) 


node e 
Hy = 1k, 
Via eee 


R, R, 
V,Ry — V.(Rs + Rg) = —E,Rg 
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These equations can be written in the matrix form AV = B, where A is the matrix 


R, +R, Ss 0 =R 0 
R,R, —R,R, — RR; — R3Rs R,Rs 0 R,R, 
0 RiRg =R,R, — RRs — R3R, R3Re 
R,R; 0 RyRy =R,R,—R,R,—RR, 0 
0 Rg 0 0 Shek, 
and 
We E,R, 
v, E,R,R; + E,R,R, 
Y= ||, and j= —E,R,R, 
Vi 0 
Ue —E,R 


The equations would generally be solved by Gaussian elimination to obtain the node 
voltages and hence the branch currents. 

Using the component values from Engineering application 8.3, these equations 
become 


Gr Aca a a ONY ib 
52 = 3 le lO OL Molly: 57 
pes Wee weer senate | fava (s [eeeeaie) 
Sie OMe 10m |hy 0 
Oe Se Wr Sayan = 


Use of a computer package avoids the tedious arithmetic associated with Gaussian 
elimination and yields 


= 2909) V, = 0.5250 V. = 2.200 V, = 1.858 4 = 1.122 
It is then straightforward to calculate the branch currents: 
W=W, Vane 
== * 2778 mA f, = = © = 558 mA 
R, R, 
WH, V. 
I, = <—* = 342 mA I, = — =281 mA 
R, Rg 
WIE Vv. 
i= —3=-900mA  £=~=619mA 
Rg R, 


Compare these answers with those of Engineering application 8.3. 

It is possible to analyse electrical networks containing more complex elements 
such as capacitors, inductors, active devices, etc., using the same approach. The equa- 
tions are more complicated but the technique is the same. Often it is necessary to use 
iterative techniques in view of the size and complexity of the problem. These are 
examined in the following section. 


eS) ITERATIVE TECHNIQUES FOR THE SOLUTION 
OF SIMULTANEOUS EQUATIONS 


The techniques met so far for the solution of simultaneous equations are known as di- 
rect methods, which generally lead to the solution after a finite number of stages in 
the calculation process have been carried out. An alternative collection of techniques is 


Example 8.49 


Solution 
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available and these are known as iterative methods. They generate a sequence of ap- 
proximate solutions which may converge to the required solution, and are particularly 
advantageous when large systems of equations are to be solved by computer. We shall 
study two such techniques here: Jacobi’s method and Gauss-Seidel iteration. 


Solve the equations 


a2x+y=4 
x—3y = —5 


using Jacobi’s iterative method. 


We first rewrite the equations as 


2x =-y+4 
—3y =-—x—5 
and then as 
x= -=yt+2 
1 ie (8.10) 
= -— xX = 
ag 


Jacobi’s method involves ‘guessing’ a solution and substituting the guess in the r.h.s. of 
the equations in (8.10). Suppose we guess x = 0, y = 0. Substitution then gives 


, a 


Wwlund 


y= 


We now use these values as estimates of the solution and resubstitute into the r.h.s. of 
Equation (8.10). This time we find 


1/5 
i = (3) +2 = 1.1667 (to four decimal places) 


1 5 
y= 3?) + ri = 2.3333 (to four decimal places) 
The whole process is repeated in the hope that each successive application or iteration 
will give an answer close to the required solution, that is successive iterates will con- 
verge. In order to keep track of the calculations, we label the initial guess x, y, the 
result of the first iteration x“, y and so on. Generally, we find 


1 
+h — __,™ 49 
x oe + 


yer) = 
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Solution 


Table 8.1 
Iterates produced by Jacobi’s method. 
Iteration no. (n) x) yo 
0 0 0 
1 2.0000 1.6667 
2 1.1667 2.3333 
3 0.8333 2.0556 
4 0.9722 1.9444 
5 1.0278 1.9907 
6 1.0047 2.0093 
7 0.9954 2.0016 
8 0.9992 1.9985 
9 1.0008 1.9997 
10 1.0002 2.0003 


The results of successively applying these formulae are shown in Table 8.1. The sequence 


of values of x”) seems to converge to 1 while that of y“”) seems to converge to 2. 


Clearly this sort of approach is simple to program and iterative techniques such as 
Jacobi’s method are best implemented on a computer. When writing the program a test 
should be incorporated so that after each iteration a check for convergence is made by 
comparing successive iterates. In many cases, even when convergence does occur, it is 
slow and so other techniques are used which converge more rapidly. The Gauss-Seidel 
method is attractive for this reason. It uses the most recent approximation to x when 
calculating y leading to improved rates of convergence as the following example shows. 


Example 8.50 


Use the Gauss-Seidel method to solve the equations of Example 8.49. 


As before we write the equations in the form 


1 
=-- 2 
x as 
1 48 
— eet, f = 
ae ail 


With x = 0, y© = 0 as our initial guess, we find 
1 
x =—-(0)+2=2 
2 
To find y") we use the most recent approximation to x available, that is x“: 
1 2) 
@) — —(2) + = = 2.3333 
y gts 


Generally, we find 


1 
(n+1) _ __@) 2 
x > + 
yor) = 1 ot a 


3 3 


Example 8.51 


Solution 
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Table 8.2 

Iterates produced by the Gauss-Seidel 
method. 

Iteration no. (n) x) yr) 

0 0 0 

1 2.0000 2.3333 
2 0.8334 1.9445 
3 1.0278 2.0093 
4 0.9954 1.9985 
5 1.0008 2.0003 
6 0.9999 2.0000 


and the results of successively applying these formulae are shown in Table 8.2. As before, 
we see that the sequence x seems to converge to | and y” seems to converge to 2, 


although more rapidly than before. 


Both of these techniques generalize to larger systems of equations. 


Perform three iterations of Jacobi’s method and three iterations of the Gauss—Seidel 


method to find an approximate solution of 


—8x+ y+ z=1 
x—S5y+ z= 16 
x+ y-4z=7 


with an initial guess of x = y=z=0. 


We rewrite the system to make x, y and z the subject of the first, second and third equation, 


respectively: 

ad me 1 1 
ee 8 
1 ye 1 16 
= — JX =f = 

Mae ae eee 

_ il a 1 qd 

2 ATP | 


To apply Jacobi’s method we substitute the initial guess x© = yO = 7 = 0 into 
the r.h.s. of Equation (8.11) to obtain x, y™ and 2", and then repeat the process. In 


general, 

rt) a —y”) + 1 @ = 
8 8 8 
iI 1 16 

(n+l) __ (n) (n) 

SS" SZ 

- 5 5 5 
i 
@H) = Ay@ 4 iy _ 
Zz rh ae 2 4 
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We find 
@ =! _ 91950 
: 
16 
yD = ~~ = —3.2000 
OS a = 2 7500 
Then, 
(2) ! ! i 
x = 3 (—3.2000) + 3(—1.7500) — 5 = —0.7438 
(2) 1 | 1» 
= <(—0.1250) + —(—1.7500) — — = —3.5750 
5 5 5 
2 = 16 1250) + es 2000) — 7 _ 2.5813 
= 4 . 4 : 4 7 . 
Finally, 


ay a I 1 1 
x0 = 5 (-3.5750) + 5 (—2.5813) — 5 = —0.8945 


1 1 16 
gv 01A38) + ge == —3.8650 


1 1 7 
—(—0.7438) + —(—3.5750) — — = —2.8297 
ra J+ 56 \—-q 


To apply the Gauss-Seidel iteration to Equation (8.11), the most recent approxima- 
tion is used at each stage leading to 


1 1 
(n+1) = ,,(n) ae () ae 
* g Ts 
1 1 16 
(n+1) = ~x@t) a 7%) es, 
: 5 5 
1 1 7 
(n+l) _. = (+1) Sty | 
Zz ris + 4 


Starting from x© = y© = z = 0, we find 


1 
xD) = = —0.1250 


© = 19.1250) + «0 — “© = -3.2050 
a a 5 5 


2) = 796 1250) + Ge 2250) — 7 _ -2.5875 
fom 4 . 4 . 4 =a ‘Z 
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Table 8.3 
Comparison of the Jacobi and Gauss-Seidel methods. 
Iteration Jacobi’s method Gauss-Seidel 
et) @” y” 2) x”) y” 2”) 
0 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000 
1 —0.1250 —3.2000 —1.7500 —0.1250 —3.2250 —2.5875 
2 —0.7438 —3.5750 —2.5813 —0.8516 —3.8878 —2.9348 
3 —0.8945 —3.8650 —2.8297 —0.9778 —3.9825 —2.9901 
4 —0.9618 —3.9448 —2.9399 —0.9966 —3.9973 —2.9985 
5 —0.9856 —3.9803 —2.9767 —0.9995 —3.9996 —2.9998 
6 —0.9946 —3.9924 —2.9915 —0.9999 —3.9999 —3.0000 
7 —0.9980 —3.9972 —2.9968 
8 —0.9992 —3.9990 —2.9988 
9 —0.9997 —3.9996 —2.9996 
Then, 

Q) 1 1 1 

x = —(—3.2250) + =(—2.5875) — = = —0.8516 
8 8 8 
Q) 1 1 16 
yo = Bonne) + a eon ie = —3.8878 


z) 16 8516) + wa 8678) = Pe —2.9348 
ae 4°" 40 7° 


a. 1 1 1 
x = 5 (3.8878) + 5 (—2.9348) — 5 = —0.9778 


| 1 16 
5 (0.9778) + =(—2.9348) — = = —3.9825 


Il 


2 = 10 9778) + ee 9825) — 7 _ -2,9901 
40 40 400° 


For completeness, further iterations are shown in Table 8.3. 
As expected the Gauss—Seidel method converges more rapidly than Jacobi’s. This is 
generally the case because it uses the most recently calculated values at each stage. 


Unfortunately, as with all iterative methods, convergence is not guaranteed. However, 
it can be shown that if the matrix of coefficients is diagonally dominant, that is each 
diagonal element is larger in modulus than the sum of the moduli of the other elements 
in its row, then the Gauss—Seidel method will converge. 


Engineering application 8.5 


Finding approximate solutions for the node voltages of an 
electrical network 


When trying to analyse complicated electrical networks it is frequently necessary 
to resort to computer-based methods in order to find the voltages and currents. For 


=> 


318 


Chapter 8 Matrix algebra 


example, large networks for the distribution of electricity from power stations to 
points of consumption can be very complicated to analyse due to the large number 
of loops involved. This example illustrates the method by making use of the circuit 
given in Engineering application 8.3. However, it should be borne in mind that this 
approach is of most value when analysing much larger networks. 

Use Jacobi’s method and the Gauss-Seidel method to obtain approximate so- 
lutions for the node voltages of the electrical network examined in Engineering 
application 8.3. 


Solution 
The node voltage equations are 


6V, — 4V, — 2V, = 12 3V, + 12V, — 19V, =0 
15V, — 31V, + 10V, + 6V, = 57 4V, — 9V, = -8 
2v, — 11V, + 6V, = —12 


These can be rearranged to give 


DY, te We kG 3V, + 12V 

Vs b d w= a Cc 

a 3 ¢ 19 
15V, + 10V, + 6V, — 57 4v,+8 

Y= se 

31 9 
ae OV EEG 12 
om 11 


The results of applying Jacobi’s method with an initial guess of 


yo = ve = vO = vo = vO =( 


are shown in Table 8.4. Convergence was achieved to within 0.001 after 44 itera- 
tions. The results of applying the Gauss—Seidel method are shown in Table 8.5. Con- 
vergence was achieved to within 0.001 after 21 iterations. Clearly the Gauss—Seidel 
method converges more rapidly than Jacobi’s method. 


Table 8.4 

Node voltages derived from Jacobi’s method. 

Iteration no.(n) Vv yo ve? We vo) 
0 0.0000 0.0000 0.0000 0.0000 0.0000 
1 2.0000 —1.8387 1.0909 0.0000 0.8889 

20 2.8710 0.4802 Dall SY) 1.8265 1.0738 


44 2.9679 0.5243 2.1990 1.8578 i ails) 
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Table 8.5 

Node voltages derived from the Gauss-Seidel method. 

Iteration no.(n) ve? ve yo yo ve? 
0 0.0000 0.0000 0.0000 0.0000 0.0000 
1 2.0000 —0.8710 0.9326 0.9048 0.5018 

20 2.9665 0.5226 2.1985 1.8569 1.1212 

21 2.9674 0.5233 2.1989 1.8573 1.1215 

EXERCISES 8.13 
1 Perform three iterations of the methods of Jacobi and (b) Sxt+y—z=4 


Gauss-Seidel to obtain approximate solutions of the 


following. In each case, use an initial guess of 


(Vaya 4 


(a) 4x+y+z=-l 
x+ 6y + 2z =0 
x+2y+4z=1 

Solutions 
1 (a) Jacobi 


(b) 


xX, = —0.2500, y, = 0, z} = 0.2500 

Xy = —0.3125, yy = —0.0417, z, = 0.3125 
X3 = —0.3177, yz = —0.0521, z, = 0.3490 
Gauss-Seidel 

xX, = —0.2500, y; = 0.0417, z; = 0.2917 
Xy = —0.3333, yy = —0.0417, z, = 0.3542 
X3 = —0.3281, yz = —0.0634, z, = 0.3637 
Jacobi 

xX, = 0.8000, y, = 1, z, = 1.5000 

Xy = 0.9000, yy = 1.5750, z, = 2.4000 

X3 = 0.9650, yz = 1.8250, 23 = 2.7375 


(c) 


(c) 


x—4y+z=-4 


2x + 2y —4z = -6 


4x+y+z=17 
x+3y—-z=9 
2x-—y+5z=1 


Gauss-Seidel 

xX, = 0.8000, y,; = 1.2000, z; = 2.5000 
X, = 1.0600, y. = 1.8900, z, = 2.9750 
x3 = 1.0170, yz = 1.9980, z3 = 3.0075 
Jacobi 

x, = 4.2500, y; = 3, z; = 0.2000 

xX, = 3.4500, yy = 1.6500, z, = —0.9000 
x3 = 4.0625, yz = 1.5500, z; = —0.8500 
Gauss-Seidel 

x, = 4.2500, y,; = 1.5833, z; = —1.1833 
xX, = 4.1500, yy = 1.2222, z, = —1.2156 
Xz = 4.2484, y, = 1.1787, z, = —1.2636 


COMPUTER SOLUTIONS OF MATRIX PROBLEMS 


The use of technical computing languages has vastly improved the ability of engineers to 
calculate the solutions to complex engineering problems. As has been demonstrated by 
the examples given in previous chapters, high level mathematical functions are provided 
such as the ability to calculate the roots of an equation and the eigenvalues of a matrix. 
In electrical engineering problems it is frequently necessary to solve matrix equations. 
Consider again Example 8.32. The equations are of the form 


AX =B 
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where 
3 2-1 4 x 
A= |2-1 2 B=110 X=|y 
1 -—3 -4 5 Zz 


In order to solve these equations in a technical computing language the matrices need to 
be loaded into memory. In MATLAB® we would type 


A = (3, 2, —1; 2, —1, 2; 1, —3, —4] 
B = [4; 10; 5] 


We note that commas are used to separate entries on the same row and semicolons are 
used to separate different rows. 
The value of X can be obtained by the method of matrix inversion. The code for this is 


X = inv(A) *B 
Alternatively, X can be obtained by Gaussian elimination. The code for this is 
X=A\B 


Gaussian elimination is the usual method of solving systems of linear equations in 
MATLAB® because it is computationally efficient. If we run this three-line program 
using MATLAB® then we obtain 


A = [3, 2, —1; 2, -1, 2; 1, -3, —4] 


B = [4; 10; 5] 

X = A\B 

X = 3.0000 
—2.0000 
1.0000 


and so x = 3.0000, y = —2.0000, z = 1.0000. 

We see that MATLAB® is a powerful tool for carrying out matrix calculations. Part 
of its power derives from the extremely high-level nature of its commands. A command 
such as inv(A) would require typically 50 lines if written in a normal high-level language 
such as C or FORTRAN. 


Engineering application 8.6 


Solution of an electrical network using MATLAB” 


Consider again the electrical network of Engineering application 8.3. Write the 
MATLAB? code to solve this network. 


Solution 

The equations are of the form V = RI’ where V is the voltage vector of the network, 
R is the matrix of resistor values and /’ is the current vector of the network. We know 
V and R and we wish to obtain J’. Using the values of Engineering application 8.3 
we can write 


V = [8; —6; 4] 


IPRIME = inv(R) *V 
Running this program gives 
DERIMES— S027 


—0.2806 
0.6194 


1 = (10, —8, —ilg =e), 14, —28 =i, 
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REVIEW EXERCISES 8 


1 


Evaluate the following products: 


(a) (—4 1) (5) 


(c) (-4 1) € ) 


i 1 
(e) (2 -1 0)/2 40 
07 


if7 3 2 
@5(1 01) 


(b) (—4 1) (¢) 
GC) 
()5 é ‘) 


Simplif cosh@ sinh@ 
P'NY | sinh@ coshé|’ 
0 23 
Given thatA=|2 0 0] find A7! and A2. Show 
1-10 


that A? + 6A7! —7/ = 0, where J denotes the 3 x 3 
identity matrix. 


2-14 
IfA= {1 0 O}) find 
1 2 0 
(a) |A| 
(b) adj(A) 
(c) A! 


Find the inverse of the matrix 


1 —2 0 
2 5 
-1 23 


Hence solve the equations 


x—2y=3 
3x+y+5z= 12 
—x+2y+3z=3 


6 


Use Gaussian elimination to solve 
x+2y—3z+2w=2 
2x + Sy — 8g +6w =5 
3x + 4y —5z+2w=4 


Use Jacobi’s method to obtain a solution of AX = B 
to three decimal places where 


10 1 0O 1 
A=] 1 10 1 and B={|2 
0 1 10 1 


Use a matrix method to solve 


2x+y-—z=3 
x-—y+2z=1 
3x+4y+3z=2 


Consider the Vandermonde matrix 
i 

V= [1 

1 


(a) Find det V and show that it can be written as 
(a—c)(a—b)(c —b). 


(b) Show that if a, b and c are all different, then the 
Vandermonde matrix is non-singular. 


(a) The signal f(t) = sin : is to be approximated 
by a third-degree polynomial for values of t 
between —1 and 2. By forcing the original signal 
and its approximating polynomial to agree at 
t=—1,t=0,t=1 andt = 2, find this 
approximation. [Hint: see Engineering 
application 8.2.] 


(b) Use a graphics calculator or graph plotting 
package to compare the graphs of f(t) and its 
approximating polynomial. 
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11 Suppose A = ¢ a) 
: —2 
(a) Find Av when v = ( : 
(b) Find 3v when v = (~): Deduce that Av = 3v. 
(c) Find Av when v = ©) for any constant jz. 
Deduce that Av = 3v. 
12 Use the Gauss-Seidel method to find an approximate 
solution of 
(a) 5x+3y = —34 
2x — Ty = 93 
(b) 3x +y+z=6 
2x+5y—-—z=5 
x— 3y4+ 8z= 14 
13 Determine which of the following systems have 
non-trivial solutions. 
(a) 2x-y=0 
3x — 1.5y =0 
(b) 6x+5y=0 
5x + 6y =0 
(c) —x—-4y=0 
2x + 8y = 0 
(d) 7x—-3y=0 
1.4x — 0.6y = 0 
(e) —4x+5y=0 
3x —4y = 0 
14 Determine which of the following systems have 
non-trivial solutions. 
(a) 3x—2y+2z=0 
x-y+z=0 
2x+2y—z=0 
(b) x+3y—-z=0 
4x-—y+2z=0 
6x + Sy = 0 
Solutions 
1 (a) -1 (b) —6 
—5 
(c) (-I -6) (d) 1 
10 5 
(© @ 2 2) (f te an) 
7 3 
sa ° 
@ |, 1 
~ 0 - 
2 2 


15 


16 


17 


(c) x+2y—z=0 
x—3z=0 
5x + 6y —9z=0 


The matrix A is defined by 


s=(5 i) 


(a) Determine the characteristic equation of A. 
(b) Determine the eigenvalues of A. 
(c) Determine the eigenvectors of A. 


(d) Form a new matrix M whose columns are the two 
eigenvectors of A. M is called a modal matrix. 


(e) Show that M—!AM isa diagonal matrix, D, with 
the eigenvalues of A on its leading diagonal. D is 
called the spectral matrix corresponding to the 
modal matrix M. 


(a) Show that the matrix 


5 2 
A=(4}) 
has only one eigenvalue and determine it. 


(b) Calculate the eigenvector of A. 


The matrix H is given by 


4-11 
H=|-2 40 
—-4 31 


(a) Find the eigenvalues of H. 
(b) Determine the eigenvectors of H. 


(c) Form anew matrix M whose columns are the 
three eigenvectors of H. M is called a modal 
matrix. 


(d) Show that M —lHM isa diagonal matrix, D, with 
the eigenvalues of H on its leading diagonal. D is 
called the spectral matrix corresponding to the 
modal matrix M. 


1 
ifo 2.4 
A a0 3 -6 
22 4 
7 3-10 
A*=! 0 46 
=2 2.3 


11 


i: <8 0 

(a) -8 (b) | 0 -44 

a Si 
,(2 8 0 
© go 4-4 
4 1 
1 { 77 6 -10 
aoe 
70 7 


x=ly=-1,z=2 
x=2Q—-A,yH=14+2A—-—2p,z=A,w=p 
x = 0.082, y = 0.184, z = 0.082 


x = 1.462, y = —0.308, z = —0.385 


(a) f= ee 
~ 3° 3 


(a) (2) (b) (2) 


12 


13 
14 
15 


16 


17 


Review exercises 8 
(a) x=l1l,y=—-13 
(b) x=1Ly=1,2=2 
(a), (c) and (d) have non-trivial solutions. 
(b) and (c) have non-trivial solutions. 


(a) 7 +A-6=0 
(b) A = —3,2 


© (5)-C05) 


(a) A=3 6b) ha) 


1 1 0 
(a) A=2,3,4 (b) 1),)2],]1 
=| 1 1 
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INTRODUCTION 


Complex numbers often seem strange when first encountered but it is worth persever- 
ing with them because they provide a powerful mathematical tool for solving several 
engineering problems. One of the main applications is to the analysis of alternating cur- 
rent (a.c.) circuits. Engineers are very interested in these because the mains supply is 
itself a.c., and electricity generation and transportation are dominated by a.c. voltages 
and currents. 

A great deal of signal analysis and processing uses mathematical models based on 
complex numbers because they allow the manipulation of sinusoidal quantities to be 
undertaken more easily. Furthermore, the design of filters to be used in communications 
equipment relies heavily on their use. 

One area of particular relevance is control engineering — so much so that control 
engineers often prefer to think of a control system in terms of a ‘complex plane’ repre- 
sentation rather than a ‘time domain’ representation. We will develop these concepts in 
this and subsequent chapters. 
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COMPLEX NUMBERS 
We have already examined quadratic equations such as 
xr —x-6=0 (9.1) 


and have met techniques for finding the roots of such equations. The formula for obtain- 
ing the roots of a quadratic equation ax” + bx + c = Ois 


—b+ JP — 4ac 
x= 


ay (9.2) 
Applying this formula to Equation (9.1), we find 
van PE VED = 4-6) 
2 
1+ /25 
= 
_ 1 a 5 
— = 
so that x = 3 and x = —2 are the two roots. However, if we try to apply the formula to 


the equation 
2x +2x+5=0 


we find 


—2+ /-36 
x SS 
4 


A problem now arises in that we need to find the square root of a negative number. We 
know from experience that squaring both positive and negative numbers yields a positive 
result; thus, 


6° = 36 and (—6)* = 36 


so that there is no real number whose square is —36. In the general case, if 
ax’ + bx +c = 0, we see by examining the square root in Equation (9.2) that this prob- 
lem will always arise whenever b* — 4ac < 0. Nevertheless, it turns out to be very useful 
to invent a technique for dealing with such situations, leading to the theory of complex 
numbers. 

To make progress we introduce a number, denoted j, with the property that 


peel 


We have already seen that using the real number system we cannot obtain a negative 
number by squaring a real number so the number j is not real — we say it is imaginary. 
This imaginary number has a very useful role to play in engineering mathematics. Using 
it we can now formally write down an expression for the square root of any negative 
number. Thus, 


/—36 = /36 x (-1) 
= /36 xj? 


= 6j 
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Example 9.1 


Solution 


Returning to the solution of the quadratic equation 2x” + 2x + 5 = 0, we find 


_ —2+ /-36 
> 4 
_ —2+6j 
~~ 4 
7 -1+3j 
=~ 5 
1 3, 1 3, 
We have found two roots, namely x = = + 5 jandx = — 5 _ 5 j. These numbers are 


called complex numbers and we see that they are made up of two parts — a real part and 


an imaginary part. For the first complex number the real part is — 5 and the imaginary 
3 
part is i For the second complex number the real part is — 5 and the imaginary part is 


== In a more general case we usually use the letter z to denote a complex number with 


real part a and imaginary part b, so z = a+ bj. We write a = Re(z) and b = Im(z), and 
denote the set of all complex numbers by C. Note that a, b € IR whereas z € C. 


Z=a+bj, zis a member of the set of complex numbers, that is z € C 
a = Re(z) b = Im(z) 


Complex numbers which have a zero imaginary part are purely real and hence all real 
numbers are also complex numbers, that is R Cc C. 


Solve the quadratic equation 2s” — 3s +7 = 0. 


Using the formula for solving a quadratic equation we find 
._ =D ay(=3" —4@G) 
: 2(2) 
3 ae A 7 
a 
3+ S47; 


4 
= 0.75 + 1.7]j 


Using the fact that j? = —1 we can develop other quantities. 


Example 9.2 Simplify the expression j’. 


Solution 


We have 

pap xj 
(-1) xj 
=-j 


9.2.1 


Example 9.3 


Solution 


Example 9.4 


Solution 
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The complex conjugate 


If z= a+ bj, we define its complex conjugate to be the number z = a — bj; that is, we 
change the sign of the imaginary part. 


Write down the complex conjugates of 
(a) -7+j (BF) 6-Sh 6 WJ 


To find the complex conjugates of the given numbers we change the sign of the imaginary 
parts. A purely real number has an imaginary part 0. We find 


(a) -—7-j (b) 6+ 5j (c) 6, there is no imaginary part to alter (d) —j 


We recall that the solution of the quadratic equation 2x? + 2x + 5 = 0 yielded the two 


1 a 3. d 

an 
Ae 8 8 
pair. This illustrates a more general result: 


complex numbers j, and note that these form a complex conjugate 


When the polynomial equation P(x) = 0 has real coefficients, any complex roots 
will always occur in complex conjugate pairs. 


Consider the following example. 


Show that the equation x? — 7x? + 19x — 13 = 0 has a root at x = 1 and find the other 
roots. 


If we let P(x) = x° — 7x* + 19x — 13, then P(1) = 1 —-7+ 19 — 13 = 00 that x = 1 
is aroot. This means that x — | must be a factor of P(x) and so we can express P(x) in 
the form 


P(x) =x — 7x +: 19x — 13 = (x— 1) (ax + Bxty) 
= ax? + (B—a)x’ + (y — B)x—y 
where a, 6 and y are coefficients to be determined. Comparing the coefficients of x° we 


find @ = 1. Comparing the constant coefficients we find y = 13. Finally, comparing 
coefficients of x we find 6 = —6, and hence 


P(x) = 2° — 7x? + 19x — 13 = (x — 1)? — 6 + 13) 


The other two roots of P(x) = 0 are found by solving the quadratic equation 

x — 6x + 13 = 0, that is 

624/36=52  624/—16 
2 7 2 

and again we note that the complex roots occur as a complex conjugate pair. This illus- 

trates the general result given in Section 1.4 that an nth-degree polynomial has n roots. 


P 


=3+42j 
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1 


EXERCISES 9.2 
Solve the following equations: 
(a)x2+1=0 (b) x7 +4=0 
(c) 3x7 +7 =0 (d) x7 +x+1=0 


= i2=0 
6) = = — 
2 


(f) —x? —3x-4=0 
(g) 2x? +3x+3=0 
(h)x* +3x+4=0 
Solve the cubic equation 
3x3 — 11x? + 16x — 12 =0 
given that one of the roots is x = 2. 


Write down the complex conjugates of the following 
complex numbers. 


1 
(@) -11-8 )5+3) © 5) @-17 


Solutions 
(a) +j (b) +2) 
(c) t/773j (d) —1/2 + (V3/2)j 
(e:) 14 ¥3j (f) -3/2 + (V7/2)j 


(g) -3/4 + (V15/4)j (h) —3/2  (V7/2)j 


2, 5/6 + (V47/6)j 

1 
(a) —114 8) (b) 5—3j (c) —55 
(d) —17 (e) cos wt —jsin wt 


(f) cos @t+jsinet (g) 0.333j+1 


(e) coswt +jsin wt (f) cos wt — j sin wt 


(g) —0.333j +1 


Recall from Chapter 2 that the poles of a rational 
function R(x) = P(x)/Q(x) are those values of x for 
which Q(x) = 0. Find any poles of 


3x 3 


x 
oy © Perel 


3 e+ 
Solve the equation se +2s+5=0. 
Express as a complex number 
air OF OF 


State Re(z) and Im(z) where 


()c=74+1lj — (b) <=-6 +) 

()z=0 @ z= a 

2=i (e) z=}? 

@x=3 ()x=4j 

(c)x = 1/2 + (V/3/2)j 

s=-149j 

@1 ®j ©-l 

(a) 7, 11 (b) —6,1 (c) 0,0 
1 1 

(d) oy FI (e) 0, 1 (f) —1,0 


OPERATIONS WITH COMPLEX NUMBERS 


Two complex numbers are equal if and only if their real parts are equal and their imagi- 


nary parts are equal. 


Example 9.5 Find x and y so that x + 6j and 3 — yj represent the same complex number. 


Solution 
x+ 6) =3—yj 


If both quantities represent the same complex number we have 


Since the real parts must be equal we can equate them, that is 


+= 3 


9.3.1 


Example 9.6 


Solution 


9.3.2 


Example 9.7 


Solution 


Example 9.8 


Solution 
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Similarly, we find, by equating imaginary parts 
6=-y 
so that y = —6. 


The operations of addition, subtraction, multiplication and division can all be performed 
on complex numbers. 


Addition and subtraction 


To add two complex numbers we simply add the real parts and add the imaginary parts; 
to subtract a complex number from another we subtract the corresponding real parts and 
subtract the corresponding imaginary parts as shown in Example 9.6. 


If z; = 3 —4j and z, = 4 + 2j find z, + z, and z, — z,. 


z+2z,=(3—-4j)) + (442)) 
=@G+4)4 (4-2) 
=7—2j 

Z-%=(3—-—4)) — (44+ 2)) 
=(3-4) + (-4—2)j 
=-1-6j 


Multiplication 


We can multiply a complex number by a real number. Both the real and imaginary parts 
of the complex number are multiplied by the real number. Thus 3(4 — 6j) = 12 — 18). 
To multiply two complex numbers we use the fact that j? = —1. 


Ifz, = 2 —2jandz, =3 + 4j, find z,z). 


2% = (2—2j)(3 +45) 
Removing brackets we find 


21% = 6 — 6j + 8j — Bj” 
=6-—6+8)+8 using j* = —1 
= 1442) 


If z = 3 — 2j find zz. 


If z = 3 — 2j then its conjugate is z = 3 + 2j. Therefore, 
wz = (3 — 2j)(3 + 2j) 


=9-6j)+6j-4/ 
=9-4j* 
S19 


We see that the answer is a real number. 
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Whenever we multiply a complex number by its conjugate the answer is a real number. 
Thus if z= a+ bj 
z= (a+ bj) (a — bj) 
=a + baj — abj — b’j* 
= a he be 


Ifz=a+bjthenzz=a?4+b* 


9.3.3 Division 


To divide two complex numbers it is necessary to make use of the complex conjugate. 
We multiply both the numerator and denominator by the conjugate of the denominator 
and then simplify the result. 


Example 9.9 Ifz, =2+9jandz, =5 —2j find —. 
a) 


; 24+ 9j : . ; ; . 
Solution We seek I. The complex conjugate of the denominator is 5 + 2j, so we multiply 


both numerator and denominator by this quantity. The effect of this is to leave the value 


of ai unaltered since we have only multiplied by 1. Therefore, 
oo 


z 249) — (24+9j) 5 +2)) 
% S-™y GO-2j)G+2)j) 


10 +.45j + 4j + 18)? —8 +494 


2544 29 
_ 849 
= ~39 T 99) 


The multiplication of two conjugates in the denominator allows a useful simplification. 
We see that the effect of multiplying by the conjugate of the denominator is to make the 
denominator of the solution purely real. 


If z, =x, +jy, and z, = x, +jy, then the quotient “1 is found by multiplying both 
co) 
numerator and denominator by the conjugate of the denominator, that is 
4 ati 
% y+ 2 


x, +; 52 x, — Jr 

Xy+j¥. %— Jo 

— 1% 1 V2 + jy, — X12) 
x3 +Y3 


1 


1 


2 


1 


EXERCISES 9.3 

If z; = 3+ 2j and z, = 4 — 8j find 
(a) zy + Zo (b) 21 — 2 (c) Za — 2 
Express the following in the form a + bj: 
(a) b) = 
a) —— = 

1+j J 
(c) ! + (d) ! 
c) -+>— ae 

j 2-j I+j 

3 

sae a 
Express the following in the form a + bj: 

2 —2 + 3) 
(a) (b) =" 

1-j J 

Solutions 

(a) 7 — 6j (b) —1 + 10) (c) 1 — 10) 
(a) 2 7 (b) 2j (© 2 ss 
a, ’ Pra ge 
(a) tc go 

ee e) 2 

ee, 26 
(a) 1+j (b) 3+ 2j (c) 6+ 12) 

1 

(d) 474+ 32) (e) 2— 5) 


Technical Computing Exercises 9.3 


Many technical computing languages allow the user 
to input and manipulate complex numbers. Investigate 
how the complex number a + Jj is input to the 
software to which you have access. 


Use technical computing software to simplify the 
following complex numbers: 


(a) (1+jP (b) 
+ 7 
G+iP @-3)) 

Solve the polynomial equations 

(a) + 7x? + 9x+63 =0 

(b) x44 15x? — 16 =0 

You may recall from Chapter 1 that in MATLAB® the 
roots function can be used to solve polynomial 


equations with real roots. This function can also 
calculate complex roots. 


= 
(1 — 2j)7 
(c) 
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(c) 3)(4 — 2j) 
5 +3; 
Va 


(b) (7 — 2j)(5 + 6j) 


Find a quadratic equation whose roots are 1 — 3j and 
1+ 3j. 


If (x + jy)? = 3 + 4j, find x and y, where x, y € R. 


Find the real and imaginary parts of 


(b) jt -7? 


x2 —2x+10=0 


x=2,y=1;x=-2,y=-1 


62 61 
(a) 35 85 
(b) 1, —-1 
(c) 0,0 

1 
(d) 07 


Use a package to find the real and imaginary parts of 
15+j 
75) 


A common requirement in control theory is to find the 
poles of a rational function. Use a package to find the 
poles of the function 

0.5 
53 + 2s? + 0.28 + 0.0556 
To solve this problem automatically may require the 
use of specialist toolboxes in MATLAB® or GNU 
Octave which are not part of the core program. For 
example, the Signal Processing toolbox in 
MATLAB® has the function tf 2zp which converts a 
transfer function expressed as a ratio of polynomials 
to pole/zero form. 


G(s) = 
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zy = 345} 


ww 


n 
4 
x 
aS 
e 
a4 
& 
ob 
sS 
= 
4 


gy ; = Real axis 
> at bj ; 
gb }------- 9 (a, b) 3 = —1-2j e-2 
5p ; 3 
E | ; 
4 z= 4 
Real axis -5 
Figure 9.1 Figure 9.2 
Argand diagram. Argand diagram for Example 9.10. 


GRAPHICAL REPRESENTATION OF COMPLEX NUMBERS 


Given a complex number z = a + bj we can obtain a useful graphical interpretation of 
it by plotting the real part on the horizontal axis and the imaginary part on the vertical 
axis and obtain a unique point in the x—y plane (Figure 9.1). We call the x axis the real 
axis and the y axis the imaginary axis, and the whole picture an Argand diagram. In 
this context, the x—y plane is often referred to as the complex plane. 


Example 9.10 Plot the complex numbers z, = 7 + 2j, z, = —3+5j, z, = —1—2j and z, = —4j onan 
Argand diagram. 


Solution The Argand diagram is shown in Figure 9.2. 


EXERCISES 9.4 
1 Plot the following complex numbers on an Argand 2 (a) Plot the complex number z = 1 + j on an Argand 
diagram: diagram. 
(a) z;=—-3 -3j (b) Simplify the complex number j(1 + j) and plot 
(b) z =7+2j the result on your Argand diagram. Observe that 
the effect of multiplying the complex number by j 
i) aa is to rotate the complex number through an angle 


(d) w= —0.5j of 7t/2 radians anticlockwise about the origin. 
(e) 75 = —2 


Solutions 


1. See Figure S.17. 


Imaginary 


Figure S.17 
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2 (a) See Figure S.18.  (b) j1+j) =—-1+4+j 


Imaginary 


ale 1 e iti 


Figure S.18 


POLAR FORM OF A COMPLEX NUMBER 


It is often useful to exchange Cartesian coordinates (a, b) for polar coordinates 7 and 
6 as depicted in Figure 9.3. 
From Figure 9.3 we note that 


a : 
cos@ = — sind = — 
r r 
and so, 
a=rcos0 b=rsin0é 
Furthermore, 
b 
tan@d = — 
a 


Using Pythagoras’s theorem we obtain r = Ja? + b?. By finding r and 6 we can express 
the complex number z = a + bj in polar form as 


Z=rcosé +jrsinéd = r(cos@ +jsing) 


which we often abbreviate to z = rZ@. Clearly, r is the ‘distance’ of the point (a, b) from 
the origin and is called the modulus of the complex number z. The modulus is always 
a non-negative number and is denoted |z|. The angle is conventionally measured from 
the positive x axis. Angles measured in an anticlockwise sense are regarded as positive 
while those measured in a clockwise sense are regarded as negative. The angle @ is called 
the argument of z, denoted arg(z). Since adding or subtracting multiples of 27t from 6 
will result in the ‘arm’ in Figure 9.3 being in the same position, the argument can have 
many values. Usually we shall choose @ to satisfy —7 < 6 < 7. 


Cartesian form: z = a + bj 
Polar form: z = r(cos@ +jsin@) = rZ@ 


{l=r=Ve+P 


b 
a=rcos@ b=rsin0o tania 
a 
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Figure 9.3 Figure 9.4 
Polar and Cartesian forms of a complex Argand diagram depicting z = | —jin 
number. Example 9.11. 

Note that 


r/(—0) = r(cos(—0@) +jsin(—@)) 
r(cos@ —jsin@) 


=Z 


Ifz=a+)j then z = a — bj andz=rZ(-8@). 


Example 9.11 Depict the complex number z = 1 — j on an Argand diagram and convert it into polar 
form. 


Solution The real part of z is 1 and the imaginary part is —1. We therefore plot a point in the x—-y 
plane with x = | and y = —1 as shown in Figure 9.4. 
From Figure 9.4 we see that r = 12 + (—1)? = J/2and@ = —45° or —71/4 radians. 
Therefore z = 1 —j = V2Z(—7/4). 


To express a complex number in polar form it is essential to draw an Argand diagram 
and not simply quote formulae, as the following example will show. 
Example 9.12 Express z = —1 —j in polar form. 


Solution If we use the formula |z| = r = Va? +2”, we find that r = 2. Using tan@ = b/a, we 
find that tan@ = —1/— 1 = 1 so that you may be tempted to take 0 = 71/4. Figure 9.5 


shows the Argand diagram and it is clear that 9 = —37t/4. Therefore, z = —1 —j = 
/2L —37/4, and we see the importance of drawing an Argand diagram. 
y A 


Figure 9.5 
Argand diagram depicting z = —1 — j in Example 9.12. 


1 
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9.5.1 Multiplication and division in polar form 
The polar form may seem more complicated than the Cartesian form but it is often more 
useful. For example, suppose we want to multiply the complex numbers 
Z, =r, (cos, +jsin6,) and 2, =1r,(cos 6, +jsin,) 
We find 
2% =r, (cos 6, +jsin@, )r, (cos, + jsin6,) 
= r,Iy{(Cos 6, cos 6, — sin @, sin6,) + j(sin @, cos @, + sin @, cos 6, )} 
which can be written as 
r,ry{cos(6, + 8,) +jsin(O, + 6,)} 
using the trigonometric identities of Section 3.6. This is a new complex number which, 
if we compare with the general form r(cos 6 +j sin 6), we see has a modulus of r,r, and 
an argument of 6, + 6,. To summarize: to multiply two complex numbers we multiply 
their moduli and add their arguments, that is 
%y%y = LO, + ) 
Example 9.13 If z,; = 3427/3 and z, = 4477/6 find z,z,. 
Solution Multiplying the moduli we find r,r, = 12, and adding the arguments we find 0, + 6, = 
m/2. Therefore z,;z, = 1227/2. 
A similar development shows that to divide two complex numbers we divide their moduli 
and subtract their arguments, that is 
Si 
= 2 =4(@, =) 
ae 2 
Example 9.14 If z, = 37/3 and z, = 4277/6 find z, /z). 
Solution Dividing the respective moduli, we find r,/r, = 3/4 and subtracting the arguments, 
m/3 — 7/6 = 7/6. Hence z,/z, = 0.752 7/6. 
EXERCISES 9.5 


Mark on an Argand diagram points representing 
z, =3-2,% =i =i, 24 = —2 —4j and 
Zs = 3. Find the modulus and argument of each 

complex number. 


Express the following complex numbers in polar 
form: 


(a) 3-j (BF) 2 ©) -j (d) —5 +12) 


Find the modulus and argument of (a) z; = —-J/3+ j 
and (b) z) = 4+ 4j. Hence express z,z, and z;/z, in 
polar form. 


Express /2/.71/4, 2/.71/6 and 2Z_ —7/6 in Cartesian 
form a + bj. 


<1 rl 
Prove the result — = —/6, — 6). 
2% "2 
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1 
Express z = OC where w and C are real constants, 
jo 


in the form a + Jj. Plot z on an Argand diagram. 


If z; = 4(cos 40° + j sin 40°) and z, = 3(cos 70°+ 
jsin 70°), express z,Z, and z, /Z) in polar form. 


Solutions 


Iz;| = 13, arg(z,) = —0.5880 
[Zy| = 1, arg(z)) = —7/2 

lz3| = Larg(z3) =7 

Iz4| = V20, arg(z,) = —2.0344 
[z5| = 3, arg(zs) = 0 
(a) /10Z —0.3218 
(c) 1L —n/2 

(a) 2, 57/6 


(b) 4/2, 1/4 


(b) 220 
(d) 13/.1.9656 


8 Simplify 


(V2 (510/4))* (22 (—n/3)) 
2/ (—7/6) 


24%) = 8V2/ 130/12 
2 /% = 2 samt? 
4 1+j,V3+j,v3-j 
a 
7) %4% = 12(cos 110° + jsin 110°) 
2/2 = (cos 30° — jsin 30°) 


8 4 


VECTORS AND COMPLEX NUMBERS 


It is often convenient to represent complex numbers by vectors in the x—y plane. Fig- 
ure 9.6(a) shows the complex number z = a + jb. Figure 9.6(b) shows the equivalent 


vector. Figure 9.7 shows the complex numbers z, = 2 + j and z, = | + 3). 


If we now evaluate z, = z, + z, we find z,; = 3 + 4j which is also shown. If we 
form a parallelogram, two sides of which are the representations of z, and z,, we find 
that z, is the diagonal of the parallelogram. If we regard z, and z, as vectors in the plane 
we see that there is a direct analogy between the triangle law of vector addition (see 


Section 7.2.3) and the addition of complex numbers. 


(a) (b) 
Figure 9.6 


The complex number z = a + jb and its equivalent vector. 


0 1 2 3 4 x 


Figure 9.7 
Vector addition in the complex plane. 


Example 9.15 


Solution 
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FPnNw hn = 


O123 45 67 8 9 10 x 


Figure 9.8 Figure 9.9 
Vector addition and subtraction. Diagram for Example 9.15. 


More generally, if z, and z, are any complex numbers represented on an Argand 
diagram by the vectors OA and OB (Figure 9.8) then upon completing the parallel- 
ogram OBCA, the sum z, + z, is represented by the vector OC. We can also obtain 
a representation of the difference of two complex numbers in the following way. If 
we write 


£3 = 2) — Zp 
= ral + (—z,) 


and note that if z, is represented by the vector OB, then —Z, is represented by the vector 
—OB = BO = CA. The complex number z, is represented by OA = BC, so that 
z+ (-%) = BC + CA = BA. Thus the difference Z, — Z, is represented by the 
diagonal BA. To summarize, the sum and difference of z, and z, are represented by 
the two diagonals of the parallelogram OBCA. 


Represent z, = 6+ Jj and z, = 3 + 4j, and their sum and difference, on an Argand 
diagram. 


We draw vectors OA and OB representing z, and z,, respectively (Figure 9.9). Then we 
complete the parallelogram OACB as shown. The sum z, +z, is then represented by OC 


and the difference z, — z, by BA. It is easy to see that the vector BA = (3) that is it 


represents the complex number 3 — 3j, while OC = (5) , that is it represents 9 + 5j, so 
that z, +z, =9+5jandz, —z, =3 — 3j. 


THE EXPONENTIAL FORM OF A COMPLEX NUMBER 


You will recall from Chapter 6 that many functions possess a power series expansion, 
that is the function can be expressed as the sum of a sequence of terms involving integer 
powers of x. For example, 


2 x3 


x Xx 
e =Itet a tat 
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and this representation is valid for any real value of x. The expression on the r.h.s. is, of 
course, an infinite sum but its terms get smaller and smaller, and as more are included, 
the sum we obtain approaches e*. Other examples of power series include 


Pe 
Se ee es (9.3) 
and 
ext 
ee YS a (9.4) 


which are also valid for any real value of x. It is useful to extend the range of applicability 
of these power series by allowing x to be a complex number. That is, we define the 
function e* to be 


2: 3 


| z 
C= TZF a Tap te 


and theory beyond the scope of this book can be used to show that this representation is 
valid for all complex numbers z. 
We have already seen that we can express a complex number in polar form: 


z=r(cos@ +jsin@) 


Using Equations (9.3) and (9.4) we can write 


eg oe 


ee er ae re ae 
= ot a a 


Furthermore, we note that e!” can be written as 


, 1202 763 
jo : J J 
e Se ae + SP =P 
6? oe 
Se 5, ag 


so that 

z=r(cos@é +jsin@) = rel? 
This is yet another form of the same complex number which we call the exponential 
form. We see that 


e” = cos@ +jsind (9.5) 


It is straightforward to show that 


e/* = cos@ —jsind 
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Therefore, if Z = r(cos 6 —j sin@) we can equivalently write z = re~/’. The two expres- 
sions for e/* and e~!? are known as Euler’s relations. From these it is easy to obtain the 
following useful results: 


el? +e? el? —e 
cos 9 = —_—_ sin@ = —___ 
2) 2j 


Example 9.16 We saw in Section 3.7 that a waveform can be written in the form f(t) = Acos(wt+@). 
Consider the complex number e!@’**), We can use Euler’s relations to write 


J+) — cos(wt + b) +jsin(wt + o) 
and hence, 


f(t) =A Re(e@'*9) 


EXERCISES 9.7 


1. Find the modulus and argument of 5 Express z = e! /? in the form a + bj. 


jm/4 —j7t/6 jm/3 ; 
a () 25 te) Te 6 Express —1 — jin the form re!®. 


2. Find the real and imaginary parts of 
(a) 5ei77/3 (b) el277/3 
(c) llei™ (d) 2e7i7 


7 Express 
(a) 7+ 5j and 
1 1 
aye : : (b) = — =j in exponential form. 
3 Express z = 6(cos 30° + j sin 30°) in exponential 2. 3) 
form. Plot z on an Argand diagram and find its real 8 


: : Express z; = 1 —j and 
and imaginary parts. 


1l+j. ‘ jo 
4 Ifo,,T €R, find the real and imaginary parts of a 34 in the form re”. 
e(otio)T J 
Solutions 
1 (a) 3,7/4 (b) 2, —1/6 (c) 7, 1/3 5 ej 
2 (a) 2.5,4.3301 (b) —0.5,0.8660 (c) —11,0 6 V2e'-37/4)j 
(d) —2,0 


7 (a) V74e2-8 (b) 0.60 e~ 99% 


3 6e'7/9)i, Re(z) = 5.1962, Im(z) = 3 i 
B V2e(-7/4)), —¢ 67/12) 
J2 


4& Real part: e?? cos wT, imaginary part: e?? sin wT 
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PHASORS 


Electrical engineers are often interested in analysing circuits in which there is an a.c. 
power supply. Almost invariably the supply waveform is sinusoidal and the resulting 
currents and voltages within the circuit are also sinusoidal. For example, a typical voltage 
is of the form 


u(t) = Vcos(@t + 6) = Vcos(27ft + o) (9.6) 


where V is the maximum or peak value, is the angular frequency, f is the frequency, 
and @ is the phase relative to some reference waveform. This is known as the time 
domain representation. Each of the voltages and currents in the circuit has the same 
frequency as the supply but differs in magnitude and phase. 

In order to analyse such circuits it is necessary to add, subtract, multiply and di- 
vide these waveforms. If the time domain representation is used then the mathematics 
becomes extremely tedious. An alternative approach is to introduce a waveform repre- 
sentation known as a phasor. A phasor is an entity consisting of two distinct parts: a 
magnitude and an angle. It is possible to represent a phasor by a complex number in 
polar form. The fixed magnitude of this complex number corresponds to the magnitude 
of the phasor and hence the amplitude of the waveform. The argument of this complex 
number, ¢, corresponds to the angle of the phasor and hence the phase angle of the 
waveform. Figure 9.10 shows a phasor for the sinusoidal waveform of Equation (9.6). 

The time dependency of the waveform is catered for by rotating the phasor anti- 
clockwise at an angular frequency, w. The projection of the phasor onto the real axis 
gives the instantaneous value of the waveform. However, the main interest of an engi- 
neer is in the phase relationships between the various sinusoids. Therefore the phasors 
are ‘frozen’ at a certain point in time. This may be chosen so that t = 0 or it may be cho- 
sen so that a convenient phasor, known as the reference phasor, aligns with the positive 
real axis. This approach is valid because the phase and magnitude relationships between 
the various phasors remain the same at all points in time once a circuit has recovered 
from any initial transients caused by switching. 

Some textbooks refer to phasors as vectors. This can lead to confusion as it is possible 
to divide phasors whereas division of vectors by other vectors is not defined. In practice 
this is not a problem as phasors, although thought of as vectors, are manipulated as 
complex numbers, which can be divided. We will avoid these conceptual difficulties by 
introducing a different notation. We will denote a phasor by V, which corresponds to 


V /¢ in complex number notation (see Figure 9.10). Thus, for example, a current i(t) = 
Icos(@t+@) would be written J in phasor notation and JZ ¢ in complex number notation. 


Many engineers use the root mean square (r.m.s.) value of a sinusoid as the mag- 
nitude of a phasor. The justification for this is that it represents the value of a d.c. sig- 
nal that would dissipate the same amount of power in a resistor as the sinusoid. For 


Figure 9.10 . 
Illustration of the phasor V = VZ@ where w = angular 
frequency with which the phasor rotates. 


9.8.1 


9.8.2 


9.8.3 
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example, in the case of a current signal, J?,.R is the average power dissipated by the 
sinusoid J cos(wt + @) in a resistor R. For the case of a sinusoidal signal the r.m.s. value 
of the signal is 1/./2 times the peak value of the signal (see Section 15.3, Example 15.4, 
for a proof of this). We will not adopt this approach but it is a common one. 

We start by examining the phasor representation of individual circuit elements. In 
order to do this we need a phasor form of Ohm’s law. This is 


V=1z (9.7) 


where V is the voltage phasor, J is the current phasor and Z is the impedance of an 
element or group of elements and may be a complex quantity. Note that phasors and 
complex numbers are mixed together in the same equation. This is a common practice 
because phasors are usually manipulated as complex numbers. 


Resistor 


Experimentally it can be shown that if an a.c. voltage is applied to a resistor then the 
current is in phase with the voltage. The ratio of the magnitude of the two waveforms is 
equal to the resistance, R. So, given J = [Z0, Z = RZO, using Equation (9.7) we have 
V = /RZO. This is illustrated in Figure 9.11. 


Inductor 


For an inductor we know from experiment that the voltage leads the current by a phase of 
m/2, and so the phase angle of the impedance is 71/2. We also know that the magnitude 
of the impedance is given by wL. So, given] = 120, Z = wL/ 7/2, using Equation (9.7) 
we have V = IJwL 7/2. An alternative way of representing Z for an inductor is to use 
the Cartesian form, that is 


; ™ m™ 
Z = ole! = wL| cos — +jsin — 
2 2 
= joL 


This is useful when phasors need to be added and subtracted. The phasor diagram for an 
inductor is illustrated in Figure 9.12. 


Capacitor 


For a capacitor it is known that the voltage lags the current by a phase of 7/2 and the 


magnitude of the impedance is given by —. So given 1=120,Z = —L —m/2, we 
@C oC 


Figure 9.11 Figure 9.12 
Phasor diagram for a resistor. Phasor diagram for an inductor. 
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Figure 9.13 


Phasor diagram for a capacitor. RLC circuit. Phasor diagram for the circuit in 
Figure 9.14. 


~ I 
have, using Equation (9.7), V = a —m/2. Alternatively, 
@ 


jn? 1 T Tm 
Z= = cos jsin 
oC wC 2 2 


areal 
aC 
Engineers often prefer to rewrite this last expression as 
1 
joC 


The phasor diagram for the capacitor is illustrated in Figure 9.13. 

We have shown how phasors can be multiplied by a complex number; division is very 
similar. Addition of phasors will now be illustrated; subtraction is similar. Consider the 
circuit shown in Figure 9.14 in which a resistor, capacitor and inductor are connected 
in series and fed by an a.c. source. As this is a series circuit, the current through each 
element, /, is the same by Kirchhoff’s current law. By Kirchhoff’s voltage law the voltage 
rise produced by the supply, V;, must equal the sum of the voltage drops across the 
elements. Therefore, 


Ve = Ver Ver Ve. 
Note that this is an addition of phasors so that the voltage drops across the elements 
do not necessarily have the same phase. The phasor diagram for the circuit is shown in 
Figure 9.15, in this case with |V,| > |V,|; / is the reference phasor. 

Note that the phasor addition of the element voltages gives the overall supply voltage 
for a particular supply current. If the magnitude of these element voltage phasors is 
known then it is possible to calculate the supply voltage graphically. In practice it is 
easier to convert the polar form of the phasors into Cartesian form and use algebra to 


analyse the circuit. 
Now, 


Figure 9.14 Figure 9.15 
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Therefore, 


Vo =VatVotV, 


ae I 
= 1R + IjoL + — 
jac 


~ 1 
=1(|R+joL + — 
( : joc ) 
it 
Therefore the impedance of the circuit is Z = R+ joL + OC We can calculate the 
a) 
frequency for which the impedance of the circuit has minimum magnitude: 
1 
Z=R+joL+ — 
joc 
J 
=R L— — 
van oC 


1 
=R+jl aL -— — 
+i(2 a) 

Now 


1 2 
Z| =./R2 EL —- — 
a i +(0 sc) 


and so, as @ varies |Z| is a minimum when 


1 
ol — —=0 
aC 
pa 
LC 
1 
o=,/— 
LC 


This minimum value is |Z| = R. Examining Figure 9.15 it is clear that the minimum 
impedance occurs when V, and V,, have the same magnitude, in which case Vx, has no 
imaginary component. The frequency at which this occurs is known as the resonant 
frequency of the circuit. 


Engineering application 9.1 


The Poynting vector 


An electromagnetic wave freely travelling in space has electric and magnetic field 
components which oscillate at right angles to each other and to the direction of prop- 
agation. This type of wave is known as a transverse wave. Figure 9.16 illustrates 
an electromagnetic wave travelling in free space. The Poynting vector is used to de- 
scribe the energy flux associated with an electromagnetic wave. It has units of W m7? 
and is a power per unit area, that is a power density. If the total power associated with 
an electromagnetic wave front is required then the Poynting vector can be integrated 
over an area of interest. 
The Poynting vector, S, for electric field, E, and magnetic field, H, is defined as 


S=ExH 
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Electric field 


vo 


Magnetic field 


Figure 9.16 
An electromagnetic wave travelling in free space. 


where E and H are vector quantities. Note that S is also a vector quantity as this 
expression is a vector product. 

It is possible to specify magnitude and phase of the field quantities using com- 
plex numbers; that is, in terms of phasors. Consider the case when E and H are the 
following: 


E = (2+ j0.5)i + Oj + 0k 

H = 0i + (3.5 + j0.25)j + 0k 
Here Cartesian coordinates have been used to define the two field quantities and iis a 
unit vector in the x direction, j is a unit vector in the y direction and k is a unit vector 
in the z direction. Examining these two terms we note that the electric field is aligned 


in the x direction and the magnetic field is aligned in the y direction. 
Calculating the Poynting vector we obtain 


i j k 
S=—E x= 1205 0 0 
0 3.541025 0 


= Oi — Oj + (2 + j0.5)(3.5 + j0.25)k = (6.875 + j2.25)k 


We see that the only non-zero component is aligned in the z direction. This is consis- 
tent with our understanding of a transverse wave in that the direction of propagation 
is at right angles to the two field components. This implies that the energy flow of 
the electromagnetic wave is in the z direction. 


F 9.9 | DE MOIVRE’S THEOREM 


A very important result in complex number theory is De Moivre’s theorem which states 
that ifn € N, 


(cos @ +jsin@)" = cosné + jsinné (9.8) 


Example 9.17 Verify De Moivre’s theorem when n = 1 andn = 2. 
Solution Whenn = 1, the theorem states: 


(cos@ +jsin@)' = cos 16 +jsin 10 


Example 9.18 


Solution 
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which is clearly true, and the theorem holds. When n = 2, we find 


(cos@ +j sin@)” = (cos @ +jsin@)(cos@ +jsin@) 
= cos’ 6 +jsin@ cos@ +jcosé@ sin +j* sin’ 6 
= cos’ @ — sin’ 6 + j(2sin@ cos @) 


Recalling the trigonometric identities cos 20 = cos? 6 — sin’ @ andsin 20 = 2 sin6 cos 6, 
we can write the previous expression as 


cos 26 + j sin 26 
Therefore, 
(cos @ +jsin@)* = cos 20 + jsin 26 


and De Moivre’s theorem has been verified when n = 2. 


The theorem also holds when n is a rational number, that is n = p/q where p and gq are 
integers. Thus we have 


(cos@ +jsin@)?/4 = cos eG + jsin Le 
q q 


In this form it can be used to obtain roots of complex numbers. For example, 


1 1 
/cos@ +jsin@ = (cos6 +jsin@)'/? = cos 5" + jsin a 


In such a case the expression obtained is only one of the possible roots. Additional roots 
can be found as illustrated in Example 9.18. 

De Moivre’s theorem is particularly important for the solution of certain types of 
equation. 


Find all complex numbers z which satisfy 
Si (9.9) 
The solution of this equation is equivalent to finding the solutions of z = 11/3; that is, 


finding the cube roots of 1. Since we are allowing z to be complex, that is z € C, we can 
write 


z=r(cosé +jsin@) 
Then, using De Moivre’s theorem, 


2 =r (cosé +jsing)> 
= r'(cos 36 +jsin 36) 
We next convert the expression on the r.h.s. of Equation (9.9) into polar form. Figure 9.17 
shows the number | = | + 0j on an Argand diagram. 


From the Argand diagram we see that its modulus is | and its argument is 0, or pos- 
sibly +271, +47, ..., that is 2n7t where n € Z. Consequently, we can write 


1 = 1 (cos 2n7 + j sin 2n7t) neZ 
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Example 9.19 


Solution 


y 
o¢—_» 
1 x 
23 
Figure 9.17 Figure 9.18 
The complex number z = 1 + 0j. Solutions of 23 = 1. 


Using the polar form, Equation (9.9) becomes 
r (cos 30 +jsin30) = I(cos 2n7 + j sin 2n7t) 
Comparing both sides of this equation we see that 
r=1 that is r=lsincereR 
and 
36 = 2nTt that is 6 = 2ntt/3 neZ 
Apparently 6 can take infinitely many values, but, as we shall see, the corresponding 
complex numbers are simply repetitions. When n = 0, we find 6 = 0, so that 
z=z, = 1(cos0+jsin0) = 1 
is the first solution. When n = | we find 6 = 271/3, so that 


nm ., 20 1 v3 
=e = Leos 1S = ae! 


is the second solution. When n = 2 we find 6 = 47t/3, so that 


\ 4m Lisi 4n 1 ,«/3 

= Z;, = l{ cos sin = 

ns a ae 2 9g 

is the third solution. If we continue searching for solutions using larger values of n we 
find that we only repeat solutions already obtained. It is often useful to plot solutions 


on an Argand diagram and this is easily done directly from the polar form, as shown in 
Figure 9.18. We note that the solutions are equally spaced at angles of 27/3. 


Find the complex numbers z which satisfy z? = 4j. 


Since z € C we write z = r(cos 6 + jsin@). Therefore, 


Zz =r (cosé +jsind) 
= r’(cos 26 + jsin 26) 
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Figure 9.19 
Solution of z? = 4j. 


by De Moivre’s theorem. Furthermore, 4) has modulus 4 and argument 7t/2 + 2n7, 
n € Z, that is 


4j = 4{cos(7t/2 + 2n7t) + j sin(7/2 + 2n7)} neZ 
Therefore, 

r’ (cos 20 + jsin20) = 4{cos(7t/2 + 2n7t) + j sin(7t/2 + 2n7t)} 
Comparing both sides of this equation, we see that 

r=4 and so r=2 
and 

260 = 1/2 + 2nt and so 6=7/4+n7 


When n = 0 we find @ = 7/4, and whenn = | we find 6 = 57t/4. Using larger values of 
n simply repeats solutions already obtained. These solutions are shown in Figure 9.19. 
We note that in this example the solutions are equally spaced at intervals of 27t/2 = 7. 
In Cartesian form, 

2 2 2 2 


i= geo and Zy = 7 ia J2(1 +3) 


In general, the n roots of z” = a + jb are equally spaced at angles 27t/n. 

Once the technique for solving equations like those in Examples 9.18 and 9.19 has 
been mastered, engineers find it simpler to work with the abbreviated form r/6. Exam- 
ple 9.19 reworked in this fashion becomes 


Letz=r0, thn 2=r126 
Furthermore, 4) = 4/ 71/2 + 2n7, and hence if C= 4j, we have 
5 ™ 
r2£206=42 5 
from which 
Tl 


r=4 and 20 = > + 2nn 


as before. Rework Example 9.18 for yourself using this approach. 
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Example 9.20 


Solution 


Engineering application 9.2 


Characteristic impedance 


In a later application we shall explain the concept of the characteristic impedance 
of an electrical transmission line in more detail. For now, we can assume that it is an 
important electrical parameter represented by the equation 


7. [Rtiok 
° V G+joC 


where R, L, G and C are transmission line parameters and w is the angular frequency, 
all of which are real numbers. If R and G are zero, as they would be for an ideal 
transmission line, then Z, is wholly real because: 


Z= eee 
Vio = ee 


Alternatively, if G or R is included then the equation involves taking the square root 
of a complex number. 
Given values of R, G, L, C and w we can write 

R + joL 

G+joC 
in Cartesian form (see Example 9.9) and hence in polar form (see Section 9.5). 
Application of De Moivre’s Theorem will then allow us to calculate the required 
square roots to find Zp . 


Another application of De Moivre’s theorem is the derivation of trigonometric 
identities. 


Use De Moivre’s theorem to show that 
cos 30 = 4cos* @ — 3cos@ 
and 


sin30 = 3sind —4sin°6 


We know that 

(cos @ + jsin@)*> = cos 36 +jsin30 
Expanding the I.h.s. we find 

cos? 6 + 3j cos” 6 sind — 3cos@ sin? 6 —j sin’? 6 = cos 30 + jsin30 
Equating the real parts gives 

cos? 6 — 3cos@ sin? 6 = cos 30 (9.10) 
and equating the imaginary parts gives 


3 cos’ 6 sind — sin? @ = sin30 (9.11) 


Example 9.21 


Solution 
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Now, writing sin? @ = 1 — cos? @ in Equation (9.10) and cos? @ = 1 — sin? @ in Equa- 
tion (9.11), we find 


cos 36 = cos? @ — 3cos6(1 — cos” 6) 
= cos’ 6 + 3cos* @ — 3cos6 
= 4cos* @ — 3cos6 

sin 36 = 3(1 — sin’ 4) sind — sin’ @ 
= 3sin6 — 4sin° 6 


as required. 


This technique allows trigonometric functions of multiples of angles to be expressed 
in terms of powers. Sometimes we want to carry out the reverse process and express a 
power in terms of multiple angles. Consider Example 9.21. 


If z= cos@ +jsin@ show that 


1 1 
z+—-=2cosé Z—— =2jsind 
z z 
ste . 1 1 
and find similar expressions for z” + — and z” — —. 
zr zr 


Consider the complex number 
Z=cosé +jsind 
Using De Moivre’s theorem, 
-~=z! = (cosd +j sin@)~! = cos(—0) +jsin(—0) 
But cos(—@) = cos @ and sin(—@) = — sin 9, so that if z = cos@ + jsin0@ 
1 
— =cosé —jsin@ 
z 
Consequently, 
1 1 , 
z+—-=2cosé and Z—-— =2jsind 
z z 
Moreover, 
z" = cosnd + jsinnd and z" =cosné —jsinné 
so that 


1 1 
z' + — =2cosnd and =z" — — = 2jsinn0 
zt z 


| 1 
z" + — = 2cosnd and z' — — = 2jsinnd 
a ge 
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1 
Example 9.22 Show that cos? @ = 3 (608 20 + 1). 


Solution 


The formulae obtained in Example 9.21 allow us to obtain expressions for powers of 


cos 6 and sin@. Since 2cos 6 = z+ —, squaring both sides we have 
z 


iy i) 
Posrti=l(zts) =e e274 — 
z 2 


>, 1 
= eee) +2 


1 
But 2 + a= 2 cos 26, so 


2? cos? @ = 2cos 20 +2 


and therefore, 


> 1 1 
cos’ 0 = 5 eer 


as required. 


EXERCISES 9.9 


1 Express (cos 6 +jsin 0)? and (cos 0 +jsin@) 
the form cos n@ + jsinné. 


2 Use De Moivre’s theorem to simplify 


(a) (cos 36 +jsin3@)(cos 46 + j sin 46) 
cos 86 + j sin 80 


(0) ean ee A 
cos 20 — jsin 20 
3. Solve the equations 


(a) 2+1=0 
(b) A =14j 


4 Find ./2 + 2j and display your solutions on an 
Argand diagram. 


5 Prove the following trigonometric identities: 


(a) cos49 = 8 cos* @ — 8cos?6 + 1 
Solutions 


1 cos9@ +jsin 96, cos(@/2) + jsin(6/2) 


2 (a) cos 76 +jsin 70 
(b) cos 106 +j sin 100 


2 


1 
Pi aa +1) 


1/2 in 


11 
12 


(b) 32sin°@ = 10 — 15cos 20 + 6 cos 40 — cos 60 


Solve the equation +25 =0. 


Find the fifth roots of j and depict your solutions on 
an Argand diagram. 


1 
Show that cos? 9 = 7 (cos 30 + 3cos@). 


1 
Show that sin* 6 = g (00S 40 — 4cos 26 + 3). 


Express cos 56 in terms of powers of cos 0. 
Express sin 5@ in terms of powers of sin 6. 


Given e/? = cos @ +jsin 9, prove De Moivre’s 
theorem in the form 


(e!?)" = cos no + jsinné 


(a) 1Zn/3+2nn/3 n=0,1,2 
(b) 2'/8/ n/164+nn/2  n=0,1,2,3 
8'/6/ m/124+2nn/3 n=0,1,2 


6 V5Ln/4+nn/2 
7 1Z£1/10 + 2nn/5 
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n=0,1,2,3 10 16cos° 6 — 20cos?@ + 5cos6 


n=0,1,2,3,4 11 16sin° 6 — 20sin? 6 + 5sin@ 


LOCI AND REGIONS OF THE COMPLEX PLANE 


Regions of the complex plane can often be conveniently described by means of complex 
numbers. For example, the points that lie on a circle of radius 2 centred at the origin 
(Figure 9.20) represent complex numbers all of which have a modulus of 2. The argu- 
ments are any value of 9, —7t < 6 < 7. We can describe all the points on this circle by 
the simple expression 
Iz] =2 

that is, all complex numbers with modulus 2. We say that the locus (or path) of the point 
zis a circle, radius 2, centred at the origin. The interior of the circle is described by 
|z| < 2 while its exterior is described by |z| > 2. 


Similarly all points lying in the first quadrant (shaded in Figure 9.21) have arguments 
between 0 and 7t/2. This quadrant is therefore described by the expression: 


0 < arg(z) < 7/2 


y 
x 
Figure 9.20 Figure 9.21 Figure 9.22 
A circle, radius 2, centred at the First quadrant of the x—y plane. Locus of points satisfying 
origin. arg(z) = 7/4. 
Example 9.23 Sketch the locus of the point satisfying arg(z) = 71/4. 
Solution The set of points with arg(z) = 2/4 comprises complex numbers whose argument is 
7/4. All these complex numbers lie on the line shown in Figure 9.22. 
Example 9.24 Sketch the locus of the point satisfying |z — 2| = 3. 
Solution — First mark the fixed point 2 on the Argand diagram labelling it ‘A’ (Figure 9.23). Con- 


sider the complex number z represented by the point P. From the vector triangle law of 
addition 

OA + AP = OP 

AP = OP — OA 
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Example 9.25 


Solution 


z P 
0 1 2. x 
Figure 9.23 Figure 9.24 
Points z and 2 + 0j. Locus of points satisfying |z — 2| = 3. 


Recall from Section 9.6 that the graphical representation of the sum and difference of 
vectors in the plane, and the sum and difference of complex numbers, are equivalent. 
Since vector OP represents the complex number z, and vector OA represents the complex 
number 2, AP = OP — OA will represent z — 2. Therefore |z — 2| represents the distance 
between A and P. We are given that |z— 2| = 3, which therefore means that P can be any 
point such that its distance from A is 3. This means that P can be any point on a circle 
of radius 3 centred at A(2, 0). The locus is shown in Figure 9.24. |z — 2| < 3 represents 
the interior of the circle while |z — 2| > 3 represents the exterior. Alternatively we can 
obtain the same result algebraically: given |z — 2| = 3 and also that z = x + jy, we can 
write 


Iz—2| = |@—2)+jy| =3 
that is 
V@=2) yy =3 


or 
(@—2)’'+y' =9 
Generally, the equation (x — a)? + (y—b)? = r’ represents a circle of radius r centred at 


(a, b), so we see that (x — 2)? + y = 9 represents a circle of radius 3 centred at (2, 0), 
as before. 


Use the algebraic approach to find the locus of the point z which satisfies 


| ee il 
Zz = 5 -J 


If z =x + Jy, then we have 
. 1 : 
l(x— 1) +jy| = a +j0 - 1)| 
Therefore, 


1 
@@-1)P+y = rae + (y-1)’} 
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and so 
A(x — 1)? +4y? =x? + (y— 1)? 
that is 
3x7 — 8x + 3y° + 2y +3 =0 
By completing the square this may be written in the form 
(-YoOs)-s 
3 3 3 


that is 


(Ges 


: : : 4 1 
which represents a circle of radius /8/3 centred at @ a 3) 


“1 Sketch the loci defined by (d) O< arg(z) < 1/2 

(a) arg(z) = 0 eee 

(b) arg(z) = 7/2 (f) |z+j| >3 

(c) arg(zg—4) = 1/4 (g) |Ie-1] < |z-2| 

eh Pele = 2 ee If s = 0 + jw sketch the regions defined by 
| Sketch the regions defined by 

(a) Re(z) >0 Qe 

(b) Im) <3 weed 

(c) |z| >3 (c) —-2<@m@<2 


“a See Figure S.19. 


arg(z—4) = 7/4 
2lzl = Iz— 11 
arg(z) = 0 arg(z) = 77/2 y 
eo 
(a) (b) (c) (d) 


Figure S.19 


wile 
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2. See Figure S.20. 


y 


Im(z) < 3 Wax en 0 < arg(z)< 7/2 


7 


Y, 


Y a “ V7, Iz+jl > 3 Y 
x 7 \|\. , 
J was GN) 
(e) (f) 


Yj, (g) 


Iz-1l < lz -2I 


y 


Figure S.20 


3 See Figure S.21. 


P @ 
Y 
< 2<w<2 
o o 
o 
Vi 
(a) (b) 
Figure S.21 
REVIEW EXERCISES 9 
1. Show that ee en =cosé +jsind. 5 Express in the form a + bj: 
cos @ — j sind ( 
+ i 96 
2 Express in Cartesian form (a) (cos@ + jsin@) (b) (cos@ + jsind) 
5+4j 1 cos @ + jsin@ 
@ (b) —— c _ 
5-4 24+ 3j cos@ +jsing 
(c) : ae : - (d) : - 6 IfzeC, show that 
243) 2-3j x—-jy 2 = ; 
(a) z+Z= 2Re(z) (b) z—Z= 2j Im(z) 
3 


Find the modulus and argument of —j, —3, 1 +j, () z= (zl 
cos@ +jsing. 


i 


; . 7 Show that oe + eJ©? = 2 cos wt and find an 
Mark on an Argand diagram vectors corresponding to : iwt a 
: : : expression for eo’ — eI, 
the following complex numbers: —3 + 2j, —3 — 2), 


cos 7 + j sin 7. 8 Express 1 + e7/@ in the form a + dj. 


9 Sketch the region in the complex plane described by 
|Z +2j| < 1. 


10 Express e!/?-%) in the form a + bj. 


11 Solve the equation z* + 1 = jv3. 


Solutions 
9 40. 3 . 4 
2 (a) at a (b) a 3B! (c) B 


oa 
Pry 2+! 

me | jl =1,arg(—)) = —7/2; |—3] = 3, arg(—3) = 2; 
jl+jl= “2: arg(1+j) = 7/4, |cos@ +jsind| = 1, 
arg(cos@ +jsind) = 0 

5 (a) cos 660 +jsin60 (b) cos 36 — jsin 30 
(c) cos(@ — @) +jsin(@ — d) 


12 


13 


11 
12 


wo 
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Express s? + 65 + 13 in the form (s — a)(s — b) 
where a, b € C. 


Express 2s? + 85 + 11 in the form 2(s — a)(s — b) 
where a, be C. 


eo — eI" = 2j sin wt 

1 + cos 2wt + j sin 2wt 

1.5831 + 0.4607) 

247 n/6+nn/2  n=0,1,2,3 


[s — (—3 + 2j)][s — (—3 — 2j)] 


tera) 
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INTRODUCTION 


Differentiation is a mathematical technique for analysing the way in which functions 
change. In particular, it determines how rapidly a function is changing at any specific 
point. As the function in question may represent the magnetic field of a motor, the voltage 
across a capacitor, the temperature of a chemical mix, etc., it is often important to know 
how quickly these quantities change. For example, if the voltage on an electrical supply 
network is falling rapidly because of a short circuit, then it is necessary to detect this 
in order to switch out that part of the network where the fault has occurred. However, 
the system should not take action for normal voltage fluctuations and so it is important 
to distinguish different types and rates of change. Another example would be detecting 
a sudden rise in the pressure of a fermentation vessel and taking appropriate action to 
stabilize the pressure. 

Differentiation will be introduced in this chapter. We shall derive a formula which can 
be used to find the rate of change of a function. To avoid always having to resort to the 
formula engineers often use a table of derivatives; such a table is given in Section 10.7. 
The chapter closes with a discussion of an important property of differentiation — that 
of linearity. 


10.2.1 
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y(t) 


Figure 10.1 
The function y(t) has different rates of 
123 4 5 6 7 8 9 10 11 12 ft change over different regions of f. 


GRAPHICAL APPROACH TO DIFFERENTIATION 


Differentiation is concerned with the rate at which a function is changing, rather than 
the actual change itself. We can explore the rate of change of a function by examining 
Figure 10.1. There are several regions to this curve corresponding to different intervals 
of t. In the interval [0, 5] the function does not change at all. The rate of change of y 
is zero. From t = 5 tot = 7 the function increases slightly. Thus the rate of change of 
y as ¢ increases is small. Since y is increasing, the rate of change of y is positive. From 
t = 7 tot = 8 there is a rapid rise in the value of the function. The rate of change of y 
is large and positive. From t = 8 to t = 9 the value of y decreases very rapidly. The rate 
of change of y is large and negative. Finally from t = 9 tot = 12 the function decreases 
slightly. Thus the rate of change of y is small and negative. 

The aim of differential calculus is to specify the rate of change of a function pre- 
cisely. It is not sufficient to say ‘the rate of change of a function is large’. We require an 
exact value or expression for the rate of change. Before being able to do this we need to 
introduce two concepts concerning the rate of change of a function. 


Average rate of change of a function across an interval 


Consider Figure 10.2. When ¢ = ¢,, the function has a value y(t, ). This is denoted by A 
on the curve. When f = f,, the function has a value of y(t,). This point is denoted by B 
on the curve. The function changes by an amount y(¢,) — y(¢,) over the interval [f,, t,]. 
Thus the average rate of change of the function over the interval is 

changeiny _ y(t,) — y(t) 


change in t t,—t, 


The straight line joining A and B is known as a chord. Graphically, y(¢,) — y(¢,) is the 
vertical distance and t, — f, the horizontal distance between A and B, so that the gradient 
of the chord AB is given by 


BC _ y) — y(t) 
AC t—t 


The gradient or slope of a line is a measure of its steepness and lines may have positive, 
negative or zero gradients as shown in Figure 10.3. 

Thus the gradient of the chord AB corresponds to the average rate of change of the 
function between A and B. To summarize: 


The average rate of change of a function between two points A and B is the gradient 
of the chord AB. 
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10.2.2 


t 
¥@ Positive 


y(b) gradient 
p)i |Sasetesesecssscegecs 


Zero 
gradient 


y(t) -y () 


y(t) | -- 22a --L-------- "iC 
+—— (t,-t) =" 


Negative 


1 a gradient 
ty ty t t 
Figure 10.2 Figure 10.3 
Average rate of change across an interval. Lines can have different gradients. 
Chord AB, 


yOu B 


Chord AB, 


Tangent at A 


Figure 10.4 
Sr Point B is moved nearer to A to improve 
t accuracy. 


Rate of change of a function at a point 


Consider again Figure 10.2. Suppose we require the rate of change of the function at 
point A. We can use the gradient of the chord AB as an approximation to this value. If B 
is close to A then the approximation is better than if B is not so close to A. Therefore by 
moving B nearer to A it is possible to improve the accuracy of this approximation (see 
Figure 10.4). 

Suppose the chord AB is extended as a straight line on both sides of AB, and B is 
moved closer and closer to A until both points eventually coincide. The straight line 
becomes a tangent to the curve at A. This is the straight line that just touches the curve 
at A. However, the rate of change of this tangent, that is its gradient, still corresponds to 
the rate of change of the function, but now it is the rate of change of the function at the 
point A. To summarize: 


The rate of change of a function at a point A on the curve is the gradient of the 
tangent to the curve at point A. 


We have still to address the question of how the gradients of chords and tangents are 
found. This requires a knowledge of limits which is the topic of the next section. 


LIMITS AND CONTINUITY 


The concept of a limit is crucial to the development of differentiation. We write 
t — c to denote that t approaches, or tends to, the value of c. Note carefully that this is 
distinct from stating t = c. As t tends to c we consider the value to which the function 
approaches and call this value the limit of the function as t > c. 


Example 10.1 


Solution 


Example 10.2 


Solution 
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Figure 10.6 
Figure 10.5 As x > 0, y > 1, even though 
The curve f(t) = 1? + 2r —3. y(0) = 3. 


If t — 2, what value does 
ft) = +2t -—3 
approach? 
Figure 10.5 shows a graph of f(t). Clearly, whether t = 2 is approached from the I.h.s. 
or the r.h.s. the function tends to 5. That is, if t — 2, then f(t) — 5. We note that this is 


the value of f (2). Informally we are saying that as ¢ gets nearer and nearer to the value 2, 
so f(t) gets nearer and nearer to 5. This is usually written as 


lim(r +2t—3)=5 
to 


where ‘lim’ is an abbreviation of limit. In this example, the limit of f(t) ast — 2 is 
simply f (2), but this is not true for all functions. 


Figure 10.6 illustrates y(x) defined by 


l-x x<0O 
y(x) = 4 3 x=0 
x+1 x>0 
Evaluate: 
(a) lim,.,; y (b) lim, y (c) lim, y 


We note that this function is piecewise continuous. It has a discontinuity at x = 0. 


(a) We seek the limit of y as x approaches 3. As x approaches 3, we will be on that part 
of the function defined by x > 0, that is yx) = x+ 1. Asx — 3, theny > 3+ 1, 
that is y > 4. So 

limy =4 


x3 


(b) When x approaches —1, we will be on that part of the function defined by x < 0, 
that is yx) = 1 — x. Soas x — —1, then y > 1| — (—1), that is y > 2. Hence 


lim y =2 


(c) As x approaches 0 what value does y approach? Note that we are not evaluating y(0) 
which actually has a value of 3. We simply ask the question ‘What value is y near 
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when x is near, but distinct from, 0?’ From Figure 10.6 we see y is near to 1, that is 


limy = 1 


x0 


Example 10.3 The function y(x) is defined by 


Solution 


0 x <0 
y(x) = 4x 0 <x <2 
x—-2 x>2 
(a) Sketch the function. 
(b) State the limit of y as x approaches (i) 3, (ii) 2, (iii) 0. 


(a) The function is shown in Figure 10.7. Note that the figure has three parts; each part 
corresponds to a part in the algebraic definition. 


(b) () Asx — 3, the relevant part of the function is y(x) = x — 2. Hence 
limy = 1 
(ii) Suppose x < 2 and gradually increases, approaching the value 2. Then, from 
the graph, we see that y approaches 2. Now, suppose x > 2 and gradually 
decreases, tending to 2. In this case y approaches 0. Hence, we cannot find the 
limit of y as x tends to 2. The lim,_,, y does not exist. 


(iii) As x tends to 0, y tends to 0. This is true whether x approaches 0 from below, 
that is from the left, or from above, that is from the right. So, 


limy = 0 


' Figure 10.7 
: The function y has different limits as x > 2 
2 x from the left and the right. 


It is appropriate at this stage to introduce the concept of left-hand and right-hand limits. 
Referring to Example 10.3, we see that as x approaches 2 from the left, that is from below, 
then y approaches 2. We say that the left-hand limit of y as x tends to 2 is 2. This is 
written as 

li =2 

wr? 
Similarly, the right-hand limit of y is obtained by letting x tend to 2 from above. In this 
case, y approaches 0. This is written as 

Lig y=0 
Consider a point at which the left-hand and right-hand limits are equal. At such a point 
we say ‘the limit exists at that point’. 
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EXERCISES 10.3 
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The limit of a function, at a point x = a, exists only if the left-hand and right-hand 
limits are equal there. 


Continuous and discontinuous functions 


A function f is continuous at the point where x = a, if 
lim f = f(a) 
xa 


that is, the limit value matches the function value at a point of continuity. A function 
which is not continuous is discontinuous. In Example 10.3, the function is continuous 
at x = 0 because 


limy =0 = f©) 


but discontinuous at x = 2 because lim,_,, y does not exist. In Example 10.2, the function 
is discontinuous at x = O because lim, ,) y = | but y(O) = 3. The concept of continuity 


corresponds to our natural understanding of a break in the graph of the function, as 
discussed in Chapter 2. 


A function f is continuous at a point x = a if and only if 
iat di SI) 


that is, the limit of f exists at x = a and is equal to f(a). 


1. The function, f(t), is defined by 2 The function g(t) is defined by 
1 0<t<2 0 t<0 
f@=42 2<t <3 = P vcr 3 
3 t>3 BES VSS: Vep et 


Sketch a graph of f(t) and state the following limits if 


they exist: 


(a) lim,_,,5f 
(b) lim,_,>. f 
(c) lim,_,3 f 
(d) lim,_,o f 
(e) lim,_,3- f 


Solutions 


1 (a) 1 (b) 2 
(d) 1 (e) 2 


12 t>4 
(a) Sketch g. 
(b) State any points of discontinuity. 
(c) Find, if they exist, 
(i) lim,_,; ¢ 
(ii) lim,_, 4g 


(iii) lim,_, 4-8 


(c) not defined 2 (b) t=4 
(c) (i) 9 (ii) not defined (iii) 11 
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RATE OF CHANGE AT A SPECIFIC POINT 


We saw in Section 10.2 that the rate of change of a function at a point is the gradient of 
the tangent to the curve at that point. Also, we can think of a tangent at A as the limit of 
an extended chord AB as B — A. We now put these two ideas together to find the rate 
of change of a function at a point. 


Example 10.4 Given y = f(x) = 3x’ + 2, obtain estimates of the rate of change of y at x = 3 by 
considering the intervals 


(a) [3, 4] (b) [3, 3.1] (c) [3, 3.01] 


Solution (a) Consider Figure 10.8. 


y(3) = 3(3)9? +2 = 29 
y(4) = 3(4)? +2 =50 


Let A be the point (3, 29) on the curve. Let B be the point (4, 50). Then 


: change in y 
average rate of change over the interval [3, 4] = —————— 
change in x 


_ (4) -y@) 
As 

_ 202 a 
4—3 


This is the gradient of the chord AB and is an estimate of the gradient of the tangent 
at A. That is, the rate of change at A is approximately 21. 


(b) y(3.1) = 30.83 and so, 


30.83 — 29 
average rate of change over the interval [3, 3.1] = a. a 18.3 
This is a more accurate estimate of the rate of change at A. 


(c) y(3.01) = 29.1803 and so, 


29.1803 — 29 
3.01 — 3 
= 18.03 


average rate of change over the interval [3, 3.01] = 


» Figure 10.8 
x The function: y = 3x” + 2. 


Example 10.5 


Solution 
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This is an even better estimate of the rate of change at A. Hence at A, if x increases 
by | unit then y increases by approximately 18 units. This corresponds to a steep 
upward slope at A. 


Example 10.4 illustrates the approach of estimating the rate of change at a point by using 
the ‘shrinking interval’ method. By taking smaller and smaller intervals, better and better 
estimates of the rate of change of the function at x = 3 can be obtained. However, we 
eventually want the interval to ‘shrink’ to the point x = 3. We introduce a small change 
or increment of x denoted by 6x and consider the interval [3, 3 + 6x]. By letting 6x tend 
to zero, the interval [3, 3 + 46x] effectively shrinks to the point x = 3. 


Find the rate of change of y = 3x” + 2 at x = 3 by considering the interval [3, 3 + 5x] 


and letting 6x tend to 0. 


When x = 3, y(3) = 29. When x = 3 + 6x then 


y(3 + dx) = 3(3 + dx)? +2 
= 3(9 + 65x + (6x)*) +2 
= 3(dx)? + 185x + 29 


So, 
__ change in y 


average rate of change of y across [3, 3 + dx] = h : 
change in x 


(3(6x)* + 185x + 29) — 29 
~ bx 
3(6x)? + 185x 
= Ox 
6x(3dx + 18) 
~ bx 
= 36x+ 18 


We now let 6x tend to 0, so that the interval shrinks to a point: 


rate of change of y when x is 3 = jim (38x + 18) = 18 


We have found the rate of change of y at a particular value of x, rather than across an 
interval. We usually write 


lim 


rate of change of y when x is 3 fim, 


y(3 + 6x) — y(3) 
Ox 


. jp 
lim | ———— 


ro ) = lim Gdx + 18) 


6x0 


18 
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EXERCISES 10.4 


7) Find the rate of change of y = 3 2 at same answer results regardless of which interval is 
used. 
(a) x = 4 by considering the interval [4, 4 + 6x] 4& Find the rate of change of y(x) = 2 — x at 


(b) x = —2 by considering the interval [—2, —2 + dx] (a) x = 3, by considering the interval [3, 3 + dx] 


(c) x = 1 by considering the interval [1 — 5x, 1 + dx] (b) x = —5, by considering the interval 


; [—5, —5 + 6x] 
2. Find the rate of change of y = 1/x atx = 2. 
(c) x = 1, by considering the interval 

3 To determine the rate of change of y = x* — x {1 — dx, 1 + 6x]. 

at x = | the interval [1, 1 + 6x] could be used. ; _ x _ 

Baually the intervals (1.3%, Vor 5 Find the rate of change of y(x) = ee at x = 3 by 

[1 — 6x, | + 6x] could be used. Show that the considering the interval [3, 3 + dx]. 

Solutions 
1 (a) 24 (b) —12 (c) 6 4 (a) -6 (b) 10 (c) -—2 
1 

2 —0.25 5 +b 
oy 1 


| 10.5 | RATE OF CHANGE AT A GENERAL POINT 


Example 10.5 shows that the rate of change of a function at a particular point can be 
found. We will now develop a general terminology for the method. Suppose we have a 
function of x, y(x). We wish to find the rate of change of y at a general value of x. We 
begin by finding the average rate of change of y(x) across an interval and then allow the 
interval to shrink to a single point. Consider the interval [x, x + dx]. At the beginning of 
the interval y has a value of y(x). At the end of the interval y has a value of y(x + 4x) so 
that the change in y is y(x + dx) — y(x), which we denote by dy (see Figure 10.9). So, 


change in y 
average rate of change of y = —————— 
change in x 

_ ya+dx)— yx) _ dy 

ox bx 


Now let 5x tend to 0, so that the interval shrinks to a point. Then 


bx 0 bx 


bx) — 6 
rate of change of y = lim (Soe) = rim (=) 
> A 


To see how we proceed to evaluate this limit consider Example 10.6. 
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y(x + 6x) 
dy = y(x + 6x) — yx) 
y) 


Figure 10.9 
The rate of change of y at a point is found by letting 6x > 0. 


Example 10.6 Find the rate of change of y(x) = 2x” +3x. Calculate the rate of change of y when x = 2 
and when x = —3. 


Solution Given y(x) = 2x? + 3x 


then 
y(x + 6x) = 2(x + dx)? + 3(x + dx) 
= 2x? + Axdx + 2(6x)* + 3x + 35x 
Hence 
y(x + 5x) — y(x) = 2(6x)? + 4xdx + 34x 
So, 
8x) — 
rate of change of y = lim (| 
ix ee 
ja + Axdx + ws) 
= lim 
dx—>0 bx 


jim (26x + 4x+3) =4x+3 


When x = 2, the rate of change of y is 4(2) + 3 = 11. When x = —3, the rate of change 
of y is 4(—3) +3 = —9. A positive rate of change shows that the function is increasing 
at that particular point. A negative rate of change shows that the function is decreasing 
at that particular point. 


5 d 
The rate of change of y is called the derivative of y. We denote jim (2) by 2 This 
x Xx 


is pronounced ‘dee y by dee x’. 


da. 
rate of change of y(x) = fe jim 


(= + 6x) — =) 
bx 


__ dy _ Oy 
Note that the notation — means lim —. 
dx 6x0 bX 
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Example 10.7 


Solution 


Using the previous example, if 
y(x) = yf Bg 
then 


dy 
2 ede 
a. 


d 
- is often abbreviated to y’ 


y is pronounced ‘y dash’ or ‘y prime’. To stress that y is the dependent variable and x 
the independent variable we often talk of ‘the rate of change of y with respect to x’, or, 
more compactly, ‘the rate of change of y w.r.t. x’. The process of finding y’ from y is 
called differentiation. This shrinking interval method of finding the derivative is called 


differentiation from first principles. We know that the derivative + is the gradient 


of the tangent to the function at a point. It is also the rate of change of the function. In 
many examples, the independent variable is ¢ and we need to find the rate of change of y 


d 
with respect to ¢; that is, find - This is also often written as y’ although y, pronounced 


‘y dot’, is also common. The reader should be aware of both notations. Finally, y’ is used 


dy d 
to denote the derivative of y whatever the independent variable may be. So - 7 and 
z dr 


d 
= could all be represented by y’. 
w 


Find the gradient of the tangent to y = x? at A(1, 1), B(—1, 1) and C(2, 4). 


We have y = x” and so 


y(x + 6x) = (x + 6x)? 
= x* + 2xdx + (5x)? 


and hence 


y(x + 6x) — y(x) = x? + 2xdx + (dx)? — x° 
= 2xdx + (5x)? 


Then 


* = gradient of a tangent to curve 


_ (yt 6x) — yx) 
= lim (| ————_ 
bx>0 bx 
. (x + bx)? — x? : 2xdx + (5x)? 
= lim | ————— _] = lim | ———_——_ 
dx>0 bx bx>0 bx 


= jim (2x + 6x) = 2x 


Example 10.8 


Solution 
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d d 
At (1, 1), ; = 2 = gradient of tangent at A. At (—1, 1), * = —2 = gradient of 


d 
tangent at B. At (2, 4), ;. = 4 = gradient of tangent at C. 


d 
Suppose we wish to evaluate the derivative, os at a specific value of x, say x). This is 
denoted by 


dy dy / 
re (x = Xp) or more compactly by a (Xo) or y (X%) 


An alternative notation is 


dy dy 
dx} _. dx 


0) 


So, for Example 10.7 we could have written 


=4 y(2)=4 


Wy =2 ee) 
SU) = @=-1)= 


Refer to Figure 10.10. By considering the gradient of the tangent at the points A, B, C, 


D and E state whether = is positive, negative or zero at these points. 


d 
At A and C the tangent is parallel to the x axis and so 7 is zero. At B and E the tangent 


d 
has a positive gradient and so *. is positive. At D the tangent has a negative gradient 


dy 
and thus — is negative. 
Ap g 


Figure 10.10 
C Graph for Example 10.8. 


As we saw in Chapter 3, functions are used to represent physically important quantities 
such as voltage and current. When the current through certain devices changes, this can 
give rise to voltages, the magnitudes of which are proportional to the rate of change of 
the current. Consequently differentiation is needed to model these effects as illustrated 
in Engineering applications 10.1 and 10.2 
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Engineering application 10.1 


Voltage across an inductor 


The voltage, v, across an inductor with inductance, L, is related to the current, i, 


through the inductor by 
di 
a 
dt 


Figure 10.11 shows the relationship between magnetic flux lines passing through the 
coils of an inductor and the current flowing through the inductor. 


Figure 10.11 

Schematic diagram of an 
inductor showing the 
relationship between magnetic 
flux lines and current. 


This relationship is a quantification of Faraday’s law which states that the voltage 
induced in a coil is proportional to the rate of change of magnetic flux through it. If 
the current in a coil is changing then this corresponds to a change in the magnetic 
flux through the coil. Note that if a is large then v is large. This is why care has 
to be taken when abruptly switching off the current to an inductor because it causes 
high voltages to be generated. 


Engineering application 10.2 


Current through a capacitor 


The current, 7, through a capacitor with capacitance, C, is related to the voltage, v, 
across the capacitor by 


; du 
p= C— 
dt 
Displacement current ip 
= 
= 
Conduction current 7 i 
Figure 10.12 
= Schematic diagram of a capacitor 
showing the conduction current and 
> the displacement current. 


It may appear confusing to talk of a current flow through a capacitor as no actual 
charge flows through the capacitor apart from that caused by any leakage current. 
Instead there is a build-up of charge on the plates of the capacitor. This in turn gives 
rise to a voltage across the capacitor. If the current flow is large then the rate of change 
of this voltage will be large. The current flow through the capacitor wires is termed a 


1 


Example 10.9 


Solution 


EXERCISES 10.5 
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conduction current while that between the capacitor plates is called a displacement 
current (see Figure 10.12). 

The displacement current can be thought of as a virtual current that flows between 
the plates due to the build-up of positive charge on one plate and negative charge 
on the other plate. It doesn’t correspond to a real current flowing between the plates 
as they are separated by an insulating material. 


We can relate the small changes 5x and dy to the derivative oa 
If 5x is very small yet still finite we can state that 
dy _ dy 
dx dx 
This result allows an important approximation to be made. From Equation (10.1) we see 
that if a small change, 6x, is made to the independent variable, the corresponding change 
in the dependent variable is given by the following formula: 


(10.1) 


oy  —3 
errs 


If y = x° estimate the change in y caused by changing x from 3 to 3.1. 


d 
If y = x° then = 2x. The approximate change in the dependent variable is given by 


d 
by & S 5x = 2xdx 
dx 
Taking x = 3 and dx = 0.1 we have 
dy © (2)(3)(0.1) = 0.6 


We conclude that at the point where x = 3 a change in x to 3.1 causes an approximate 
change of 0.6 in the value of y. 


Calculate the gradient of the functions at the specified 5 Differentiate y = 2x2 + 9: that is, find dy 
: ; , re 


points. 
(a) y = 2x? at (1, 2) 


What is the rate of change of y when x = 3, —2, 1, 0? 


(b) y = 2x —x2 at (0, 0) 6 Find the rate of change of y = 4t — t?. What is the 


d 
(c) y= 1+x4.x7 at (2,7) value of = when t = 2? 
(d) y = 2x? + 1 at (2,9) 


7 Ify =x — 3x2 +x then 


A function, y, is such that ° is constant. a : 

What can you say about the function, y? a 3x" — 6x + 1 

For which graphs in Figure 10.13 is the derivative Estimate the change in y as x changes from 
always (a) positive or (b) negative? (a) 2 to 2.05 

Find the derivative of y(x) where y is (b) 0 to 0.025 


(a) x2 (b) —x? 42x (c) —1 to —1.05 
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y y y od 
x 
x 
x Xx 
(i) Gai) (iii) (iv) 
Figure 10.13 
Solutions 
Ta@4 62 @5 @s8 4 (a) 2x = (b) —2x +2 
2. yis linear in x, that is y= ax+b 5 4x, 12,—-8,4,0 
3 (i) _ always negative (ii) always positive 6 4-210 
(iii) always positive (iv) always negative 7 (a) 0.05 (b) 0.025 (c) —0.5 


EXISTENCE OF DERIVATIVES 


d 

So far we have seen that the derivative, = of a function, y(x), may be viewed either 
algebraically or geometrically. 

dy i y(x + 6x) — y(x) 

— = hm — 

dx dx 0 bx 
SY ison f 
— = rate of change o 
Ae g y 


= gradient of the graph of y 


; ; d ; aye 
We now discuss briefly the existence of se For some functions the derivative does not 


exist at certain points and we need to be able to recognize such points. Consider the 
graphs shown in Figure 10.14. Figure 10.14(a) shows a function with a discontinuity at 


(b) 


Figure 10.14 
(a) The graph has a discontinuity at x = a. (b) The graph has a cusp at 


X=da. 
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x = a. The function shown in Figure 10.14(b) is continuous but has a cusp or corner 


aed dy 
at x = a. In both cases it is impossible to draw a tangent at x = a, and so an does not 


exist at x = a. It is impossible to draw a tangent to a curve at a point where the curve is 
not smooth. Note from Figure 10.14(b) that continuity is not sufficient to guarantee the 
existence of a derivative. 


Example 10.10 Sketch the following functions. State the values of t for which the derivative does not 
exist. 


(a) y=It| (b) y = tant (c) y=1/t 


Solution (a) The graph of y = |t| is shown in Figure 10.15(a). A corner exists at t = 0 and so the 
derivative does not exist here. 


(b) A graph of y = tant is shown in Figure 10.15(b). There is a discontinuity in tant 
when t = _ ... —37/2, —7/2, 7/2, 371/2,.... No derivative exists at these 
points. 

(c) Figure 10.15(c) shows a graph of y = 1/t. The function has one discontinuity at 
t = 0, and so the derivative does not exist here. 


y=ltl 


(a) (b) (c) 


Figure 10.15 
(a) There is a corner at t = 0; (b) tant has discontinuities; (c) y = 1/t has a discontinuity at t = 0. 


EXERCISES 10.6 


1. Sketch the functions and determine any points where (d) y= (1/2 
a derivative does not exist. : t>0 
iT (e) The unit step function u(t) = 0 0 
(@) y=— : 
t-—1 é 
_ 1 +: = 
(b) y=|sin¢| (f) The ramp function f(t) = ie Pe 
(c) y= ef 0 t<0O 
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Solutions 


1 See Figure S.22. 


(b) No derivative exists for tf = n7 


(a) No derivative exists for ft = 1 


lsin ¢l 


-2n-nm 0 nm 2737 t 


(e) No derivative exists for t = 0 


u(t) 
1 


Figure S.22 


i'm =COMMON DERIVATIVES 


It is time consuming to find the derivative of y(x) using the ‘shrinking interval’ method 
(often referred to as differentiation from first principles). Consequently the deriva- 
tives of commonly used functions are listed for reference in Table 10.1. It will be help- 
ful to memorize the most common derivatives. Note that a, b and n are constants. In 
the trigonometric functions, the quantity ax + b, being an angle, must be measured in 


radians. 


A shorter table of the more common derivatives is given on the inside back cover of 


this book for easy reference. 


Example 10.11 Use Table 10.1 to find y’ when 


(a) y=e™ 

(b) y= 

(c) y = tan(3x — 2) 
(d) y =sin(ox +9) 


(e) y= 


y= 
(g) y =cosh™! 5x 


1 
Vi 

1 
x5 


(c) Derivative exists for all values of t 


et 


(f) No derivative exists for ¢ = 0, although 
the function is continuous here 


ui) 


Solution 


Table 10.1 
Derivatives of commonly used functions. 


Function, y(x) Derivative, y’ 


10.7 Common derivatives 


Function, y(x) 


Derivative, y' 
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constant 0 
x nx} 
e* e* 
ex —e-*x 
e& ae“ 
1 
Inx = 
x 
sin x cosx 
cosx —sinx 
sin(ax + b) acos(ax + b) 
cos(ax + b) —asin(ax + b) 
tan(ax + b) asec? (ax + b) 


cosec(ax + b) —acosec(ax + b) cot(ax + b) 


sec(ax + b) asec(ax + b) tan(ax + b) 


cot(ax + b) —acosec? (ax + b) 


a 


V1 — (ax +b) 


sin-! (ax + b) 


(a) From Table 10.1, we find that if 


yoie™ then y =ae“ 
In this case, a = —7 and so if 
y=e”™ then y =-Te 
(b) From Table 10.1, we find that if 
y=x" then y =n! 


In this case, n = 5 and so if 


y=x then y = 5x4 


—7x 


cos~! (ax + b) ae 
V1 — (ax +b)? 
tan~! (ax + b) z 
14+ (ax+b)? 
sinh(ax + b) acosh(ax + b) 
cosh(ax + b) asinh(ax + b) 
tanh(ax + b) asech2(ax + b) 
cosech(ax + b) —acosech(ax + b) x 
coth(ax + b) 
sech(ax + b) —asech(ax + b) x 
tanh(ax + b) 
coth(ax + b) —acosech? (ax + b) 
re a 
sinh” * (ax + b) 
V(ax+b)2 +1 
a | a 
cosh “(ax + b) 
J (ax +b)? —1 
ion —— 
1—(ax+b)? 


(c) Ify = tan(ax+b) then y = asec? (ax+b). In this case, a = 3 and b = —2. Hence if 


y = tan(3x — 2) then 


(d) If y = sin(ax + D) then y’ = acos(ax + b). Here a = w and b = ¢, and so if 


y = sin(@x + ¢) then 


y’ = 3 sec?(3x — 2) 


y =acos(@x+ ¢) 


1 9 : 
(e) Note that — = x'/. From Table 10.1 we find that if y = x” then y’ = nx’!. In 


% 
this case, n = —1/2 and so if 


then 


1 
= hi 
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1 

(f) Note that = = x >. Using Table 10.1, we find that if y = x~> then y’ = —5x~°. 
x 

(g) From Table 10.1, if y = cosh! (ax + b) then 

a 


VJ (ax +b)? — 1 


In this case, a = 5 and b = 0. Hence, if 


if: 


5 
= cosh! 5x then ‘= —____. 
af J 25x2 — 1 


Example 10.12 Differentiate y(t) = e’. 


Solution We note that the independent variable is t. However, Table 10.1 can still be used. From 
Table 10.1, we find 


a 
dt . 


We note that the derivative of e’ is again e’. This is the only function which reproduces 
itself upon differentiation. 


EXERCISES 10.7 


1. Use Table 10.1 to find y’ given: 2 Find ° when 


(a) y=? oy y=e <n _ .2x/3 
(c) y=r3 (d) y=t aa (b) ye 
1 1 

(ec) y= A () y= ro) (c) y=e Vv? (d) y=Inx 

= att — p—3t = 
(g) as (h) y=e te y= one * ) 
® yay @) ya? i 
(kK) y =sin(2t + 3) (1) y=cos(4—1r) (f) y= tan! (mx 4+ 3) 

t (g) y = tanh(2x + 1) 

(m) y = tan 3 +1 (n) y = cosec(3t + 7) Hox ih! (—3x) 
(0) y=cot(1—1f) (p) y = sec(2t — 7) @ y= ee a) eeonseat 
(qq) y=sin-¢+7) @) yar QO Y= ya 3) (k) y= cos3e 


(s) y=tan7!(—2r— 1) 
(u) y = tanh(6r) 


(w) y= snn( 5?) 
9 v=eon(3- 5) 
3 2 


(t) y=cos—!(4t — 3) 
(v) y =cosh(2t + 5) 


(x) y = sech(—t) 


(Zz) y= cosh! (t + 3) 


sin(5x + 3) 


= (m) y= tan(2x + 7) 
cos 3x 


(n) y= cose 5) 


2. 3 
(0) y= tntl( a 
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Solutions 

1 (a) 2t (b) 978 Ba) -0.5x-3/2 (b) 2 2x/3 
(c) —3¢74 (d) 1 3 
(e) —t? (f) —21-3 (c) —0.5e—*/2 (d) 1/x 
(g) 3e% (h) —3e7* 
(i) -Se* @) oar" (e) ei cosec a7 cot = 
(k) 2cos(2t +3) (l) sin(4 — 1) 3 3 
(m) 0.5 sec?(t/2 + 1) (f) is (g) 2sech2(2x + 1) 
(n) —3cosec(3t + 7) cot(3t +7) 1+ (rex + 3)? 
(0) cosec?(1 — 1) (h) — — (i) —@cosec? (wx + 71) 
(p) 2sec(2t — 7) tan(2t — 7) sid 
(@) 1 () 0 (j) —S5cosec(5x + 3) cot(S5x + 3) 

J1l—(t : ™)2 ij (k) —3 sin 3x 
2 
(s) — scp? (t) — AGP 1) 3sec3xtan3x  (m) 2sec*(2x+ 7) 
(u) 6 sech? 6r (v) 2sinh(2r +5) (n) —0.5 cones( 5) cat 5) 
(w) 0.5 co("5*) (x) sech(—t) tanh(—f) 2 
‘ (0) 5 

(y) “9 decal? aa (z) ! a ~ (233) 
> 3 3. 2. /(t+3)2-1 7 


| 10.8 | DIFFERENTIATION AS A LINEAR OPERATOR 


In mathematical language differentiation is a linear operator. This means that if we wish 
to differentiate the sum of two functions we can differentiate each function separately 
and then simply add the two derivatives, that is 


derivative of (f + g) = derivative of f + derivative of g 


This is expressed mathematically as 


d 
We can regard Ae as the operation of differentiation being applied to the expression 


which follows it. The properties of a linear operator also make the handling of constant 
factors easy. To differentiate kf, where k is a constant, we take k times the derivative of 
f, that is 


derivative of (kf) =k x derivative of f 
Mathematically, we would state: 


df 


d 
na ae 
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Table 10.1 together with these two linearity properties allow us to differentiate some 
quite complicated functions. 


Example 10.13 Differentiate 


Solution 


(a) 3x (b) 9x (c) 7 (d) 3x7 +9x+7 


(a) Let y = 3x’, then 
dy _d 


i 32 
ae ay | x) 
3 : (x*) using linearit 
= 3—(x 
ay g y 
= 3(2x) from the table 
= 6x 
(b) Let y = 9x, then 
dy d 


dx a 
= 92. (x) using linearity 
dx 
=9 
(c) Let y= 7, then y’ = 0. 
(d) Let y = 3x7? +9x+7 


De Dae oopa%) 
dc dee xX 
dy 


d d d 
Ae = 3 (x7) + ae (x) + ik (7) using linearity 


=6x4+9 


Engineering application 10.3 


Fluid flow into a tank 


If fluid is being poured into a tank at a rate of g m* s~!, then this will result in an 
increase in volume, V, of fluid in the tank. The arrangement is illustrated in Fig- 


dv 
ure 10.16. The rate of increase in volume, ae m? s_!, is given by 


This relationship follows from the principle of conservation of mass. If q is large, 


dv 
then ie is large, which corresponds to the fluid volume in the tank increasing at a 


10.8 Differentiation as a linear operator 


cross-sectional area of the tank, A, is constant, then V = Ah. Therefore, 


dV d dh 
= = = (Vp) = A= 
dt dt dt 
because differentiation is a linear operator. So, 
dh 
ha 
=e 


a Figure 10.16 
A closed tank containing a volume of fluid V. 
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fast rate. Consequently, the height of the fluid, /, also increases at a fast rate. If the 


Example 10.14 Use Table 10.1 and the linearity properties of differentiation to find y’ where 


Solution 


@) y=3e" 

(b) y= I/x 

(c) y= 3sin4x 

(d) y = sin2x — cos 5x 
(e) y=3Inx 

(f) y = In 2x 

(g) y= 3x? +7x-—5 


(a) If y = 3e”*, then 


d d d 

;. = qe") = 3") using linearity 
= 3(2e”) using Table 10.1 
= 6e* 


(b) If y =x, then 


y=-lx* from Table 10.1 


1 
~ 2 
(c) If y = 3 sin 4x, then 
dy d 


d 
= = a (3 sin 4x) = 3 Fr (sin 4x) using linearity 
= 3(4cos 4x) using Table 10.1 


= 12cos4x 
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(d) The linearity properties allow us to differentiate each term individually. If 


in2 5 th Os pay = “ees 

= Sin ZX — COS IX en — = —(SsiIn ZX) — —(COS IX 
- dx dx dx 
= 2cos2x +5sin5x 


(e) If y = 3 Inx, then 


dy d d ae: : 
(31Inx) =3 re (In x) using linearity 


using Table 10.1 


(f) If y = In 2x, then 


y=In2+4+1nx using laws of logarithms 


and so 
dy 1 1 : : 
—=04-=- since In 2 is constant 
dx x x 


(g) Each term is differentiated: 


y =6x+7-0=6x4+7 


Engineering application 10.4 


Dynamic resistance of a semiconductor diode 


Recall from Engineering application 2.9 that a semiconductor diode can be modelled 
by the equation 


I=1,(e®" —1) 


where V is the applied voltage, J is the diode current, J; is the reverse saturation 
current, k is the Boltzmann constant, q is the charge on the electron, and T is the 
temperature of the diode junction in Kelvin. The parameter n is the ideality factor 
which is usually considered to be a constant for a given type of semiconductor junc- 
tion, dependent primarily on the design and manufacture of the diode. 

Examining the diode electrical characteristics, shown in Figure 2.31, we note that 
there are two main electrical regimes. The first occurs when the applied voltage, V, 
is negative and the current, /, is limited to a negative saturation current, /,, apart from 
a small region around the origin. The second occurs when the applied voltage, V, is 
positive and the current, /, takes on the shape of an exponential curve, apart from a 
small region around the origin. 

We will concentrate on the second case and furthermore we will assume that e#7 
is very much greater than 1. This is written concisely as en? >> 1. This isa good 
approximation apart from the small region close to the origin. The original expression 
for I can then be written as 


wv 
LT [yen 


where the symbol © means “is approximately equal to’. 
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For a fixed temperature the only variables in this equation are J and V. We can 
differentiate this equation to give 


es al f eur 
dV *nkT 
Back substituting the approximate expression J ~ ler into this equation yields 
eS [sem wy 
dV nkT nkT 
Inverting this equation gives 
dV mkt 
di ql 


It is constructive at this stage to compare this result with the static equivalent, V = JR, 
from which 

V 

ep 

il 


dv 
The quantity ar can be thought of as a dynamic value of the diode resistance, r,,, 


which varies depending on the value of the current in the diode. It is valid for a 
positive voltage V that is sufficiently large to ensure the diode is in the exponential 
portion of the electrical characteristic. 


Engineering application 10.5 


Operating point for an ideal semiconductor diode 


In Engineering application 2.18 we examined the mathematical model of a semi- 
conductor diode and then considered the specialised case of an ideal diode. At room 
temperature the equation describing the behaviour of its electrical characteristics was 


T=i1(e" -1) 


where / is the diode current, V is the applied voltage and J, is the constant reverse 
saturation current. Sometimes when a diode is used in a circuit it may be biased 
to operate in a certain region of its /-V characteristic. This means that its use is 
restricted to a certain voltage range. The point around which it operates is known as 
its operating point. This is illustrated in Figure 10.17. 

Deviations from this operating point may be small in certain cases. If so, they are 
known as small-signal variations and are caused by small a.c. voltages being super- 
imposed on the main d.c. bias voltage. In calculating how the diode will react to such 
small-signal voltages, the slope of the diode characteristic around the operating point 
is more relevant than the overall ratio of current to voltage. Provided the deviations 
from the operating point are not large, the tangent to the 7-V curve at the operating 
point provides an adequate model for how the diode will behave. The slope of the 
curve can be obtained by differentiating the diode equation. If 


40) 40 
T=h(e" -1)=L.e" -1 


then 

dl 
— = 401, e*”Y 
dv 
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Operating 
Tangent 

p 1 approximation 
Bias current 
Figure 10.17 

Diode characteristic showing 
operating point and tangent 
voltage approximation. 


since J, is constant. 
It is usual to write small changes in current and voltage as 6/ and 6V. Therefore, 
since 


sI dl 
sv. dV 
61 © 40/, ec" 6V 


This expression allows the change in diode current, 5/, to be estimated given a change 
in diode voltage, 5V, provided the operating point is known and the changes are small. 


Example 10.15 Find the derivative of y = e~' + #7, when 


(a) #=1 
(b) t=0 


y=e? +f 
y=-e'+4+2t 


(a) Whent = 1, y = —e7! + 2 = 1.632, that is y/(1) = 1.632. 
(b) When t = 0, y’ = —1, that is y'(0) = —1. 


Engineering application 10.6 


Obtaining a linear model for a simple fluid system 


Consider the fluid system illustrated in Figure 10.18. The pump is driven by a d.c. 
motor. The pump/motor can be modelled by a linear relationship in which the fluid 
flow rate, q;, is proportional to the control voltage, v,,,, that is 


in? 


Ch = Ky vVin 
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Pump/motor 


” Figure 10.18 
£ qo A fluid system comprising pump, tank 
Tank Valve ° and valve. 


where k, = pump/motor constant, v;, = control voltage (V), g, = flow rate into 
the tank (m* s~'). The valve has a non-linear characteristic given by the quadratic 
polynomial 


p = 2000042 


where p = pressure at the base of the tank (Nm~*), g, = flow out of the tank 
(m? s~!). The fluid being used is water which has a density p = 998 kg m~*. Assume 
g = 9.81 ms~ and that k, = 0.03 m? s~'V~'. Carry out the following: 


(a) Calculate the flow rate out of the tank, g,, and the control voltage, v,,, when the 


in? 


system is in equilibrium and the height of the water in the tank, h, is 0.25 m. 


(b) Obtain a linear model for the system, valid for small changes about a water height 
of 0.25 m. Use this model to calculate the new water height and flow rate out of 
the tank when the control voltage is increased by 0.4 V. 


Solution 
(a) The pressure at the bottom of the tank is given by 
P= pgh = 998 x 9.81 x 0.25 = 2448 
The flow rate through the valve is given by 


Dies P 
40 = 50.000 


5 2448 — 
a a 20350 
40 ~ 30000  V 20000 eed 


Now if the system is in equilibrium, then the height of the water in the tank must 
have stabilized to a constant value. Therefore, 


Gi = Io = 9.350 
and so 
ee, 
k, 0.03 
(b) Before answering this part it is worth examining what is meant by a linear 


model. Figure 10.19 shows the valve characteristic together with a linear ap- 
proximation around an output flow rate of g, = 0.350 m3 s~!. This corresponds 
to a water height of 0.25 m. 
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p (Nm) 


2448 


Figure 10.19 
Relationship between pressure across 
valve (p) and flow through valve (q,). 


0.35 Yo (m3 s-!) 


A linear model for the valve is one in which the relationship between p and 
q, 18 approximated by the straight line which forms a tangent to the curve at the 
operating point. The operating point is the point around which the model is valid. 
It is clear that if the straight line approximation is used for points that are a large 
distance from the operating point, then the linear model will not be very accu- 
rate. However, for small changes around the operating point the approximation 
is reasonably accurate. Clearly a different operating point will require a different 
linear approximation. In order to obtain the gradient of this line it is necessary to 
differentiate the function relating valve pressure to valve flow. So, 


p = 200004? 


d 
= = 400004, 


oO 


At the operating point g, = 0.350. Therefore, 


dp 


= 40000 x 0.350 = 14.000 
dq, 


qo=0-350 


This value is the gradient of the tangent to the curve at the operating point. 
Small changes around an operating point are usually indicated by the notation 6. 
Therefore, 


dp _ dp 


~ = 14000 
6q, dq, 


qo=0.350 


5p = 140005q, (10.2) 


Note that equality has been assumed for the purposes of the linear model. It is 
easy to relate a change in pump flow to a change in control voltage because the 
relationship is linear and so a linear approximation is not required. So, 


q = K Vin 
Differentiating this equation w.r.t. v,, yields 


dq; ey 
dvi, Si 


P 


In this case 


5qj ey dq, 
bu, 


in 


Sik 


P 
du,, 


1 
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and has a constant value independent of the operating point: 
8qj = ky SUin (10.3) 


The relationship between pressure at the bottom of the tank and the water height 
is also linear. 


d d 
Since p = pgh we have + = pg. Because = is a constant we can write 


dp dp _ 
> a 
5p = pgsh (10.4) 


in? 


Therefore, using Equation (10.3), we have 


6q,; = 0.03 x 0.4 = 0.012 


The change in control voltage, 5v;,,, is given as 0.4. We also know that k, = 0.03. 


Now if time is allowed for the system to reach equilibrium with this increased 
input flow, then dg, = dq,. In other words, the output flow increases by the same 
amount as the input flow and the water height once again stabilizes to a fixed 
value. Therefore, 


NG), = oa, = OU 
Using Equation (10.2), we find 
5p = 14000 5g, = 14000 x 0.012 = 168 


Using Equation (10.4), we get 
— bp 168 
~ pg 998 x 9.81 


Therefore the new water height is 0.25 + 0.0172 = 0.267 m to three significant 
figures. The new water flow rate is 0.35 + 0.012 = 0.362 m? s“!. 

To recap, all the elements of the fluid system were linear apart from the valve. 
By obtaining a linear model for the valve, valid for values close to the operating 
point, it was possible to calculate the effect of changing the control voltage to 
the motor. It is important to stress that the linear model for the valve is only good 
for small changes around the operating point. In this case the increase in control 
voltage was approximately 3%. The model would not have been very good for 
predicting the effect of a 50% increase in control voltage. Linear models of non- 
linear systems are particularly useful when several components are non-linear, 
as they are much easier to analyse. We examine these concepts in more detail in 
Chapter 18. 


= 0.0172 


EXERCISES 10.8 


Differentiate the following functions: 
(a) y= 4x3 — 5x? 

(b) y = 3sin(5t) + 2e* 

(c) y =sin(4t) + 3cos(2r) —t (f) y= a 


(d) y = tan(3z) 


(e) y= 2e* +17 — 4sin(2r) 


1 cos 5t 
3 2 
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2w> etw 
ae ame Wak 


(h) y= fet In(V). [Hint: ./x = x'/2, and use the 
laws of logarithms.] 


Evaluate the derivatives of the functions at the given 
value: 


(a) y=2t+9+el/2 t=1 
2 
t*—4t+6 
(b) <= t=2 
3 
(c) y=sint+cost t=1 


(d) y=3e" —2sin( 5) t=0 


1 
(e) y= Stan(2x) + =e x= 0.5 
e x 

(f) y= 3Int + sin(3r) t = 0.25 

. dx, 
Find —, if 

dt 

(a) x=e” (b) x=e 
where w is a constant. 
Find the derivative of 


(a) y =3sin7!(2t) — Scos~! (3r) 


1 
(b) y= 5 tanh (t +2) +4cos—!(2t — 1) 


(c) y = 2sinh(r — 1) — soon 5°) 


Solutions 


(b) 15 cos 5t + 8e* 
(d) 3 sec? 3z 


(a) 12x? — 10x 

(c) 4cos 4t — 6sin 2t — | 
3 

(e) 6e*" — 8cos2t (f) a 2.5 sin 5t 


1 
(g) 2w2 +24” (h) 0.5x7!/2 + = 
X 


(a) 2.8244 (b) 0 (c) —0.3012 
(d) 5 (e) 33.5194 (f) 14.195 
(a) we” (b) —we~ 

6 15 
(a) + 
. Vi-42 V1—9P 

1 4 

(b) 


Al+@¢+2)?7] Ved—n 


cosech(4t) + 3 sech(6r) 
(d) y= 5 


t+1 1-t 
= 2sinh~!{ —— } —3cosh7!| —— 
(e) y sin! ( 5 cos ( 5 


(f) y =3tanh—! (2r + 3) — 2tanh! (3¢ + 2) 


A function, y(t), is given by 


aoe BE a 2d 
a ae 


(a) Find 2 
a) Find —. 
dt 
(b) For which values of t is the derivative zero? 
Find the equation of the tangent to the curve 
ya) = +77 —9 
at the point (2, 27). 


Find values of ¢ in the interval [0, 7t] for which the 
tangent to x(t) = sin 2t has zero gradient. 


Find the rate of change of 


z(t) =2e/? — 77 


when 
(a) t=0 
(b) t=3 


t — 
(c) 6cosh(3t — 1) — 2 sinh = 


(d) —2cosech 4t coth 4t — 9 sech 6¢ tanh 6¢ 


1 3 
(e) + 
1 . 1 2 
I(r) - 2|(da-n) e 
6 6 
® 1—(2t+3)2 1— (t+2)? 


(a) 2% —5t+4 (b) 1,4 
y = 40x — 53 
7/4, 37/4 


(a) 1 (b) —1.5183 
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REVIEW EXERCISES 10 


1 


Find the rate of change of f(x) = 5 + 3x* at x = 2 by 
considering each of the intervals [2 — dx, 2], 

(2, 2 + dx], [2 — 6x, 2 + 6x]. Show that the same 
result is obtained in each case. 

Use a table of derivatives and the linearity rules to 
differentiate the following: 

(a) y= 4x? + 6x—-11 

(b) y = —x* + 2x — 10 

(c) y= xl/3 — xl/4 

(d) y = 5cos 4x — 3 cos 2x 
(e) y = sin! (4x + 3) 


5 
() y=v28- >, 
3x 
(g) y=19/2 + cost 
(h) y= 0? — 144 +8 


@) y=S5Int-+sin4t 


Find the equation of the tangent to y = x2 + 7x — 4 at 
the point on the graph where x = 2. 


Find the rate of change of f(t) = 2cost + 3 sint at 
i= 1, 


Solutions 


(a) 8x+6 

(b) —2x +2 

(c) 1-2/3 — 13/4 

(d) —20sin 4x + 6 sin 2x 
4 


V1 — (4x +3)2 
(f) 2/2x+ _ 


3x3 
(g) 31/2 — sint 
(h) 2t— 14 


5 
(i) 3 +4cos 4t 


5 


At any time f, the voltage, v, across an inductor of 
inductance L is related to the current, i, through the 


: di 
inductor by v = L—. 
dt 


(a) Find an expression for the voltage when 
i = 5 cos wt where q is the constant angular 
frequency. 


(b) Find an expression for the voltage when the 
current takes the form of a sine wave with 
amplitude 10 and period 0.01 seconds. 


Use the shrinking interval method to find the rate of 
change of f(t) = sint at t = 0 by considering the 
interval [0, dr]. [Hint: use the trigonometric identities 
in Section 3.6 and the small-angle approximation in 
Section 6.5.] Use the shrinking interval method to find 
the rate of change of f(t) = sin¢ at a general point. 


Given y(t) = 3 + sin 2r, find the average rate of 
change of y as ¢ varies from 0 to 2. 


5 d 
Explain the essential difference between = and = 
x 


Find y’ for the following functions: 
(a) y=2e 7% + 6cos(t/2) 
(b) y= (1 +2)" 


Using derivatives, estimate the change in y as x 
changes from 1.5 to 1.55 where y = 2e** +.x°. 


k) =t+2e7 
(k) 3 e 
y=I1lx-8 

—0.062 


(a) —S@Lsinot (b) 20007 L cos 2007 t 


—0.3784 
(a) —2e% —3 sn(5) (b) 2r-—4 


4.355 
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INTRODUCTION 


In this chapter we develop the techniques of differentiation introduced in Chapter 10 so 
that rates of change of more complicated functions can be found. We introduce rules for 
differentiating products and quotients of functions. The chain rule is used for differenti- 
ating functions of functions. We explain what is meant by defining functions implicitly 
and parametrically and show how these can be differentiated. The technique of logarith- 
mic differentiation allows complicated products of functions to be simplified and then 
differentiated. Finally derivatives of functions can themselves be differentiated. This in- 
volves the use of higher derivatives which are explained towards the end of the chapter. 


RULES OF DIFFERENTIATION 


There are three rules which enable us to differentiate more complicated functions. They 
are (a) the product rule, (b) the quotient rule, (c) the chain rule. Traditionally they are 
written with x as the independent variable but apply in an analogous way for other in- 
dependent variables. 


The product rule 


As the name suggests, this rule allows us to differentiate a product of functions, such as 
x sinx, t? cos 2t and e* In z. 


Example 11.1 


Solution 
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The product rule states: if 
y(x) = u(x) v@) 


then 
id ee se ‘vu + uv" 
= = — u— =u u 
Gh abe dx 


To apply this rule one of the functions in the product must be chosen to be u, and the 
other, v. Before we can apply the rule we need to calculate u’ and v’. 


Find y’ given 
(a) y=xsinx 
OG) y= te 


(a) y = xsinx = uv. Choose u = x and v = sinx. Then uw’ = 1, v’ = cosx. Applying 
the product rule to y yields 


y =sinx +xcosx 


(b) y =?r’e! = uv. Choose u = ?? and v = e’. Then uv’ = 2f and v' = e’. Applying the 
product rule to y yields 


y = 2te' + te! 


Engineering application 11.1 


Damped sinusoidal signal 


A common function found in engineering is the damped sinusoidal signal. This con- 
sists of a negative exponential function multiplied by a sinusoid. A typical example 
is 

f(t) =e" cost 
The graph of this function is shown in Figure 11.1. This function approximates the 
way a car body reacts when the car drives over a large bump in the road. Fortunately, 
the car shock absorbers ensure that the oscillations reduce in amplitude quite quickly. 

When sketching such a function it can be useful to think of the exponential term, 
and its mirror image around the time axis, providing an envelope that contains the 
signal. When values of the sinusoid are | then the signal touches the positive part 
of the envelope and when values of the sinusoid are —1 then the signal touches the 
negative part of the envelope. 

The rate of change of this signal with respect to time can be found by differentiat- 
ing f(t) using the product rule. To do so we note that f(r) is a product of u(t) = e~°" 
and u(t) = cost. Recasting the formula for differentiating a product in terms of t we 
have 


ft) =u) 


388 Chapter 11 Techniques of differentiation 


11.2.2 The quotient rule 


r?—t+3 
sinx t+2 


‘i j ‘ F , x 
This rule allows us to differentiate a quotient of functions, such as ——, 
—3z 
e 


and ; 
eal 
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The quotient rule states: when 


Example 11.2 Find y’ given 


Solution 


11.2.3 


u(x) 
yx) = WO) 
then 
du dv 
Vv — —_— — 
i dx "\ de vu’ — uv’ 
a De ie 
(a) sin x (b) - (c) e7! 
a => — = Cc = — 
ye oes | Y" P+] 
sinx uU ; : : 
(a) y= — = -.Sou= sinx, v = x and w' = cosx, v’ = 1. Using the quotient rule 
x v 


the derivative of y is found: 


,  XcOsx — sinx 


= Pe) 
- u 
(b) y= aan = Sow=P,v=2r+ Land u’ = 21, v' = 2. Hence, 
/ Qt+1)2#-(7)Q) 2 +1) 
(t+ NP t+ 1 
2t 
() ¥ = Pat Sou = e*%,v =f? +1 andu’ = 2e”, v’ = 2r. Application of the 


quotient rule yields 


le Het 2 (Patel) 
SEDER +1? 


The chain rule 


This rule helps us to differentiate complicated functions, where a substitution can be used 
to simplify the function. Suppose y = y(z) and z = z(x). Then y may be considered as 
a function of x. For example, if y = z> — z and z = sin 3x, then y = (sin 3x)° — sin(3x). 


Suppose we seek the derivative, = Note that the derivative w.r.t. x is sought. 


The chain rule states: 
dy dy ad 


== x 
dx dz dx 
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Example 11.3 


Solution 


Example 11.4 


Solution 


Example 11.5 


Solution 


d 
Given y = 2° where z = x* + 1 find oe 


If y = 2 and z = x* + 1, then y = (x* + 1)°. We recognize this as the composition 
d 
y(z(x)) (see Section 2.3.6). Now y = z° and so ; = 62°. Also z = x2 + 1 and so 
z 
d 
= = 2x. Using the chain rule, 


d d d 
S = x x - = 62x = 12x(x +1)° 


d 
Ify = In@x? + Sx +7), find >. 


We use a substitution to simplify the given function: let z = 3x” -+5x+7 so that y = Inz. 
Since 
y=Inz then a 
Also 
2 dz 
Z=3x°4+5x4+7 and so qe 


Using the chain rule we find 


dy dy dz 
dk dz dk 
= t x (6x + 5) 
Zz 
7 6x+5 
3x2 + 5x+7 


Note that in the previous answer the numerator is the derivative of the denominator. 
This result is true more generally and can be applied when differentiating the natural 
logarithm of any function: 


When y = In f(x) then = = oa 
Find y’ when 
(a) y=In(@® +8) (b) y=In(1 —1) (c) y= 8In(2 — 3r) 
(d) y=Indl +x) (e) y= In(1 + cosx) 


In each case we apply the previous rule. 


5x4 


(a) If y= InG@@ + 8), then y’ = S48" 


1 


11.2 Rules of differentiation 


(b) If y=In(1 —2), then y' = = = 


(c) Ify = 8InQ — 3r), then y’ = 85 
(d) If y = In(1 +x) then y’ = 


(e) If y=In(1 + cosx) then y = = 


Example 11.6 Differentiate 
(a) y= 3esinx 
(b) y = Gr? + 2t — 9)" 
(c) y=v1l+P? 


Solution 


d 7 d. 
(a) Let zx) = sinx. Then y(z) = 3e* so - = 3e*; z(x) = sinx and so :. = cosx. 
U4 


1 1 1 
—t leg fad 
—3 24 24 
— eo a 
1 
l+x 
—sinx sin x 
I+cosx  1+cosx’ 


In these examples we must formulate the function z ourselves. 


The chain rule is used to find a 


dy dy dz Z sin 
—=— x — = 3e*cosx = 3e*"* cosx 
dx dz dx 
2 wo dy 9 & : 
(b) Let z(t) = 34° + 2t — 9. Then y(z) =z”, rs = 107, a = 6t + 2. Using the chain 
Zz 


d 
rule es is found: 
dt 


OF 


dt dz dt 


10z? (6t + 2) = 20(3t + 1)(3r? + 2r — 9)? 


d lL. gi 
(c) Let z¢) =1+2?. Theny = /z = z!”, i. a a 
iS 
rule, we obtain 
d d d 1 t t 
pee ae got = — = 
dt dz dt 2 JZ [Tee 
EXERCISES 11.2 
Use the product rule to differentiate the following (d) y= vxe* 


functions: 


(a) y = sinxcosx 
(b) y =In¢ttant 
(©) y= (0 + 1)e” 


(e) y =e" sintcost 

(f) y = 3sinh 2t cosh 3t 

(g) y= (1+ sin?) tant 

(h) y = 4sinh(t + 1) cosh(1 — r) 


391 


d 
and i = 2t. Using the chain 
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2 Use the quotient rule to find the derivatives of the 5 For which values of ¢ is the derivative of y(t) = e~'t? 
following: zero? 
(a) aii : (b) a 6 Find the rate of change of y at the specified values of f. 
sin x nt 
2r 2 1 
e 3x* + 2x—-—9 : 
accu es ee a = sin — t=1 
(©) B+1 @ e+) os t 
— (72 _1)17 = 
a e4xtl (p Sinh2e Dy pie = 1) cae, 
. 1+e* cosh 3t (Cc) y= sinh(t?) E=2 
1+< l+t+? 
aoe d) y= ——— t=2 
() Te (d) y i=i 
t 
3 Use the chain rule to differentiate the following: (e) y= 7 p=] 
tsint 
(a) (8 +1)! (b) sin? (3t + 2) 
(c) InG? +1) (d) (2t +1)!/2 7 Find the equation of the tangent to 
1 =e (1— t the point (0, 1 
(ce) 3./cos(@x— 1) Ore Peo ies) setae pour gO 2) 
8 Differentiate 
(g) (at +b)”, a and b constants 
4 Differentiate each of the following functions: (a) y= Inx 
(a) y = Ssinx (b) y =5e* sinx (b) y= In2x 
a= sesinx ih = 5 sinx (c) y = Inks, k constant 
ae cate ar (d) y=In(1 +4) 
=(t4+4t =7e 
(ec) y= ( i ) (f) y =7e (©) y= +41) 
sin x 
(g) y= Aone 1 (f) y=In(5 +7sinx) 
Solutions 
| fae 
1 (a) cos? x — sin? x (b) — tant + Int sec? t (e) et(-e +a) +2041 
t (e+ 1)2 
1 ; ; 
(c) e (23 + 342 +2) (d) et ( /x+ (f) 2 cosh 2t cosh 3t — 3 sinh 2r sinh 3t 
2/% (cosh 3r)2 
‘(2 cos?t + sintcost — 1 
(e) e' (2cos“t + sint cos ) ree 
(f) 3[2 cosh 2t cosh 3t + 3 sinh 2r sinh 34] (g) ae cm by ad 
(g) (1 + sinfr) sec? t + sint 
‘ 3 (a) 3007(13 + 1)” 
(h) 4[cosh(t + 1) cosh(1 — f) ‘ 
—sinh(t + 1) sinh(1 —1f)] (b) 9sin*(3t + 2) cos(3t + 2) 
2x 
2 (a) —cosec? x 
‘ (©) x2+1 
(b) Int sec* t — (tant) /t @) Qt+ 7? 
ny 3 sin(2x — 1) 
—3 sin(2x — 
e2! (23 — 372 4.2) (¢) ————— 
(c) ———_,—. J cos(2x — 1) 
(+1? ' 
f) -(t+1)7 
@ —3x* — 49 + 27° + 6x +2 : ai 


G3 + 1)2 (g) an(at + by"! 
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4 (a) S5cosx Be 0,2 
(b) Se*(cosx + sinx) 6 (a) —0.5403 (b) 0 (c) 109.2 
(c) 5e%"* cos x (d) 2 (e) —2.0742 
(d) S5e*(cosx + sinx) Woy =2x41 
(e) 15(03 + 4r)'4 (31? + 4) I 1 1 
(f) —42te-3” ar mS ee 
4+ cosx (a) oS (e) 4 Tcosx 
(8) (4cosx + 1)2 1+¢ 34+4¢ 5+7sinx 


11.3.1 


Example 11.7 


Solution 


PARAMETRIC, IMPLICIT AND LOGARITHMIC 
DIFFERENTIATION 
Parametric differentiation 


In some circumstances both y and x depend upon a third variable, t. This third variable 
is often called a parameter. By eliminating f, y can be found as a function of x. For 
example, if y = (1 +1)? and x = 2r then, eliminating t, we can write y = (1 +.x/2)?. 


d 
Hence, y may be considered as a function of x, and so the derivative ;. can be found. 


However, sometimes the elimination of ¢ is difficult or even impossible. Consider the 
example y = sint +t, x = t? +e’. In this case, it is impossible to obtain y in terms of x. 


The derivative 7 can still be found using the chain rule. 
dy = 
dv 5 = <== 


d 
Finding ;. by this method is known as parametric differentiation. 


2 dy 
Given y = (1+ 1t)°, x = 2r find a 


By eliminating f, we see 


and so 
dy x 
bc aay (pee een 
dx = 2 


d 
Parametric differentiation is an alternative method of finding = which does not require 
the elimination of ft. dx 


Il 
~) 


dy dx 
—-—=2(1+t — 
dt (t+) dt 
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Example 11.8 


Solution 


Example 11.9 


Solution 


11.3.2 


Example 11.10 


Using the chain rule, we obtain 


dy dy /dx 2(1+1f) x 
— = => 1 ‘4 1 a 
dx dt / dt 2 a " 2 


d 
Given y= e' + t,x =?" +1, find *. using parametric differentiation. 


dy, dx 
dt —< dt 
Hence, 


dy dy /dx e+1 
dx dt/ dt 2t 


In this example, the derivative is expressed in terms of t. This will always be the case 
when ¢ has not been eliminated between x and y. 


d 
If x =sint + cost andy = 12 —1 + 1 find 7 = 


=2t-1 = cost — sint 
dt dt 
Hence, 
dy 2-1 
dx cost — sint 
dy -l 
When t = 0, — = — = — 
dx 1 


Implicit differentiation 


Suppose we are told that 
y +x =S5sinx+ 10cosy 


Although y depends upon x, it is impossible to write the equation in the form y = f(x). 
We say y is expressed implicitly in terms of x. The form y = f(x) is an explicit 


expression for y in terms of x. However, given an implicit expression for y it is still 
d d 
possible to find a Usually *. will be expressed in terms of both x and y. Essentially, 


the chain rule is used when differentiating implicit expressions. 


d 
When calculating *. we need to differentiate a function of y, as opposed to a function 


d 
of x. For example, we may need to find es (y*). 


Find 
(a) £ ot) (b) ei ey 
dx dx” 
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Solution (a) We make a substitution and let z = y* so that the problem becomes that of finding 


Z ; ; 
es Now, using the chain rule, 


dz dz dy 


=—x 
dx dy dx 


d 
If z = y* then es 4y° and so 
dy 


We conclude that 


God 3 dy 
as =43 
ot 
(b) We make a substitution and let z = y~? so that the problem becomes that of finding 
dz 
dx’ 4 
Ifz=y-3 then — = —3y~4 and so 
dy 
d:.. 9 dz dz dy _4dy 
—_— i — = = 3 
a Fa a a y 


Example 11.11 Find <<ny). 


: ._d d 1 
Solution We let z = Iny so that the problem becomes that of finding - If z = Iny then 7 =- 
y y 


and so using the chain rule 


dz dz dy = tldy 
x = 
dt dy dt = ydt 


we conclude that 


Examples 11.10 and 11.11 illustrate the general formula: 


d _df_ dy 
qe may x Ae 


This is simply the chain rule expressed in a slightly different form. 
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Example 11.12 


Solution 


Example 11.13 


Solution 


Example 11.14 


Solution 


nd 6) 
ind —(y). 
ac 


We have 


Given 
x 4 y= 1 =f y 
dy 


find —. 
dx 


Consider differentiation of the l.h.s. w.r.t. x. 
d , d_,. dy dy 
ee yas = 37° 4 
a +y) Ge Ler sere 
Now consider differentiation of the r.h.s. w.r.t. x. 


Stays ee 
ZtM=7M+ 70) = 70") 


d d 
We note from the formula following Example 11.11 that a (y°) =3y m So finally, 


d d 
34+ > a3 
from which 
dy _ 3x7 
dx 3y2-1 


d 
Note that +. is expressed in terms of x and y. 


d 
Find = given 
dx 


(a) hny=y-¥ 
(b) xys =e = e2* 


(a) Differentiating the given equation w.r.t. x yields 


ldy dy 
-—=——2x 
ydx dx 

from which 
dy 2xy 


dx y-1l 
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: d 2.3 : 

(b) Consider rp (x-y”). Using the product rule we find 
d 33 d 2,3 2d; 3 ody 
— = —(y?) = 2xy? +.°3y = 
eat Geo a0”? so ia age) ee 
. d , dz ; 

Consider —(e”). Let z = e” so — = e’. Hence, 
dx dy 

dz _ dz dy = ot 

dx dydx dx 

So, upon differentiating, the equation becomes 


d dy 


Ce —_ 
ae 


ly : ’ 
2 Bohs 23 aa ees, = Ie” 
xy +x Lae re 
So, 
dy 99 ; 2x 3 
—(3 —e’) = 2e* — 2xy 
a xy” — e*) = 2e xy 
from which 


dy _ 2e?* — 2xy? 
dx 3x2y? — e” 


11.3.3 Logarithmic differentiation 


The technique of logarithmic differentiation is useful when we need to differentiate a 
cumbersome product. The method involves taking the natural logarithm of the function 
to be differentiated. This is illustrated in the following examples. 


d 
Example 11.15 Given that y = 72(1 — 1)8 find a 
Solution The product rule could be used but we will demonstrate an alternative technique. Taking 
the natural logarithm of both sides of the given equation yields 
Iny = In(#?(1 — 18) 
Using the laws of logarithms we can write this as 


Iny = Int? +In(1 —1)® 
= 2Int+8In(1 —1) 


Both sides of this equation are now differentiated w.r.t. t to give 
“dny) = <@iny) + £m 9) 
io ae ao 


d 
The evaluation of a (In y) has already been found in Example 11.11, and so 


ldy 2 8 


ydt ot i1-t 
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Example 11.16 


Solution 


Example 11.17 


Solution 


Hence 
dy _ 2 8 
eo Le 


Finally, replacing y by ¢7(1 — t)® we have 


dt t 1-t 


Given 
y= (1 + x) 


dy 
find —. 
eae 


The product rule could be used. However, we will use logarithmic differentiation. Taking 
the natural logarithm of the equation and applying the laws of logarithms produces 


Iny = n@P’(1 +-x)’e*) = Inx’ + In(1 +x)? + Ine™ 
Iny = 3Inx+91n(1 + x) + 6x 


This equation is now differentiated: 


dy 3 9 
= 6 
dx p(2+ +6) 


= 3x7(1 + x)’e™ + 9x7 (1 + x)8e™ + 6x7(1 + x)’e™ 


Ify = J/1 4+? sin’ t find y’. 


Taking logarithms we find 


Iny = In(V1+?? sin’ rt) 
= Inv 1+? + In(sin’ 1) 


1 
=F In(1 +77) + 21n(sinr) 


Differentiation yields 


1 1 2t cost 


yo Dies” ane 


t t 
y’ => +2cots) =VJ1 +P sin?o( 5 rs + 2eott) 


14+? t2 


1 


1 
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b ae 

(b) = Giny) 
(d) <e) 
(f) —(x*+y) 


iy oy 
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(d) x = cos 2t y=3t 
3 
(e) er y=ew 


t 


(f) x=e’ -e7% y=e+e7 


5 Use logarithmic differentiation to find the derivatives 
of the following functions: 


1 _. 
(a) y=x4e" (b) y=-e* 
xX 


(ce) z=P(1 +1)? 


7 sin* x 


(d) y=e*sinx 
(e) y=x 


6 Use logarithmic differentiation to find the derivatives 
of the following functions: 


(a) z=r4(1 —1)9(2 +1)4 
(1 +.x2)3e7* 
= me 
(c) x= (14 13(2 414841) 
_ (sint1)(2 —#?)4 
~  (L+er)6 


e* sinx 


(d) y 
(e:) y=x° 


d 
7 Ifx=t+r? +2 andy = sin2r, find = when 1 = 1. 


Find each of the following: 
@ a) @ 26%) 
a A — 
dx dx” 
@ 26% © 6) 
= Pees 
dx” dx 
d 
@ 40+) 
Find each of the following: 
d 
—(e) 
(a) a 
(c) : (cos 2y) 
Cc — (COS 
re y 
d d 
(e) anv? a 
(g) ei (x+y)) . 
— (COS(X 
ys y 
(i) c (In7 
1 rr n7y) 
54.3 
Find — given 
dx 
(a) 2y? — 3x3 =x+y 
6) fPtvyxart+y 
(c) /2x+3y=1+e 
eJ/l+x 
(d) y= —3— 
xX 
(e) 2xy* = x3 + 3x7? 
(f) sinaw+y)=1+y 
(g) In@? + y?) = 2x -3y 
(h) ye?” = x7er/? 
wah) 2. 
Find —, given 
dx 
(a2) x=? y=14P 
(b) x =sint y=e 
() x=(14+0? y=1+h 
Solutions 
d 
(a) 5x4 (b) Syt= 
dy dy 
5yt = d) 2y— 
(c) 5y 7 (d) © i 


Given x =1+7° andy =1—?, find: 

(a) the rate of change of x w.r.t. f when t = 2 
(b) the rate of change of y w.r.t. t when t = 1 
(c) the rate of change of y w.r.t. x when t = 1 
(d) the rate of change of x w.r.t. y when tf = 2 
Find the equations of the tangents to 


y =x" + 6y 


when x = 4. 
Given the implicit function 3x* + y? = y find an 


: dy 
expression for —. 
dx 


& | 


(e) so (f) 


d 
(g) Qy+ ea 
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d d 
2 (a) eo (b) cosy 


dy dy 
—2 sin 2y— d) —3e37 = 
(c) —2sin2y— — (d) —3e7 9 


1 _ijdy dy 
(e) a ay (f) 1+ a 
(2) —si i 
g) —sinQx+y){1+ a 
1 dy — lay 
(h) y dx (i) y de 
1+ 9x2 
3 (a) i 
(b) (4x./x — 1)/y 


(1 — 6y?, /y) /x 
(c) s(etyiet 3y — 1) 


(d) : +1 
2 Creer - 
3x? + 3y? — 2y4 
2xy(4y? — 3) 
cos(x + y) 
1 —cos(x+ y) 


(e) 


(f) 


2x? + y? — x) 


® 3243242 


e/2—29) x(x + 4) 


h 
h) 2(1 + 2y) 


ée! t d 

mm @) 34/2 ) cost (©) (5) 
= pdt 2 ee 

2 sin 2t Ce ef +e# 


(d) 


HIGHER DERIVATIVES 


(a) (4x3 +x*)e* 


auf it 4 
(b) =e (+3) 
x x 


3 9 
3 9({? 
(c) Pd+t) ( + 3] 


(d) e* sinx(1 + cotx) 


7 
(e) x! ants(Z +4 cts] 
x 


(c) ae 5 
eee) Tey hang Sg 


3 

(e) xe* uns( +14 cts] 
x 

—0.1387 


(a) 192 (b) —2 (c) -; 


dx 4 dx 

(d) — = —3t"; whent = 2, — = —48 
dy dy 

_— 4x+6 4x4 24 

is. ee. 

— 6x 

oe T= 32 


The derivative, y’, of a function y(x) is more correctly called the first derivative of y 
w.r.t. x. Since y’ itself is a function of x, then it is often possible to differentiate this too. 


The derivative of y’ is called the second derivative of y: 


ides oo 
secon erivative oO = —] — 
- dx \ dx 


Oe ee d’y 
which is written as ae or more compactly as y”. 
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Example 11.18 If y(x) = 3x? + 8x +9, find y’ and y’. 
Solution y =6x4+8 y= <6 +8) =6 
Example 11.19 If y(t) = 2 sin 3r, find y’ and y’. 
Solution y =6cos3t y’ = —18sin3¢ 


The first and second derivatives w.r.t. time, ¢, are also denoted by y and y. 


Example 11.20 Find y” given 


lt+xy=xr+y 


Solution The equation is differentiated implicitly to obtain os 


0 +y+xy = 2x+4+2yy’ 


—_ 2x — y 
~ x—2y 
The quotient rule is now used with u = 2x — y and v = x — 2y. The derivatives of u and 
v are 
du a F du saa 
de ta 
Then, 
9 @=—2)2—y¥) — Gxe—y)0 = 2y) 
* (= 2y? 
This is simplified to 
» _ 3xy' — 3y 
~ @=2yyP 
; 2x —y ee eae 
Replacing y’ by 5 and simplifying yields 
x — 4y 
yo 6(x7 — xy +y’) 
Gx By 


Note that it is possible to simplify this further by observing that x7 + y? = 1 + xy as 
given. Therefore, 
, OU +xy—xy)_ 6 
= 2y) Ge 2y)8 


Just as the first derivative may be differentiated to obtain the second derivative, so the 
second derivative may be differentiated to find the third derivative and so on. A similar 
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Example 11.21 


Solution 


Example 11.22 


Solution 


2 Sotelo: Vict, a 
notation is used. The third derivative is written a3 or y” or y®), The fourth derivative 
ee eee és re . 
is written — or y’’ or y. The fifth derivative is written as or y’ or y®. 


dat 


Find the first five derivatives of z(t) = 2¢7 + sint. 


Zz = 60? + cost 2’ = sint 
z’ = 12t — sint Zz’ =cost 
Zz” =12—cost 


Calculate the values of x for which y’ = 0, given y = x* — x°. 


y=x-x y =4 — 3x y’ = 12° — 6x 


Putting y” = 0 gives 
12x? — 6x = 0 and so 6x(2x — 1) =0 


Hence 


The first and second derivatives can be used to describe the nature of increasing and 
decreasing functions. In Figure 11.2(a, b) the tangents to the curves have positive gra- 
dients, that is y’ > 0. As can be seen, as x increases the value of the function increases. 
Conversely, in Figure 11.2(c, d) the tangents have negative gradients (y’ < 0) and as 
x increases the value of the function decreases. The sign of the first derivative tells us 
whether y is increasing or decreasing. However, the curves in (a) and (b) both show y 
increasing but, clearly, there is a difference in the way y changes. 

Consider again Figure 11.2(a). The tangents at A, B and C are shown. As x increases 
the gradient of the tangent increases, that is y’ increases as x increases. Since y’ increases 
as x increases then the derivative of y’ is positive, that is y’ > 0. (Compare with: y 
increases when its derivative is positive.) So for the curve shown in Figure 11.2(a), y’ > 0 
and y” > 0. 

For that shown in Figure 11.2(b) the situation is different. The value of y’ decreases 
as x increases, as can be seen by considering the gradients of the tangents at A, B and 
C, that is the derivative of y’ must be negative. For this curve y’ > 0 and y” < 0. 

A function is concave down when y’ decreases and concave up when y’ increases. 
Hence Figure 11.2(a) illustrates a concave up function; Figure 11.2(b) illustrates a con- 
cave down function. The sign of the second derivative can be used to distinguish between 
concave up and concave down functions. 

Consider now the functions shown in Figure 11.2(c) and Figure 11.2(d). In both (c) 
and (d), y is decreasing and so y’ < 0. In (c) the gradient of the tangent becomes increas- 
ingly negative; that is, it is decreasing. Hence, for the function in (c) y’ < 0. Conversely, 
for the function in (d) the gradient of the tangent is increasing as x increases, although 
it is always negative, that is y” > 0. So for the function in (c) y’ < 0 and y” < 0; that 


Figure 11.2 
(a) y is concave up (y’ > 0, y’” > 0); (b) y is concave down (y’ > 0, y” < 0); (c) y is concave down (y’ < 0, y” < 0); (@ y 
is concave up (y’ < 0, y” > 0). 
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11.4 Higher derivatives 


is, the function is concave down. For the function in (d) y’ < 0 and y” > 0; that is, the 
function is concave up. In summary, we can state: 


When y’ > 0, y is increasing. When y’ < 0, y is decreasing. 
When y’ is increasing the function is concave up. In this case y” > 0. 
When y’ is decreasing the function is concave down. In this case y” < 0. 


An easy way of determining the concavity of a curve is to note that as the curve is 
traced from left to right, an anticlockwise motion reveals that the curve is concave up. 
A clockwise motion means that the curve is concave down. 

As will be seen in the next chapter, higher derivatives are used to determine the loca- 
tion and nature of important points called maximum points, minimum points and points 
of inflexion. 


Calculate dy sii dy given 4 Find values of t at which y” = 0, where 
dt dr? 
—: Bp 712 

oat oS 

(b) y=2P —-7 +1 3. 2 

(c) y= sin2t 5 Determine whether the following functions are 

(d) y =sinkt k constant concave up or concave down. 

= a3) 2 
(e) y=2e¥-P +1 @y= ®) y=? © y=l+t-? 
(f) = a : . < ay 
t+1 6 Determine the interval on which y = f°” is 

(g) y=4cos = (a) concave up, 

th) yao (b) concave down. 

(i) y =sinh4t 7. Evaluate y” at the specified value of f. 

. . 2 

Gj) y=sin*t (a) y=2cost—?r? t=1 
2) If sint + cost 

b) y= ———_. t=7/2 
y=2x43x2— 12x41 ey 2 ( 
=(1+re t=0 
find values of x for which y” = 0. ie) =e 
d ae 2_\2 
Moir —372 +2 find 8 Find — givenxy+x° =y". 
dr ; ‘ dx 
d°y dy 3 at % 
a) — b) + 9 Find — whenx? + — =7? 4x2. 
(@) dt? ©) dr3 dt ee ‘ 
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Solutions 
Mo) 2+1,2 mg! 
(b) 6¢? — 24, 12¢ -—2 . 
(c) 2cos2t, —4 sin 2r 5 (a) concave up 
(d) kcoskt, —k? sinkt (b) concave up 
(e) 6e*" — 25; 18e*" —2 (c) concave down 
1 2 
; 6 (a) concave up on (0, co 
© 2a ae (a) pon (0, 00) 
(g) —2sin(t/2), —cos(t/2) (b) concave down on (—oo, 0) 
(hy f+ 1), +2) 7 @) -308 =) -5 3 
(i) 4cosh4r, 16 sinh 4 : : 2 
G) 2sintcost, 2cos 2t 8 10y" — 10x" — 10yx 
1 (QQy — x)3 
= -_) 5 x +2 +4 3x2#? 


32 +t — 2xt3 
3 (a) O+1 (b) 6 3x°t* +t — 2xt 


REVIEW EXERCISES 11 


1 Differentiate each of the following functions: @) 2 +3xy+y =5 
_ 3,3 2 
(a) y=sin(5 + x)? a a= P 
si 5+ 3t 2-t d d 
(b) y=ot oy ifx= ~ and y = = find | and oa 


(c) y= (4x+7)9 


: dy 
(a) s=9 én 4 Ifx=4(1+cos@) and y = 3(6 — sin@) find 4s 


Ax d a 
(e) j= 5 If x = 3.cos?@ and y = 2sin26 find — and —. 
+11 dx dx? 
(f) y=x* tanx 6 Show that y = e~** sin 8x satisfies the equation 
cos 3x y” + 8y’ + 80y = 0. 
(9) y= 3G ; ; 
x 7 Differentiate y = x*. 


(h) y=e* cos 5x 
(i) y = Incos4x 8 Given y = x°e** find 


dy dy dy 
G) y= sin 2t cos 2t (a) dx (b) de (©) dx? 
1 dx 
(kK) y= ea 9 Use logarithmic differentiation to find a given 


(a) x = fe’ sint 

(b) x = fe cos 3t 
3 3 ‘ 

x? sin 2x (c) x =fe* sin 4t cos 3t 
cos x 


d 
2 Find " in each of the following cases: 


(a) y= 
10 Show that if y(t) = Asinwt + Bcos wt, where @ is a 


— ya-Xx 
(b) y=xre* tanx constant, then 


xe 


(c) y= y' +o’y =0 


sinx 


Solutions 


(b) 
(c) 
(d) 


(e) 
(f) 


(g) 
(h) 
(i) 
Gi) 
(k) 


(a) 


(b) 
(c) 


2(5 +x) cos(5 +x)? 
2cos xe? Sin 
20(4x + 7)4 
3x4 cos 3x + 4x3 sin 3x 
e** (4x3 — 3x2 + 44) 
(x3 + 11)? 


sec? x + 2x tan x 


x2 


—3x sin 3x — 2cos 3x 

x3 
—e*(5sin5x + cos 5x) 
—4 tan 4x 


2 cos 4t 
—2x 


cos x(2x3 cos 2x + 3x? sin 2x) + x sin 2x sin x 


cos? x 
which simplifies to 2x? cos x + 6x” sin x 


2 


e* (ea sec? x — x° tanx + 3x? tan x) 


e* (5x sinx + sinx — xcos.x) 


sin? x 
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2x + 3y 
3x + 2y 
9x? + 2y? 
1 — 4xy 


(d) — 


3(1 — cos6) 
4sin0 


,0 


x*(Inx + 1) 

(a) e2* (2x3 + 3x2) 

(b) 2xe?*(2x? + 6x + 3) 

(c) 2e?*(4x> + 18x” + 18x + 3) 

(a) e(tcost + (t+ 1) sinr) 

(b) —e~'((t? — 2t) cos 3 + 32? sin 3¢) 


4cos 4t 
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2 
(c) t2e* sin 4t cos x : 423 


3 4 


sin 4t cos 3t 
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INTRODUCTION 


In this chapter the techniques of differentiation are used to solve a variety of problems. 
It is possible to use differentiation to find the maximum or minimum values of a func- 
tion. For example, it is possible to find the maximum power transferred from a voltage 
source to a load resistor, as we shall show later in the chapter. Differentiation is also 
used in the Newton—Raphson method of solving non-linear equations. Such an equation 
needs to be solved to calculate the steady-state values of current and voltage in a series 
diode-resistor circuit. 

Finally we show how vectors can be differentiated. This forms an introduction to the 
important topic of vector calculus which is discussed in Chapter 26. 


MAXIMUM POINTS AND MINIMUM POINTS 


Consider Figure 12.1. A and B are important points on the curve. At A the function 
stops increasing and starts to decrease. At B it stops decreasing and starts to increase. 
A is a local maximum, B is a local minimum. Note that A is not the highest point on 
the curve, nor B the lowest point. However, for that part of the curve near to A, A is 
the highest point. The word ‘local’ is used to stress that A is maximum in its locality. 
Similarly, B is the lowest point in its locality. 
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(a) (b) 


Figure 12.1 
The function y has a local maximum at A and a local minimum at B. 


In Figure 12.1(a) tangents drawn at A and B would be parallel to the x axis and so 


at these points * is zero. However, in Figure 12.1(b) there are corners at A and B. It is 


d 
impossible to draw tangents at these points and so . does not exist at these points. 
Hence, when searching for maximum and minimum points we need only examine 
., dy. d 
those points at which ; 1s zero, or *. does not exist. 


i. , s : 
Points at which an is zero are known as turning points or stationary values of the 
function. 


At maximum and minimum points either: 


.. dy : 
i) — does not exist, or 
(i) A 


.. dy 
ee aN) 
ay) dx 


Aer eae ; is dy 
To distinguish between maximum and minimum points we can study the sign of ac on 


either side of the point. At maximum points such as A, y is increasing immediately to the 
d 
left of the point, and decreasing immediately to the right. That is, 7 is positive immedi- 
d 
ately to the left, and *. is negative immediately to the right. At minimum points such as 
B, y is decreasing immediately to the left of the point, and increasing immediately to the 


d d 
right. That is, = is negative immediately to the left, and = is positive immediately to the 
dx dx 


right. This so-called first-derivative test enables us to distinguish maxima from minima. 
This test can be used even when the derivative does not exist at the point in question. 


The first-derivative test to distinguish maxima from minima: 


d d 
To the left of a maximum point, a is positive; to the right, = is negative. 


d d 
To the left of a minimum point, - is negative; to the right, ~ is positive. 
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Example 12.1 Determine the position and nature of all maximum and minimum points of the following 


functions: 
@y=x 
(b) y= —? +141 
eC x 
=—+—.—-2 1 
(c) y 3 + 5 xX + 
(dq) y= It 


Solution (a) 


(b) 


wae OY 
If y = x’, then by differentiation = 2x: 


Recall that at maximum and minimum points either 


. dy 
(i) — does not exist, or 
dx 


d 
(ii) *. = 0. We must check both of these conditions. 


The function 2x exists for all values of x, and so we move to examine any points 


where ac = 0. So, we have 
dx 


dy _ 
a= 


2x =0 


The equation 2x = 0 has one solution, x = 0. We conclude that a turning point exists 
at x = 0. Furthermore, from the given function y = x”, we see that when x = 0 the 
value of y is also 0, so a turning point exists at the point with coordinates (0, 0). To 
determine whether this point is a maximum or minimum we use the first-derivative 


test and examine the sign of = on either side of x = 0. To the left of x = 0, x is 


clearly negative and so 2x is also negative. To the right of x = 0, x is positive and 
so 2x is also positive. Hence y has a minimum at x = 0. A graph of y = x* showing 
this minimum is given in Figure 12.2. 


If y = -t? +14 1, then y = —2r + 1 and this function exists for all values of f. 
Solving y’ = 0 we have 


1 
—2t+1=0 aaa 


1 ie 
We conclude that there is a turning point att = 2 The y coordinate here is — ( ;) + 


1 1 1 
5 +1= 17. We now inspect the sign of y’ to the left and to the right of f = =. A 


little to the left, say at t = 0, we see that y’ = —2(0) + 1 = 1 which is positive. A 
little to the right, say at t = 1, we see that y’ = —2(1) + 1 = —1 which is negative. 


Hence there is a maximum at the point (5. 17), 


A graph of the function is shown in Figure 12.3. 
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Figure 12.2 Figure 12.3 


The function y has a minimum at x = 0. 


: . 1 
The function y has a maximum at t = 2" 


3 2 


(c) Ify= = + 7 —2x+1, then y = x* +x —2 and this function exists for all values 


(d) 


3 
of x. Solving y’ = 0 we find 


v+x-2=0 
(x—-1)@+2) =0 
x=1,-2 
There are therefore two turning points, one at x = | and one at x = —2. We consider 


each in turn. 


At x = 1, we examine the sign of y’ to the left and to the right of x = 1. A little way 
to the left, say at x = 0, we see that y’ = —2 which is negative. A little to the right, 
say at x = 2, we see that y’ = 27 + 2 — 2 = 4 which is positive. So the point where 
x = | isa minimum. 

At x = —2, we examine the sign of y’ to the left and to the right of x = —2. A little 
way to the left, say at x = —3, we see that y = (—3)* + (—3) — 2 = 4 which is 
positive. A little to the right, say at x = —1, we see that y’ = (—1)*+(—1)—2 = —2 
which is negative. So the point where x = —2 is a maximum. 

A graph of the function is shown in Figure 12.4. 


Recall that the modulus function y = |t| is defined as follows: 


eee —t t<0 
Pa) eg ee) 


A graph of this function was given in Figure 10.13(a) and this should be looked at 


d d 
before continuing. Note that ae —1 for t negative, and nage +1 for t positive. 
The derivative is not defined at t = 0 because of the corner there. There are no points 
d 
when 7 = 0. Because the derivative is not defined at t = 0 this point requires 


d d 
further scrutiny. To the left of t = 0, ad < 0; to the right, ed > Oand sot =Oisa 
minimum point. dt dr 


Figure 12.4 
The function y has a maximum at x = —2 
and a minimum at x = 1. 
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Example 12.2 


Solution 


Figure 12.5 Figure 12.6 

5, ds y ET Des ‘ 
The derivative — decreases on The derivative — increases on passing 
passing through a maximum point. through a minimum point. 


Rather than examine the sign of y’ on both sides of the point, a second-derivative 
test may be used. On passing through a maximum point y’ changes from positive to 0 
to negative, as shown in Figure 12.5. Hence, y’ is decreasing. If y” is negative then this 
indicates y’ is decreasing and the point is therefore a maximum point. Conversely, on 
passing through a minimum point, y’ increases, going from negative to 0 to positive (see 
Figure 12.6). If y” is positive then y’ is increasing and this indicates a minimum point. 

So, having located the points where y’ = 0, we look at the second derivative, y”. Thus 
y” > O implies a minimum point; y” < 0 implies a maximum point. If y” = 0, then we 
must return to the earlier, more basic test of examining y’ on both sides of the point. In 
summary: 


The second-derivative test to distinguish maxima from minima: 

If y =O and y” < O ata point, then this indicates that the point is a maximum 
turning point. 

If y = 0 and y” > 0 at a point, then this indicates that the point is a minimum 
turning point. 

If y = 0 and y” = 0 at a point, the second-derivative test fails and you must 
use the first-derivative test. 


Use the second-derivative test to find all maximum and minimum points of the functions 
in Example 12.1. 


(a) Given y = x* then y' = 2x and y” = 2. We locate the position of maximum and 
minimum points by solving y’ = 0 and so such a point exists at x = 0. Evaluating 
y” at this point we see that y”(0) = 2 which is positive. Using the second-derivative 
test we conclude that the point is a minimum. 


(b) Given y = —#?+1+4+ 1 then y = —2t+1 and y’ = —2. Solving y = 0 we find 


1 
= 5 Evaluating y” at this point we find (5) = —2 which is negative. Using 


— i : 
the second-derivative test we conclude that tf = 5 is a maximum point. 
a8 


Be: 
; + 5 ~ 2x + 1, then y’ = x+x—2andy’ = 2x+1.y¥ =0 
atx = | andx = —2. Atx = 1, y’ = 3 which is positive and so the point 


(c) Given y = 
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is a minimum. At x = —2, y’ = —3 which is negative and so the point is a 
maximum. 
—1 t<0 
(d) y= 1 t>0 


undefined at t = 0 


Since y’(0) is undefined, we use the first-derivative test. This was employed in 
Example 12.1. 


Engineering application 12.1 


Risetime for a second-order electrical system 


Consider the electrical system illustrated in Figure 12.7. The input voltage, v,, is 
applied to terminals a—b. The output from the system is a voltage, v,, measured across 
the terminals c—d. The easiest way to determine the time response of this system to 
a particular input is to use the technique of Laplace transforms (see Chapter 21). 

When a step input is applied to the system, the general form of the response de- 
pends on whether a quantity called the damping ratio, ¢, is such that ¢ > 1,¢ = l or 
¢ < 1. The quantity ¢ itself depends upon the values of L, C and R. This is illustrated 
in Figure 12.8. If the damping ratio, ¢ < 1, then v, overshoots its final value and the 
system is said to be underdamped. For this case it can be shown that 


1 t 
v, =U — ue (cos(p + ey for t>0 (12.1) 
where U is the height of a step input applied at t = 0, and 
as (12.2) 
a= — : 
Yk, 
1 
i resonant frequency (12.3) 
VLC 
B =,/a2 — a natural frequency (12.4) 


Engineers are often interested in knowing how quickly a system will respond to a 
particular input. For many systems this is an important design criterion. One way of 
characterizing the speed of response of the system is the time taken for the output to 
reach a certain level in response to a step input. This is known as the rise time and 
is often defined as the time taken for the output to rise from 10% to 90% of its final 
value. However, by looking at the underdamped response illustrated in Figure 12.8 it 
is clear that the time, ¢,,, required for the output to reach its maximum value would 
also provide an indicator of system response time. As the derivative of a function is 
zero at a maximum point it is possible to calculate this time. 
Differentiating Equation (12.1) and using the product rule, 


eae IG) are rae Ee a sin(ft) 
ae Sai (u Ue (coscen + rae )) t>0 


=(0-— Dee cos(Bt)) — uo( 


dt 


e “a feed 
B 
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= —U(—ae™ cos(ft) — eB sin(Br)) 
-u( —ae “a sin( Br) ie e “ap | 


B B 
=e ; a? sin(Br) 
= —Ue —acos(Bt) — 6 sin(Bt) — ioe +acos(Br) 
oP. 
= Ue ™ | B + — )sin(Bt) 
B 
d 
Yo — 0, Hence 
dt 
2 2 
ven (PSR) sin(Bt) = 0 
B 
Volt) 
Gall 
U 

a R L c 

| : . : d : 
a 

[oe tn i 

b d 

A Figure 12.8 
Figure 127 ; Response of a second-order system to a step 
A second-order electrical system. input. 


This occurs when sin(6t) = 0, which corresponds to t = kmt/B,k = 0,1,2.... 
It is now straightforward to calculate t,,, once 6 has been calculated, using Equa- 
tions (12.2), (12.3) and (12.4) for particular values of R, L and C. You may like to 


show that the turning point corresponding to k = | is a maximum by calculating 
D 


re ° and carrying out the second-derivative test. 
It is possible to check whether or not a system is underdamped using the following 
formulae: 


R 
— a damping ratio (12.5) 


c 


L 
K.= Be critical resistance (12.6) 


Let us look at a specific case with typical values L = 40 mH, C = | pk, R = 200. 
Using Equations (12.5) and (12.6), we find 


R, me SS = 
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and therefore the system is underdamped because ¢ < 1. 
Also, 


= 5000 


1 

oO = ——— 
NUGE: 
Ro 200 

DL DKA iO 


B = ,/@? — a? = ¥ 5000? — 2500? = 4330 


Finally, 


= 2500 


fo, = 


<i T 
m > 4330 
We conclude for this case that the risetime is 726 us. 


= 7.26 x 10-* = 726 ps 


Engineering application 12.2 


Maximum power transfer 


Consider the circuit of Figure 12.9 in which a non-ideal voltage source is connected 
to a variable load resistor with resistance R, . The source voltage is V and its internal 
resistance is R,. Calculate the value of R, which results in the maximum power being 
transferred from the voltage source to the load resistor. This is an essential piece of 
information for engineers involved in the design of power systems. Often an impor- 
tant design consideration is to transfer the maximum amount from the power source 
to the point where the power is being consumed. 


Non-ideal 
voltage ! 
source! 
Figure 12.9 
Meaasas Maximum power transfer occurs when Ry = Rg. 
Solution 


Let i be the current flowing in the circuit. Using Kirchhoff’s voltage law and Ohm’s 
law gives 
WW = UR ae A) 
Let P be the power developed in the load resistor. Then, 
Vk 


eo = — 
b (Ry +R)? 
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Clearly P depends on the value of R, . Differentiating w.r.t. R, and using the quotient 
rule, we obtain 
ie y2 1(Rg + R,)* — R,2(R, + R,) 
dR, (Ry + R,)* 
=e (Rs +R) — 2R, 
(Roo ee) 
= Rs —R, 
(Ro + R,)? 


OP : : ene 
Equating aR, to zero to obtain the turning point gives 
L 


2 Rs — Ay = 
(Rs +R)? 
that is, 
Ry, = Rg 
So a turning point occurs when the load resistance equals the source resistance. We 
need to check if this is a maximum turning point, so 
dP 2 Rs — RK. 


dR, = (Rs + 8,3 
ee y2a is +R,)? — (Ry — R,)3(Rs + R,)? 
dR; (Rs + R,)° 
EE Vy —(Rs +R) oa. 3(Rg =i) 
(R, + R,)* 
ae 2R,, — 4B, 
(Ry + R,)* 
= 2V? (Ry = 2Rs) 
(Rs +R,)* 


When R, = Rg, this expression is negative and so the turning point is a maximum. 
Therefore, maximum power transfer occurs when the load resistance equals the 
source resistance. 


EXERCISES 12.2 


1. Locate the position of any turning points of the 2 Locate and identify all turning points of 


following functions and determine whether they are 
maxima or minima. 
(a) y= x7 —x+6 
(c) y=x-1 

(ec) y= x — 12x 


(b) y= 2x27 +3x41 
(d) y=1+x—2x2 
() y=7+3x 


ra 9 
(a) Nes ee ae 


(b) y=te™ 
(Cc) y= xt — 2x2 
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Solutions 
1 2 1 
1 (a) & sz). minimum 2 (a) (2 5) maximum, (4 3) minimum 
(b) (—0.75, —0.125), minimum (b) (1, 0.368), maximum 
(c) none 


(d) (0.25, 1.125), maximum 


(c) (0,0) maximum, (1, —1) minimum, (—1, —1) 


minimum 
(e) (2, —16) minimum, (—2, 16) maximum 
(f) none 
Technical Computing Exercises 12.2 

Computer languages such as MATLAB® are matrix (a) Use a technical computing language to find y’ and 
orientated and do not always provide the ability to y” when y = e9 cost, 
differentiate functions. Others such as Wolfram (b) Solve y’ = 0 and hence locate any turning points 
Mathematica and Maplesoft Maple have this capability by entheinterval [0, 6] and determine their type. 


default. If you are attempting the following exercises in 
MATLAB® you may require the Symbolic Math Toolbox 
which is an add-on for the main program. 


(c) Plot a graph of y and check the position of 
the turning points with the results obtained in 
part (b). 


POINTS OF INFLEXION 


Recall from Section 11.4 that when the gradient of a curve, that is y’, is increasing, 
the second derivative y” is positive and the curve is said to be concave up. When the 
gradient is decreasing the second derivative y” is negative and the curve is said to be 
concave down. A point at which the concavity of a curve changes from concave up to 
concave down or vice versa is called a point of inflexion. 


A point of inflexion is a point on a curve where the concavity changes from concave 
up to concave down or vice versa. It follows that y’ = 0 at such a point or, in 
exceptional cases, y” does not exist. 


Figure 12.10(a) shows a graph for which a point of inflexion occurs at the point 
marked A. Note that at this point the gradient of the graph is zero. Figure 12.10(b) shows 
a graph with points of inflexion occurring at A and B. Note that at these points the 
gradient of the graph is not zero. 

To locate a point of inflexion we must look for a point where y” = 0 or does not exist. 
We must then examine the concavity of the curve on either side of such a point. 
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Example 12.3 


Solution 


Example 12.4 


Solution 


(a) (b) 


Figure 12.10 
(a) There is a point of inflexion at A; (b) there are points of inflexion at A 
and B. 


Figure 12.12 
Figure 12.11 The derivative, y’, changes sign at x = 0, but 
The second derivative, y”, changes y” remains positive. 


sign at x = 0. 


Locate any points of inflexion of the curve y = x°. 


Given y = x°, then y’ = 3x” and y” = 6x. Points of inflexion can only occur where 
y” = 0 or does not exist. Clearly y” exists for all x and is zero when x = 0. It is possi- 
ble that a point of inflexion occurs when x = 0 but we must examine the concavity of 
the curve on either side. To the left of x = 0, x is negative and so y” is negative. Hence 
to the left, the curve is concave down. To the right of x = 0, x is positive and so y” 
is positive. Hence to the right, the curve is concave up. Thus the concavity changes at 
x = 0. We conclude that x = 0 is a point of inflexion. A graph is shown in Figure 12.11. 
Note that at this point of inflexion y’ = 0 too. 


A common error is to state that if y’ = y” = 0 then there is a point of inflexion. This 
is not always true; consider the next example. 


Locate all maximum points, minimum points and points of inflexion of y = x*. 


y=49 y" = 12x° 


Example 12.5 


Solution 
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y = Oatx = 0. Also y” = 0 at x = 0 and so the second-derivative test is of no help 
in determining the position of maximum and minimum points. We return to examine y’ 
on both sides of x = 0. To the left of x = 0, y’ < 0; to the right y’ > 0 and so x = O is 
a minimum point. Figure 12.12 illustrates this. Note that at the point x = 0, the second 
derivative y” is zero. However, y” is positive both to the left and to the right of x = 0; 
thus x = 0 is not a point of inflexion. 


Find any maximum points, minimum points and points of inflexion of y = x* + 2x”. 

Given y = x* + 2x then y = 3x? + 4x and y’ = 6x + 4. Let us first find any maximum 

and minimum points. The first derivative y’ is zero when 3x? + 4x = x(3x +4) = 0, that 
4 

is when x = 0 or x = —-~. Using the second-derivative test we find y’(0) = 4 which 


4 
corresponds to a minimum point. Similarly, y'( - =) = —4 which corresponds to a 


maximum point. 
We seek points of inflexion by looking for points where y” = 0 and then examining 


the concavity on either side. y” = 0 when x = —-. 
Since y” is negative when x < —~, then y’ is decreasing there, that is the function is 
concave down. Also, y” is positive when x > a and so y’ is then increasing, that is the 


2 
function is concave up. Hence there is a point of inflexion when x = — 7" The graph of 


y = x° + 2x? is shown in Figure 12.13. 


Figure 12.13 


4 
There is a maximum at x = —~, a minimum at 


x = 0 and a point of inflexion at x = — 3" 


From Examples 12.4 and 12.5 we note that: 


(1) The condition y’” = 0 is not sufficient to ensure a point is a point of inflexion. 
The concavity of the function on either side of the point where y’ = 0 must be 
considered. 


(2) Ata point of inflexion it is not necessary to have y’ = 0. 
(3) Ata point of inflexion y’ = 0 or y” does not exist. 


418 Chapter 12 Applications of differentiation 


EXERCISES 12.3 


1 Locate the maximum points, minimum points and (e) y= i 
points of inflexion of (f) y=1® 
(a) y=3r?+6t—1 (g) y=xt — 2x 
(b) y=4-t-?? (h) z=t+- 
3 2 
© y=>-> +10 5x3 
3 2 (i) y=xo-— 
io 2 3 
@) ys Zt y — 20x+7 j) y=r!4 
Solutions 


1 (a) (-1, —4) minimum (g) (0,0) maximum, (1, —1) minimum, 


; if 1 
(b) (—0.5, 4.25) maximum (A: alps | ee |, Pac: 
59 v3 9 v3 9 
(c) (0, 10) maximum, (1, 4 minimum, 
6 points of inflexion 
1 119 . ; : 
(. =) point of inflexion (h) (1, 2) minimum, (—1, —2) maximum 
131 Pee ; (i) ioe minimum, i maximum, 
(d) { 4, a minimum, (—5, 77.83) maximum, 3 3 
17.0 finfl (010) | ae — 
== F int of i i VY); > > , are 
( 5 s) point of inflexion V2" 122 V2’ 12/2 
(e) (0, 0) point of inflexion also points of inflexion 
(f) (0,0) minimum G) (0, 0) point of inflexion 
Technical Computing Exercises 12.3 


(a) Use a technical computing language such as | (b) From your graph find the position of any maxima, 
MATLAB® to produce a graph of y = 3t!/9. minima or points of inflexion. 


THE NEWTON—RAPHSON METHOD FOR SOLVING 
EQUATIONS 


We often need to solve equations such as 
f(x) = 2x4 -x +x -10=0 
ft) =2e% —-7 =0 
f@) =t-—sint =0 


The Newton—Raphson technique is a method of obtaining an approximate solution, or 
root, of such equations. It involves the use of differentiation. 


Example 12.6 


Solution 
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y=f@) 


f(x) 


Figure 12.14 
The tangent at B intersects the x axis at C. 


Suppose we wish to find a root of f(x) = 0. Figure 12.14 illustrates the curve y = 
f(x). Roots of the equation f(x) = 0 correspond to where the curve cuts the x axis. 
One such root is illustrated and is labelled x = <. Suppose we know that x = x, is an 
approximate solution. Let A be the point on the x axis where x = x, and let B be the 
point on the curve where x = x,. The tangent at B is drawn and cuts the x axis at C where 
xX = x,. Clearly x = x, is a better approximation to < than x,. We now find x, in terms of 
the known value, x,. 


AB = distance of B above the x axis = f(x,) 


CA = x, — xX, 
Hence, 
AB 
gradient of line CB = — = FM) 
CA x, —X, 


But CB is a tangent to the curve at x = x, and so has gradient f’(x, ). Hence, 


f' () S f(a) 
aj 49 
eee _ f(x) 
eS 
and therefore, 
= F&) 


Jy = Ki PG, (U2) 


Equation (12.7) is known as the Newton—Raphson formula. Knowing an approximate 
root of f(x) = 0, that is x,, the Newton—Raphson formula enables us to calculate an 
improved approximate root, x5. 


Given that x, = 7.5 is an approximate root of e* — 6x? = 0, use the Newton—Raphson 
technique to find an improved value. 


x, =7.5 
f@=e — 68 f(x) =—723 
f'@)=e-18 = f"(%,) = 796 
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Using the Newton—Raphson technique the value of x, is found: 


—723 
pe ee Cee Ts 
f(x) 796 
An improved estimate of the root of e* — 6x°7 = 0 is x = 8.41. To two decimal places 
the true answer is x = 8.05. 


The Newton—Raphson technique can be used repeatedly as illustrated in 
Example 12.7. This generates a sequence of approximate solutions which may converge 
to the required root. Each application of the method is known as an iteration. 


Example 12.7 A root of 3sinx = x is near to x = 2.5. Use two iterations of the Newton—Raphson 
technique to find a more accurate approximation. 


Solution The equation must first be written in the form f(x) = 0, that is 
f@) =3sinx-x=0 
Then 
x, = 2.5 
f(x) = 3sinx —x f (x,) = —0.705 
f(x) =3cosx—- 1 Sf’) = —3.403 
Then 


2,5 — SP) _ 9.093 

G25. =. Se 

> (—3.403) 

The process is repeated with x, = 2.293 as the initial approximation: 
x; = 2,293 f(x,) = —0.042 f' (x) = —2.983 


Then 


(—0.042) 
299932 5 = 9570 
2 (—2.983) 


Using two iterations of the Newton—Raphson technique, we obtain x = 2.28 as an im- 
proved estimate of the root. 


Example 12.8 An approximate root of 
er —-2?-5=0 


is x = 3. By using the Newton—Raphson technique repeatedly, determine the value of 
the root correct to two decimal places. 


Solution We have 


x,=3 

f@ =x -27-5 f(x) =4 

f' (s) = 32° — 4x f'@,) = 15 
Hence 


4 
= 3-75 = 2.733 
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An improved estimate of the value of the root is 2.73 (2 d.p.). The method is used again, 
taking x, = 2.73 as the initial approximation: 


x, =2.73 f(x) =0441 ~— f'(@,) = 11.439 
0.441 


11.439 
An improved estimate is x = 2.69 (2 d.p.). The method is used again: 


x, =2.69  f(x,)=-0.007 _f’(x,) = 10.948 


Xy = 2.73 — = 2.691 


So 
—0.007 
xX, = 2.69 — al) = 2.691 
10.948 
There is no change in the value of the approximate root and so to two decimal places the 


root of f(x) = 0 is x = 2.69. 
The calculation can be performed in MATLAB® using the roots function: 
roots([1 -2 0 -5]) 


which will produce the real root of 2.69, agreeing with our numerical calculation using 
the Newton—Raphson method. The two complex roots of the equation will also be re- 
turned. Technical computing languages make use of a variety of numerical methods and 
to some extent the user has to take it on trust that they are correctly implemented and 
tested so that they always produce the correct result. 


The previous examples illustrate the general Newton—Raphson formula. 


If x = x, is an approximate root of f(x) = 0, then an improved estimate, x,,,,, is 


n+1? 
given by 


ee ec) 
Se TCE) 


The Newton—Raphson formula is easy to program in a loop structure. Exit from the loop 
is usually conditional upon |x,,,; — x,| being smaller than some prescribed very small 
value. This condition shows that successive approximate roots are very close to each 


other. 


Engineering application 12.3 


Series diode—resistor circuit 


Consider the circuit of Figure 12.15(a). A diode is in series with a resistor with re- 
sistance R. The voltage across the diode is denoted by V and the current through the 
diode is denoted by J. The /—V relationship for the diode is non-linear and is given 
by 


T= 1(e"" -1) 


where /, is the reverse saturation current of the diode. Given that the supply volt- 
age, V,, is 2 V, 1. is 10-4 A and R is 22 kQ, calculate the steady-state values of J and V. 
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>< 


Point at which both 
the diode and the 
resistor equations 
are satisfied 


(b) oe 


Figure 12.15 
A simple non-linear circuit: (a) series diode-resistor circuit; (b) resistor load line 
superimposed on diode characteristic. 


Solution 

There are several ways to solve this problem. A difficulty exists because the diode 
I-V relationship is non-linear. One possibility is to draw a load line for the resistor 
superimposed on the diode J—V characteristic, as shown in Figure 12.15(b). The load 
line is an equation for the resistor characteristic written in terms of the voltage across 
the diode, V, and the current through the diode, /. It is given by 


V.-V=IR 
l= ee 
AR R 


1 V. 
This is a straight line with slope =F and vertical intercept a When V = 0,7 = 
V. 
a This corresponds to all of the supply voltage being dropped across the resistor. 


When V = V,, J = 0. This corresponds to all of the supply voltage being dropped 
across the diode. Therefore, these two limits correspond to the points within which 
the circuit must operate. The solution to the circuit can be obtained by determining 
the intercept of the diode characteristic and the load line. This is possible because 
both the resistor characteristic and diode characteristic are formulated in terms of V 
and /, and so any solution must have the same values of J and V for both components. 
If an accurate graph is used, it is possible to obtain a reasonably good solution. An 
alternative approach is to use the Newton—Raphson technique. Combining the two 
component equations gives 


—V+V,=Ri(e” — 1) 
Now 22 10 eo and.co 
—V 4+2=29x 10? x 106-"(e"" — 1) 
Now, define f(V) by 
vy Va 10 ey ey 
We wish to solve f(V) = 0. We have 
FW = 22 < 107" 406" PIS 88 105 6 I 
Choose an initial guess of V, = 0.5: 


me ee ee DOSEN EA eS = 
a er Oe 8.8 x 10-%e20 + 1 


= 0.7644 
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With an equation of this complexity, it is better to use a computer or a programmable 
calculator. Doing so gives 


Vs = 0.6895, ..., Vip = 0.5770, ..., Vig = 0.5650 


which is accurate to four decimal places. 
It is useful to check the solution by independently calculating the current through 
the diode using the two different expressions. So, 
P=A0e (G2 6.58 10s A 
0.5650 2 


(== = 6. 10° A 
I | oe 


and therefore the solution is correct. 


EXERCISES 12.4 


1 Use the Newton-Raphson technique to find the value ()e/? —5x=0 x, =6 


of a root of the following equations correct to two 
decimal places. An approximate root, x,, is given in 


2. 
each case. (e) sinx + 7 =-1 x; = 0.6 
= 42 = 
ChE = os 2 Explain circumstances in which the Newton—Raphson 
(b) 3x3 — 4x7 + 2x -9 =0 xX, =2 technique may fail to converge to a root of f(x) = 0. 
Solutions 
1 (a) 1.02 (b) 1.85 (c) 7.15 (d) 1.76 | (e) 0.64 


DIFFERENTIATION OF VECTORS 


Consider Figure 12.16. If r represents the position vector of an object and that object 
moves along a curve C, then the position vector will be dependent upon the time, t. We 
write r = r(t) to show the dependence upon time. Suppose that the object is at the point 
P with position vector r at time ¢ and at the point Q with position vector r(t + df) at 
the later time ¢ + dt as shown in Figure 12.17. Then PO represents the displacement 
vector of the object during the interval of time 67. The length of the displacement vector 
represents the distance travelled while its direction gives the direction of motion. The 
average velocity during the time from f to t + df is the displacement vector divided by 
the time interval 65f, that is 


PQ r(t+5t) —r(t) 
St 


average velocity = i 


The instantaneous velocity, v, is given by 


. rit+6t)—r(t) dr 
v= lim = 
510 ot dt 
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Example 12.9 


y 
P 
C 
PO 
r r(t) 
Q 
x r(t + 82) 
Figure 12.16 Figure 12.17 
Position vector of a point P on a Vector PQ represents the displacement of 
curve C. the object during the time interval ér. 


Now, since the x and y coordinates of the object depend upon the time, we can write the 
position vector r as 


r(t) =x(Hit yOj 
Therefore, 


r(t + dt) = x(t + dt)it+ yt + 61)j 


so that 
_ x(t + dthit+y(t + dt)j —x(i— y(nj 
v(t) = lim 
st>0 ot 
_ {xit+dt)—xt),  y@+dt)—y), 
= lim 1+ J 
st>0 bt ot 
dx, dy, 
a + a 


often abbreviated to v =r = Xi + yj. Recall the dot notation for derivatives w.r.t. 
time which is commonly used when differentiating vectors. So the velocity vector is 
the derivative of the position vector with respect to time. This result generalizes in an 
obvious way to three dimensions. If 


rt) =x@)it+ yOj+zOk 
then 
r(t) =x(Hi+yOj+zok 
The magnitude of the velocity vector gives the speed of the object. We can define the 
acceleration in a similar way: 
dv @r 
a=—= 
dt dr? 


=f =<xXi+ ¥j+ zk 


In more general situations, we will not be dealing with position vectors but other physical 
quantities such as time-dependent electric or magnetic fields. 


If a = 327i + cos 2¢j, find 
da 


dt 


2 
(c) = 


da 
@ > @) ae 
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Solution (a) Ifa = 3?7i + cos 2rj, then differentiation with respect to t yields 


d 
= = = 6fi — 2 sin 2¢j 


d 
(b) S| = J (6t)? + (—2sin 21)? = V 3612 + 4 sin? 2 


2 


agate 
Cc — = 01 —4COS 
dr? J 


It is possible to differentiate more complicated expressions involving vectors pro- 
vided certain rules are adhered to. If a and b are vectors and c is a scalar, all functions 
of time f, then 


da dc db da 
ae reg? wey net Rate Di posed ie 
a ea cla ae oe ee 
da db d da 
ao) Feat Slax b)aax 2+ ox xb 


Example 12.10 Ifa = 3¢i— rj and b = 227i + 3), verify 
(a) — b) =a- + . -b (b) : Pr b) =a _ + a xb 
a) — =a-—+— —(axb) =ax — + — 
dt dt dt dt 


Solution (a) a-b= (3ti—?’j)- (21+ 3j) = 6t? — 37? 


d 2, 
—(a-b) = 184° — 6t 
dt 


Also 
da db 
— = 3i - 2tj — =4ti 
dt dt 
So, 
db d 
avi +b: = = (311 — #2) (4ti) + (274 + 39) - Gi — 249) 
= 127° + 6t? — 6f = 1827 — 6t 
db di 
We have verified “(a b) =a-— + aa b. 
dt dt 
ij k 
(b) axb=|3t -# 0 
2 2630 
= (9t + 2¢*)k 


: (ax b) = (94+ 8f)k 
— e4 = 
dt 
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Also, 
j k 
db |' J 
ax—=(|3t -? 0O 
doi 0 Oo 
= 4k 
aa i j &k 
—xb=/3 —2t 0 
dr 2 3 0 
= (9+ 49r)k 
and so 


db da 


d 
ax +7 xb= 4k + 9+ 40)k = (9 + 8° )k = — (a xb) 


dt dt 


as required. 


EXERCISES 12.5 


1 ifr = 3ti+ 27j +0, find 5 
(a) dr (b) @r 
= ee 
dt dt? 


2 Given B = te“‘i-+ cos fj find 

x dB (3 dB 6 
Pye ane 
dt dr? 


di 
3 Ifr = 417i + 2tj — 7k evaluate r and " when t = 1. 


7 
4 Ifa=fi-—7tk, and b = (24+-Hi+7j —2k, 
da db 
finda- b) find — find — 
(a) finda-b (b) find — (c) find 
d db da 
d) show that —(a-b) =a- — + —-b. 
(d) show sa Tc ) a a 
Solutions 
1 (a) 3i+ 4tj+37°k 3 
(b) 4j + 6rk 4 
2 (a) (-te'+e)i-—sintj 
(b) e'(t — 2)i — cos tj 5 


Given r = sinti + cos fj, find 


(ar (b)r ©) Il 


Show that the position vector and velocity vector are 


perpendicular. 
Show r = 3ei+ (2 + 1)j satisfies 
¥+rj 


Given a = 7i— (4—1)j, b =i+ tj show 


d db da 
(a) eax h= (»» 2)\+(¢ «») 


d 
(b) qe aa 


4i + 2j — 7k, 8i + 2j 
(a) 17° +2727+14) = (b) 377i — 7k 
(c) i+2¢j 


(a) costi— sintj (b) —sinti-—costj 


(c) 1 
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REVIEW EXERCISES 12 


1. Determine the position of all maximum points, 4 Given 
minimum points and points of inflexion of a= (12 +1)i-j+¢k 
(a) y = 283 — 210? + 604 +9 b= 21j—-k 
find 
(b)y =1(? - 1) pe da w) 
a) — eas 
2 In Section 9.8 we showed that the impedance of an dt dt 
LCR circuit can be written as d d 
(c) qa» (d) —(a xb) 
: 1 t dt 
Z=k rj ol— oC Use two iterations of the Newton—Raphson method 
: to find an improved estimate of the root of 
(a) Find |ZI. P ; 
: ae sinft=1—=,0 <t < 7, givent = 0.7 is an 
(b) Fora given circuit, R, L and C are constants, and . 2 
- dizi approximate root. 
q@ can be varied. Find ——. 
da . Determine the position of all maximum points, 
(c) For what value of w will |Z| have a maximum or St P ; : . 
an : : minimum points and points of inflexion of 
minimum value? Does this value give a 
maximum or minimum value of |Z|? (a) y= ee 
3 Use two iterations of the Newton—Raphson technique (b) y= Pe“ 
to find an improved estimate of the root of (c) y= x3 — 3x2 43x—1 
Pee (d) y=e*+e7% 
given f = 1.8 is an approximate root. @) y=|t|-? 
Solutions 
; i 7 95 (a) 2ti+k 
1 (a) (2,61) maximum, (5, 34) minimum, ue a) een 
: (b) 2j 
point of inflexion 
(c) —3 
1 2 a 1 2 d) —4ti+ 2tj + (61? + 2)k 
(b) (= -3] minimum, (-+ =a) ( J 
v3" 3v3 V3 3V3 0.705, 0.705 
maximum, (0, 0) point of inflexion 
(a) (0, 1) isa maximum 
2 (a) R24 @212 2L \ 1 f2 
mie CC wl Points of inflexion when x = + ae 
ol2 —1 /o C2 (b) (0,0) point of inflexion, (3, 1.34) maximum. 
(b) Further points of inflexion when t = 4.73, 1.27 
VR? + wl? — 2L/C + 1/(w2C2) eS 
(c) (1, 0) point of inflexion 
1 bn 4 
(c) O= Jig produces a minimum value of Z (d) (0,2) minimum 
(e) (0.5, 0.25) maximum, (—0.5, 0.25) maximum, 
3 1.859, 1.857 minimum at (0, 0) (y’ does not exist here) 
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INTRODUCTION 


When a function, f(x), is known we can differentiate it to obtain the derivative, — 


The reverse process is to obtain the function f(x) from knowledge of its derivative. This 
process is called integration. Thus, integration is the reverse of differentiation. 

A problem related to integration is to find the area between a curve and the x axis. 
At first sight it may not be clear that the calculation of area is connected to integration. 
This chapter aims to explain the connection. An area can have various interpretations. 
For example, the area under a graph of power used by a motor plotted against time 
represents the total energy used by the motor in a particular time period. The area under 
a graph of current flow into a capacitor against time represents the total charge stored 
by the capacitor. 

Circuits to carry out integration are used extensively in electronics. For example, a 
circuit to display the total distance travelled by a car may have a speed signal as input and 
may integrate this signal to give the distance travelled as output. Integrator circuits are 
widely used in analogue computers. These computers can be used to model a physical 
system and observe its response to a range of inputs. The advantage of this approach is 
that the system parameters can be varied in order to see what effect they have on system 
performance. This avoids the need to build the actual system and allows design ideas to 
be explored relatively quickly and cheaply by an engineer. 


Example 13.1 


Solution 
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ELEMENTARY INTEGRATION 


d 
Consider the following problem: given *. = 2x, find y(x). Differentiation of the func- 


d 
tion y(x) = x? + c, where c is a constant, yields 7 = 2x for any c. Therefore y(x) = 


x’ +c is a solution to the problem. As c can be any constant, there are an infinite number 
of different solutions. The constant c is known as a constant of integration. In this ex- 
ample, the function y has been found from a knowledge of its derivative. We say 2x has 
been integrated, yielding x° +c. To indicate the process of integration the symbols f and 
dx are used. The { sign denotes that integration is to be performed and the dx indicates 
that x is the independent variable. Returning to the previous problem, we write 


ys frxdr=2 +e 


t i 7 
symbols for constant of integration 
integration 


In general, if 


2 = Fe) 
then 
y= [roa 


Consider a simple example. 


d 
Given oe cos x — x, find y. 
dx 


We need to find a function which, when differentiated, yields cos x — x. Differentiating 
sin x yields cos x, while differentiating —x?/2 yields —x. Hence, 


2 
y= | (o0sx— 2) dr=sinx > +e 


where c is the constant of integration. Usually brackets are not used and the integral is 
written simply as [ cos.x — x dx. 


The function to be integrated is known as the integrand. In Example 13.1 the integrand 
is COSX — Xx. 
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tt 1 


n+1 


d ttl 
Example 13.2 Find = + °) and hence deduce that fx” dx = tc. 
n 


dx\n+1 


Solution From Table 10.1 we find 


d / x"! d / xt! d é has 
( “ + ) = ( x ) 4 7,0) using the linearity 


Genel) dk\ael of differentiation 


again using the 


1 
- St) +e 


n+1dx dx linearity of differentiation 
il 
= ——{(n+ 1)x"}+0 using Table 10.1 
n+1 
= x" 


Consequently, reversing the process we find 


itl 
[va +c 
n+1 


as required. Note that this result is invalid ifn = —1 and so this result could not be 
applied to the integral [(1/x) dx. 


Table 13.1 lists several common functions and their integrals. Although the variable 
x is used throughout Table 13.1, we can use this table to integrate functions of other 
variables, for example ¢ and z. 


Example 13.3 Use Table 13.1 to integrate the following functions: 
(a) x 
(b) cos kx, where k is a constant 
(c) sin(3x + 2) 
(d) 5.9 
(e) tan(6t — 4) 
ie 
(g) 2 


(h) cos 100n7t, where n is a constant 


Solution (a) From Table 13.1, we find {ke a +c,n # —1.To find f x* dx let n = 4; 
we obtain 
{2 dx = Zl +c 
5 
sin(ax) 


(b) From Table 13.1, we find f cos(ax) dx = ——— +c. In this case a = k and so 
a 


in kx 
[costar = — +c 
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Table 13.1 
The integrals of some common functions. 
f@) J fo) dx f@) Sf) de 
sin b 
k,constant kx+c eaten B (ax + b) dee 
xi! 
x” +e n#-1 tanx In | sec x| + ¢ 
n+1 
1 In | sec ax| 
xis H In |x| +¢ tan ax —= 
x a 
x x 1 b 
€ e+c tan(ax + b) maser 
a 
e* —e*+c 1 
cosec(ax+b)  —{In|cosec(ax + b) 
ax a 
et aa —cot(ax + b)|} +c 
a 
‘ 1 
sin x —cosx+c sec(ax + b) —{In | sec(ax + b) 
a 
; —cos ax +tan(ax + b)|} +e 
sin ax +c 
1 : 
; —cos(ax + b) cot(ax + b) —{In | sin(ax + b)|} +c 
sin(ax +b) =———— +c a 
a 1 ; 
cos x sinx +c a ao a rs 
: 1 
sin ax —_—__ -tan-!—+¢ 
cos 
ax - c a +x a 


Note that a, b,n and c are constants. When integrating trigonometric functions, 
angles must be in radians. 


(c 


Nee 


(d) 


(e) 


From Table 13.1, we find f sin(ax + b) dx = +c. In this case a = 3 


—cos(ax + b) 
and b = 2, and so - 


—cos(3x + 2) 


[singe dx = - 


From Table 13.1, we find that if k is a constant then f k dx = kx +c. Hence, 


[59dr=sor+e 


In this example, the independent variable is ¢ but nevertheless from Table 13.1 we 
can deduce 


1 t+b 
[ana + dt — MIBSeUP ane 6 
a 
Hence with a = 6 and b = —4, we obtain 


1 6t — 4 
n|sec( Mae 


tan(6t — 4) dt = 
fan ) . 
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13.2.1 


Example 13.4 


az 


(f) The independent variable is z but from Table 13.1 we can deduce J e“dz= a +c. 
Hence, taking a = —3 we obtain 7 


fem 4 eo % n eo ip 
Ve c= c 
—3 3 


2 


1 
(g) Since 2 =x“, we find 


1 
[a“- [rtwatten-ite 


(h) When integrating cos 100n7tt with respect to rt, note that 100n7t is a constant. Hence, 
using part (b) we find 


sin 100n7tt 


I t dt = 
/ cos 100n7t T00n7 


Integration as a linear operator 


Integration, like differentiation, is a linear operator. If f and g are two functions of x, 


then 
[r+sar= [rors fea 


This states that the integral of a sum of functions is the sum of the integrals of the indi- 
vidual functions. If A is a constant and f a function of x, then 


[Arac=a f pa 


Thus, constant factors can be taken through the integral sign. 
If A and B are constants, and f and g are functions of x, then 


faresedraa [parte f gar 


These three properties are all consequences of the fact that integration is a linear op- 
erator. Note that the first two are special cases of the third. The properties are used in 
Example 13.4. 


Use Table 13.1 and the properties of a linear operator to integrate the following 
expressions: 


(a) 2+9 (d) (t +2)? (g) 3sin4r 
(b) 34-7 (e) : og (h) 4cos(9x + 2) 


(c) : (f) 4e* (i) 3e” 


Solution 
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. sinx+cosx z—l 
0) — () tan 5 ) (n) 3sec(4x — 1) 
(k) 2t—e’ (m) ef te! (0) 2cot 9x 


(p) 7cosec(7/3) 


(a) [eroar= fears fax using linearity 


3 
=> +9x+e using Table 13.1 


Note that only a single constant of integration is required. 


(b) / 34 — J/t dt =3 i t* dt — } 1”? dt using linearity 


p 13/2 

= 3(5) = 32 +c using Table 13.1 
ar ape 

“—_. oo 


1 
(c) / — dx = In|x| +c using Table 13.1. 
x 
Sometimes it is convenient to use the laws of logarithms to rewrite answers involving 


logarithms. For example, we can write In |x| + c as In |x| + In|A| where c = In [A]. 
This enables us to write the integral as 


1 
fc ee=inian 
Xx 


B 
(d) fororaa[Ptarrag =F 42? +4 te 


1 Zz 
(e) —4+zdz=In|z}+—+c 
Zz 2 


de: 
(f) [setae te=20% +e 


: 3 cos 4t 
(g) [ssinan dt = — ri 
4sin(9x + 2) 
——__+e 


(h) [ 4cos(9x +2) dr = . 


2z 
(i) poeva=s ag 


. sin x + cos x —cosx+ sinx 
g [Ree 


(k) [rureaar-e se 


(1) fron(S -) dz =2In 


z-l 
see (#5*)| +e 
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(m) [etetase-ette 
3 
(n) 3 sec(4x — 1) dx = plese Dane e 
2 : 
(0) [2cor9x ax — ria | sin 9x| +c 


(p) / 7 cosec(7t/3) dx = {7 cosec(7t/3)}x +c as cosec(7t/3) is a constant 


Engineering application 13.1 


Distance travelled by a particle 


The speed, v, of a particle is the rate of change of distance, s, with respect to time f, 


thatis v = sy The speed at time ¢ is given by 3+ -2r. This is illustrated in Figure 13.1. 


Note that the speed of the particle is increasing linearly with time. Find the distance 
in terms of f. 


Solution 
We are given that 
ds 
=— =3+42t 
i dt BE 


and are required to find s. Therefore, 
sa f 342d 5+? +c 


Note that the speed of the particle is modelled by a linear expression in t. This means 
that the speed increases by the same amount in each subsequent second. On the other 
hand, the distance, s, is modelled by a quadratic in t. Therefore, the distance travelled 
in each subsequent second increases. Figure 13.1 illustrates the speed-time graph and 
Figure 13.2 illustrates the distance-time graph. 


v 9 
3 

c 
O if O ff 
Figure 13.1 Figure 13.2 
Graph of speed of the particle, v, against Graph of distance travelled by the 


time, f. particle, s, against time, f. 
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Engineering application 13.2 


Voltage across a capacitor 


Recall from Engineering application 10.2 that the current, i, through a capacitor 
depends upon time, f, and is given by 


where v is the voltage across the capacitor and C is the capacitance of the capacitor. 
Derive an expression for v. 


Solution 
If 
du du i 

i= C= th <a 

ee See aes 
Therefore, 

j 1 
v= / > da— CG i! i dt using linearity 


Note that whereas the capacitance, C, is constant, the current, 7, is not and so it cannot 
be taken through the integral sign. In order to perform the integration we need to know 
i as a function of t. 


13.2.2 Electronic integrators 


Often there is a requirement in engineering to integrate electronic signals. Various cir- 
cuits are available to carry out this task. One of the simplest circuits is shown in 
Figure 13.3. The input voltage is v,, the output voltage is u,, the voltage drop across 
the resistor is Up and the current flowing in the circuit is 7. Applying Kirchhoff’s voltage 
law yields 


v =U +, (13.1) 


For the resistor with resistance, R, 


Vy =iR (13.2) 
For the capacitor with capacitance, C, 
du 
;—- C2 1333 
dt oe) 
Combining Equations (13.1) to (13.3) yields 
du, 
v; = RC + v, (13.4) 
dt 


In general, v, will be a time-varying signal consisting of a range of frequencies. For 
the case where v, is sinusoidal we can specify a property of the capacitor known as the 
capacitive reactance, X., given by 
1 
xX,=>= 
2nfC 
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. i 
<-— 
UR 
Vi ] Cc Uo 
Figure 13.4 
Figure 13.3 Summing integrator using an operational 
Simple integrator. amplifier (op amp). 


where f = frequency of the signal (Hz). It can be seen that X, decreases with increasing 
frequency, f. For frequencies where X, is small compared with R, most of the voltage 
drop takes place across the resistor. In other words, v, is small compared with vg. Exam- 
ining Equation (13.1) for the case when X, is very much less than R (written as X, < R), 
and v, < vp, it can be seen that Equation (13.4) simplifies to 


rede 
Ve = 
: dt 


(13.5) 


This equation is only valid for the range of frequencies for which X, < R. Rearranging 
Equation (13.5) yields 


du, v; 

dt RC 

v, = — | v,. dt 
fe} RC 1 


The output voltage from the circuit is an integrated version of the input voltage with a 
scaling factor —. 
RC 


An electronic integrator which performs better can be made from an operational am- 
plifier. The circuit for an operational amplifier integrator is given in Figure 13.4. The 
main advantage of this circuit is the low output impedance and high input impedance, 
making it useful for electronic control applications and analogue signal processing. The 
function of the operational amplifier is to amplify the potential difference between the 
inverting and non-inverting inputs. These are labelled — and + respectively in the circuit 
diagram. Usually the gain of the amplifier is extremely high, so even a small voltage dif- 
ference between the two inputs will give a very large output voltage, which is limited by 
the voltage supply attached to the device. Notice in this circuit that there is a capacitor 
connected from the output back to the input. This capacitor provides negative feedback. 
This means that a proportion of the output voltage is fed back to the input and this in 
turn serves to reduce the output. As a consequence of this the overall gain of the circuit 
is limited and the amplifier reaches an equilibrium state where the voltage at point X is 
the same as at the non-inverting input, which is connected to earth. For this reason the 
point X in the circuit is sometimes referred to as a virtual earth. 

Assuming point X is at zero volts, and using Ohm’s law, gives 


=, =e (13.6) 


13.2.3 


Example 13.5 
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Figure 13.5 
Output from an operational 


00 amplifier integrator circuit with 
290 291 292 293 294 295 296 297 298 299 300 v, being a square wave input 


t (ms) and v, = v3 = 0. 


Assuming i, is negligible, then 
ip = 1, +i, +1; (13.7) 


For the capacitor, 


“ (13.8) 


The negative sign is a result of the direction chosen for i,. Combining Equations (13.6)- 
(13.8) yields 


vy UV, U3 dv, 
=—=C 
R, e R, as R, dt 
a wD) U3 
v, =- dt 
. lac - R,C - R,C 


The circuit therefore acts as an integrator. The minus sign in the integration is a result 
of the circuit design which is known as an inverting circuit. A typical output from this 
circuit is given in Figure 13.5. 


Integration of trigonometric functions 


The trigonometric identities given in Table 3.1 together with Table 13.1 allow us to 
integrate a number of trigonometric functions. 


Evaluate 


(a) { cos*t dt 
(b) f sin’ t de 
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Solution Powers of trigonometric functions, for example sin’ t, do not appear in the table of stan- 
g p PPp' 


dard integrals. What we must attempt to do is rewrite the integrand to obtain a standard 
form. 


(a) From Table 3.1 


1 + cos 2t 
2 


2 
cos’ t = 


and so 


1 2t 
feotra = fa 


_ ¢t  sin2t 
2 4 
(b) / sin’ t dt = / 1 — cos’ t dt using the trigonometric identities 
= / 1 dt — / cos’ t dt using linearity 
= t a sin 2t 4 . t (a) 
a 5 m1 ¢ using part (a 
_ t  sin2t 4§ 
~ 2 4 


Example 13.6 Find 


(a) f sin 2t cost dt 


(b) f sin mt sinnt dt, where m and n are constants with m 4 n 


Solution (a) Using the identities in Table 3.1 we find 
2sinA cos B = sin(A + B) + sin(A — B) 


1 
hence sin 2t cost can be written 5 (sin 3t + sint). Therefore, 


1 
[ sin2e cost dt = / ane + sint) dt 


1 ( —cos3t 2 
= — co 
5 3 S Cc 


1 
—— cos 3f — = cost 
eis a0 +c 


(b) Using the identity 2 sinA sin B = cos(A — B) — cos(A + B), we find 


1 
sin mt sin nt = gore —n)t — cos(m+ n)t} 


1 


Therefore, 


/ sin mt sinnt dt = 


EXERCISES 13.2 


Integrate the following expressions using Table 13.1: 


(a) x10 (b) 9 @2> 
1 
(d) vt (@) = () 3.2 
x i : 2 3 
(g) a (h) Si (i) @) 
Oe ae ky 
Integrate the following expressions using Table 13.1: 
(a) e* (b) e® © e* 
(d) 5 (e) et ® 
1 —2.5x 
@) |G me» 


Integrate the following expressions using Table 13.1: 


(a) sin4x (b) sin 9t 
(c) (3) (d) (2) 
2 5 
(e) cos 7x (f) cos(—3x) 
(g) cos ( *) (h) tan 9x 
(i) cosec 2x G) sec 5t 
(k) cot 8y () cos(5t + 1) 
(m) tan(3x + 4) (n) sin(3t — 70) 


(0) cosec(5z+ 2) 


) si 2t 1 
(q) sin = 


(s) cosec(7t — 2x) 


(p) «(5 + ) 


(r) cot(5 — x) 


Use Table 13.1 to integrate the following expressions: 


1 
® 3 ) — 
1 
© = > W3 
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. 5 lcosim —n)t —cos(m+n)t} dt 


sin(m—n)t  sin(m+n)t 
+c 
m—-n m+n 
(e) : (f) : 
py se 
/0.25 — x2 0.01 + v2 
1 1 
a hiya 
© iy 7 © 042 
1 
(i) Q) 
2—x2 : 432 


Integrate the following expressions: 


1 
(a)3+x+— 
Xx 
(b) e2x _ eo 2k 


(c) 2 sin 3x + cos 3x 


(d) sec(2t + 7t) + ca( 5 — ") 


(e) wo(5) + cosec(3t — 7) 


. x 1 
(f) Sue sr ob = 
1 


cos(3x) 


2 
1 
(h) ("+ 7} 
t 
(i) 
 3e2e 
(j) tan(4t — 3) + 2 sin(—t — 1) 


(k) 1 + 2cot 3x 


1 si t 3 t 
(1) sin 3) cos 3 


(m)(t — 2)? 

(n) 3e7* — e~#/2 
(0)7 — 7x6 +e 
(p) (k+1)? 
(q) ksint — cos kt 


(g) 


k constant 


k constant 
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1 3 1 
= se d) ———— 
© 42 oi SS Te 
1 2 -7 
(s) ———— (¢) === () 
V25— 2 4—x2 2 — 3x2 
1 
6 1427 i) ——— 
t hye 
Tae 6 V1 — (-/2) 
: ..  3+x, 3 
6 The acceleration, a, of a particle is the rate of change 11 By writing in the form > +1, 
: : . dv 34x 
of speed, v, with respect to time f, that is a = a The find / dx. 
x 


speed of the particle is the rate of change of 


d 
distance, s, that is v = = If the acceleration is given mm (2) Express 


t x2 + 2x41 
by 1+ 5? find expressions for speed and distance. x2 +1) 


; ; ae ; as its partial fractions. 
7 The speed, v, of a particle varies with time according 


2 42x41 
to (b) Hence find / ll 
x(x2 + 1) 
v@)=2—e" ‘ ee As 
13 The velocity, v, of a particle is given by 
(a) Obtain an expression for the distance travelled by 1/2 
: v=2+e 
the particle. 4 
(b) Calculate the distance travelled by the particle (a) Given distance, s, and v are related by 7 =v 
t 
between t = 0 and ¢ = 3. find an expression for distance. 
8 By writing sinh ax and cosh ax in terms of the (b) Acceleration is the rate of change of velocity 
exponential function find with respect to t. Determine the acceleration. 
(a) f sinhax dx 14 (a) Use the product rule of differentiation to verify 


(b) f coshax dx 


= (x e2*) = e%* + 2xe2* 
(c) Use your results from (a) and (b) to find dx 


jf 3sinh 2x + cosh 4x dx. (b) Hence show 
: ‘ 2 xe2* ex 
9 Acapacitor of capacitance 10~~ F has a current i(t) / xe* dy = —— —~ = 4+¢ 
through it where 2 4 
i(t)=10—e7% 15 Integrate 
. ; 2-7 4+ 
Find an expression for the voltage across the (a) —— (b) —- 
‘ i e 
capacitor. 
cos 4x 1 
10 sae the following: : (c) ain 4¢ (d) > sin 3x 
(a) =— () =, 24x 
+4 I +4 (©) 5 ~ (f) sin? ++ cos?t 
+x 
Solutions 
xll 1 
1 (a) aT. (b) 9x+c¢ (g) =o ee (h) 27f +c 
2 25 215 = ee « 
(c) sx" +e (d) =t~ +c G) —t+e Gj) --+c¢ 
5 3 7 x 
(e) In|z] +c (f) —3.2x+c 


3 
(k) re Fg 


(g) 


(b) 


(d) 


(e) 
(f) 


(g) 


(h) 
(i) 
Q@) 
(k) 
(1) 
(m 
(n) 
(0) 


(p) 


(q) 
(r) 
(s) 


(a) 
(b) 


(c) 


har 


= +e (b) +e 
et 
3 +c (d) -e*+c 
Je0-5t +c (f) 3e2/3 +e 
et 2e-2 Sx 
_ +e (h) — +c 
4 
cos 4x 
_ c 
4 
cos 9f 
_ c 
9 
2cos( ~ ) + 
—2cos|{ = : 
5 c 
5 2t 7 
——cos{| — c 
2 5 
sin 7x 
i 
7 
1, 
_ sin(—3x) +c 


2 sin 2 a 
= a c 
a 3 


In | sec 9x| 
——— +c 
9 


tin | cosec 2x — cot 2x|) +c 
z (ln | sect + tan 5t|) +c 

5 In| sin 85} +c 

5 sin(5t +1)4+c 
FIn|sec(3x +4) +e 

-; cos(3t — 7) +c 


1 
5 In| cosec(5z + 2) — cot(5z + 2)| +c 


ane ea ees 
sec 5 an 3 
3 2t 
cos l}+c 


—In|sin(S5 — x)| +c 


21n +c 


1 
= In| cosec(7t — 2x) — cot(7t — 2x)| +c 


tan~!x+c 


sin’ x+c 


sin7! é +c 
. 2 
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i ea ( 2 ee 
i 3 3 Cc 


(e) sin7! (2x) +e 
(f) 10tan-!(10v) +¢ 


(g) 10-3 tan! { 2 | +c 
103 


1 t 
h) — =tan7!{ — 
OY) (5) + 


(i) wt) + 
2 
(j) 3tan—! (3x) +e 


x2 
(a) 3x+ > + In |x| +e 


e2* e72x 
b) — 
(b) 5 + 5 +c 
(c) 2 (x) sin 3x 
c cos t t 
a ae 


(d) 0.5 In| sec(2t + 7t) + tan(2t + 7t)| 
, (: 
sin{ — — 7 
2 
1 
+ = In| cosec(3t — 7) 


t 
sec} — 
2 3 


—cot(3t — 7)| +c 
x2 
(f) —cosx+ a —e*+e 


+2I1n +c 


(e) 21In 


1 
(g) gales 3x + tan 3x| +c 
(h) E20 Dees 

3 errr 


e72* 


@ — 


1 
(j) z In| sec(4t — 3)|+2cos(-—t—1)+c 
2 : 
(k) x + = ma sitas +c 


t aa ee 
(1) 2so(5) sxin(5) 6 


P > 
(m) > —2P +41 +e 


(n) —3e? + 2e7/2 +.¢ 

(0) Ix—x’ —e* +c 
B 

(p) k2t +k? + ae 


in kt 
(q) —kcost — —— +c 
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-1{? . d Pes a , 
(r) on 5 +c (d) sin 3 +c 


(s) w'(;) +e (e) 2sin™! 5) +c 
5 2 
3 7. _sfv3 
=I x * . (f) —=sin —x)+c 
(t) 6tan ATG T ie re J3 (3 
ie ro att 
6 Speed: t+ ri +c, distance: z + rr +ct+d (g) J/2 sin~ eG: +c 


7 (a) 2t+e%4+ce (b) 5.0498 
11 3in|x|+x+c 


cosh ax 
8 (a) Cc 1 
ae 12 (a) —+—— _ —_ (b) Injxl + 2tan7! x +c 
(b) sinh ax x x41 


¢ 1 
3 I 13 (a) 2t—2e/2 +¢ (b) —-e 1/2 
(c) 3 cosh 2x + ri sinh 4x + c 2, 


15 —t-2 —] t . 
9 100(10t+e")+c (a) n|t| +c 


4 
: 7 (b) =50 ee 
10 (a) ww-t(5) + 1 
- 2 (©) zin|sin4x| +¢ 
i w-(3) (ay Min 3 t3x| + 
an Pec — In| cosec 3x — cot 3x c 
2/2 J2 6 


3 = (e) x+tan-i xtc 
(c) ao (V2x) +c (f) t+e 


| 13.3 | DEFINITE AND INDEFINITE INTEGRALS 


All the integration solutions so far encountered have contained a constant of integration. 
Such integrals are known as indefinite integrals. Integration can be used to determine 
the area under curves and this gives rise to definite integrals. 

To estimate the area under y(x), it is divided into thin rectangles. The sum of the 
rectangular areas is an approximation to the area under the curve. Several thin rectangles 
will give a better approximation than a few wide ones. 

Consider Figure 13.6 where the area is approximated by a large number of rectangles. 
Suppose each rectangle has width 5x. The area of rectangle | is y(x,)dx, the area of 
rectangle 2 is y(x,)dx and so on. Let A(x,,) denote the total area under the curve from x, 
to x,,. Then, 


A(x,,) © sum of the rectangular areas = > y(x,)dx 
12 


Let the area be increased by extending the base from x, to x. Then A(x) is the total area 
under the curve from x, to x (see Figure 13.7). Then, 


increase in area = 5A = A(x) — A(x,) © y(x) dx 
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xy Kg ee 
Figure 13.6 Figure 13.7 
The area is approximated by (n — 1) rectangles. The area is extended by adding an extra rectangle. 


So, 
6A 
i & y(x) 
In the limit as 6x — 0, we get 


km (84) — 4 _ 
Hook ae = 


Since differentiation is the reverse of integration, we can write 


A= [rar 


To denote the limits of the area being considered we place values on the integral sign. 


The area under the curve, y(x), between x = a and x = bis denoted as 


x=b 
jes 


or more compactly by 
b 
Je 


The constants a and b are known as the limits of the integral: lower and upper, respec- 
tively. Since an area has a specific value, such an integral is called a definite integral. The 
area under the curve up to the vertical line x = b is A(b) (see Figure 13.8). Similarly 
A(aqa) is the area up to the vertical line x = a. So the area between x = a and x = bis 
A(b) — A(a), as shown in Figure 13.9. 


The area between x = a and x = bis given by 


b 
Area = | y dx = A(b) — A(a) 


The integral is evaluated at the upper limit, b, and at the lower limit, a, and the 
difference between these gives the required area. 


444 


A(a) “ 
Figure 13.8 


The area depends on the limits a and b. 
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ha 
a b x 
x b x 
Figure 13.9 
The area between x = a and x = bis 
A(b) — A(a). 


The expression A(b) — A(a) is often written as [A(x)]?. Similarly [x* + 1]} is the value 
of x? + 1 atx = 3 less the value of x* + 1 at x = 2. Thus 


PP? +1B=38?+)-@+)=5 
In general 


[fal = fb) -—f@ 


Note that since 


b 
/ y dx = A(b) — A(a) 


a 


then, interchanging upper and lower limits, 


i y dx = A(a) — A(b) = —{A(b) — A(@)} 


that is, 


b a 
[ra=-/ y dx 
a b 


Interchanging the limits changes the sign of the integral. 


The evaluation of definite integrals is illustrated in the following examples. 


Evaluate 


Example 13.7 


Solution 


2, 1 mT 
@ fe+ia o) fe +tar © | sinx dx 
1 2 0 
2. 
(a) Let J stand for [ x +1 dx. 
1 


2 x3 2 
r= f #+ia=[F 45] 
1 3 1 


The integral is now evaluated at the upper and lower limits. The difference gives the 
value required. 


_ ae ane ate 4 10 
~\3 3 3 3° 3 
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(b) Because interchanging the limits of integration changes the sign of the integral, we 


find 
1 2; 10 
[ e+ie--/ xr +1dx=—-— 
2 1 3 


me 
(c) i, sinx dx = [—cosx]} = (— cos 7) — (—cos0) = 1 — (-1) =2 
0 
Figure 13.10 illustrates this area. 


sin x 


Figure 13.10 
0 Tv x The area is given by a definite integral. 


Note that: 


(1) The integrated function is evaluated at the upper and lower limits, and the difference 
found. 
(2) No constant of integration is needed. 


(3) Any angles are measured in radians. 


Example 13.8 Find the area under z(t) = e~ fromt = | tot = 3. 


. 3 3 et 
Solution Area= [ zdt =, e” dt = S| 
1 1 
e° e& 
s|-[5] 


If the evaluation of an area by integration yields a negative quantity this means that 
some or all of the corresponding area is below the horizontal axis. This is illustrated in 
Example 13.9. 


Example 13.9 Find the area bounded by y = x° and the x axis from x = —3 tox = —2. 


Solution Figure 13.11 illustrates the required area. 
=) 474-2 
fe( 
3 4 |, 


_ [eat] _ font) _ 6 
eh tyes 


The area is 65/4 square units; the negative sign indicates that it is below the x axis. 
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Example 13.10 


Solution 


Example 13.11 


Solution 


Figure 13.12 
Figure 13.11 The positive and negative areas cancel 
Areas below the x axis are classed as negative. each other out. 
(a) Sketch y = sinx for x = —7 tox = 71. 
TT. 
(b) Calculate / sinx dx and comment on your findings. 
—7 
(c) Calculate the area enclosed by y = sinx and the x axis between x = —7t and x = 7. 
(a) A graph of y = sinx between x = —7t and x = 7t is shown in Figure 13.12. 


(b) [ sinx dx = [— cos x]™, = —cos(7) + cos(—71) = 0 


Examining Figure 13.12 we see that the positive and negative contributions have 
cancelled each other out; that is, the area above the x axis is equal in size to the area 
below the x axis. 


(c) From (b) the area above the x axis is equal in size to the area below the x axis. From 
Example 13.7(c) the area above the x axis is 2. Hence the total area enclosed by 
y = sinx and the x axis from x = —7T to x = Tis 4. 


If an area contains parts both above and below the horizontal axis then calculating an 
integral will give the net area. If the total area is required, then the relevant limits must 
first be found. A sketch of the function often clarifies the situation. 


Find the area contained by y = sin x from x = 0 to x = 37/2. 


Figure 13.13 illustrates the required area. From this we see that there are parts both above 
and below the x axis and the crossover point occurs when x = 71. 


A 
/ sinx dx = [—cos x]f 
0 
= —cosm+cos0 = 2 


37/2 
/ sinx dx = [—cosx}?"”” 


™ 
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Figure 13.13 
The positive and negative areas are calculated 
separately. 


The total area is 3 square units. Note, however, that the single integral over 0 to 37/2 
evaluates to 1; that is, it gives the net value of 2 and —1. 


3/2 “as Sn 
sinx dx = [—cosx]) ‘~ = —cos > +cos0= 1 
0 


The need to evaluate the area under a curve is a common requirement in engineering. 
Often the rate of change of an engineering variable with time is known and it is required 
to calculate the value of the engineering variable. This corresponds to calculating the 
area under a curve. 


Engineering application 13.3 


Energy used by an electric motor 


Consider a small d.c. electric motor being used to drive an electric screwdriver. The 
amount of power that is supplied to the motor by the battery depends on the load on 
the screwdriver. Therefore the power supplied to the screwdriver is a function that 
varies with time. Figure 13.14 shows a typical curve of power versus time. Now, 


dE 
P= = 
dt 


where P = power (W), E = energy (J). Therefore, to calculate the energy used by 
the motor between times ¢, and ¢,, we can write 


Wy 
B= || Pdt 
t 


il 


This is equivalent to evaluating the area under the curve, P(t), between f, and f,, 
which is shown as the shaded region in Figure 13.14. 


Power (watts) 


Figure 13.14 
Shaded area represents the energy used to 
drive the motor during the time interval 


ty ty Time (s) fee 
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Engineering application 13.4 


Capacitance of a coaxial cable 


A coaxial cable has an inner conductor with a diameter of 1.02 mm and an outer 
conductor with an internal diameter of 3 mm, as shown in Figure 13.15. The insulator 
separating the two conductors has a relative permittivity of 1.55. Let us calculate the 
capacitance of the cable per metre length. 


Outer conductor 


Cable sheath 


1.02 mm 3 mm 


iearibvieve Figure 1 3.1 5 
Cross-section of a coaxial 
Inner conductor cable. 


Before solving this problem it is instructive to derive the formula for the capac- 
itance of a coaxial cable. Imagine that the inner conductor has a charge of +Q per 
metre length and that the outer conductor has a charge of —Q per metre length. Fur- 
ther assume the cable is long and a central section is being analysed in order that end 
effects can be ignored. 

Consider an imaginary cylindrical surface, radius r and length /, within the insula- 
tor (or dielectric). Gauss’s theorem states that the electric flux out of any closed sur- 
face is equal to the charge enclosed by the surface. In this case, because of symmetry, 
the electric flux points radially outwards and so no flux is directed through the ends 
of the imaginary cylinder; that is, end effects can be neglected. The curved surface 
area of the cylinder is 27trl. Therefore, using Gauss’s theorem 


D x 2nrl = Ql 


where D = electric flux density. 

When an insulator or dielectric is present then D = ¢,¢), where E is the electric 
field strength, ¢, is the relative permittivity, ¢) is the permittivity of free space and 
has a value of 8.85 x 10~!2 F m™!. Therefore, 


io 2uid) = (0) 


that is, 


i Q (13.9) 
2T1E,E9 


This equation gives a value for the electric field within the dielectric. In order to 
calculate the capacitance of the cable it is necessary to calculate the voltage difference 
between the two conductors. Let V, represent the voltage of the inner conductor and 
V,, the voltage of the outer conductor. 
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The electric field is a measure of the rate of change of the voltage with position. 
In other words, if the voltage is changing rapidly with position then this corresponds 
to a large magnitude of the electric field. This is illustrated in Figure 13.16. The 
magnitude of the electric field at point A is larger than at point B. As a positive 
electric field, E, corresponds to a decrease in voltage, V, with position the relationship 
between E and V, in general, is 


dV 
ia 
This expression can be used to calculate the voltage difference arising as a result of 
an electric field. In practice, this is a simplified equation and is only valid provided 


r is in the same direction as the electric field. If this is not the case, then a modified 
vector form of Equation (13.10) is needed. 


(13.10) 


V 


Figure 13.16 


dv 
The gradient of the curve, ae is proportional to the 
Le 


A B r magnitude of the electric field. 


In the case of the coaxial cable, E is in the same direction as r and so 
Equation (13.10) can be used to calculate the voltage difference between the two 
conductors. From Equation (13.10) 


do 
dr 
Therefore the voltage at an arbitrary point, r, is given by 


va- Bar 


Consequently, the voltage difference between points r = b and r = ais given by 


b 
ae -[ ear 


Q al 
—— —dr using Equation (13.9) 
ZigE EE dla 


Q 


OTERe, 


rae) 
== In{ — 
2TE, Eg a 


This gives the voltage of the outer conductor relative to the inner one. Thus the voltage 
of the inner conductor relative to the outer one is 


y= = =p In is 
" 2TE, Eo a 


[In r}? 
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More generally, capacitance is defined as C = Q/V, where V is the voltage differ- 
ence. Therefore 
Q ___ 2n8,£ 


Veena e 
a 


Note that this is the capacitance per unit length of cable. Using ¢, = 1.55, a = 
5.1 x 10-4 m, b = 1.5 x 107? m, we get 


2m x 155508:85 x 1O-"2 


iis Oe 
Lat (| —=———_— 
Sall x LO= 


C= 


= 7.99 x 107! = 80 pF m7“! 


Engineering application 13.5 


Characteristic impedance of a coaxial cable 


A commonly quoted parameter of a coaxial cable is its characteristic impedance, Z). 
The characteristic impedance is the ratio of the voltage to the current for a propagat- 
ing wave travelling on an electrical transmission line in the absence of reflections. 
The value of Z, is easy to select at the design stage by carefully choosing the dimen- 
sions a and b together with the type of insulating material within the cable. The two 
most common characteristic impedances for flexible cables are approximately 50 Q 
and 75 (2. The main reason for selecting 50 Q is that it represents a good compro- 
mise between the ability to handle high power and the minimization of losses that 
occur in thermoplastic dielectrics. The value of 75 Q is mainly considered optimal 
for situations of low power transmission and where losses are the most important 
consideration. Often these 75 Q cables are of the air-dielectric type where the inner 
conductor is supported by a spacer rather than a solid plastic dielectric. An example 
of an application for a 75 Q cable is the connection from a rooftop TV antenna to a 
TV set. 

It can be shown from fundamental transmission line theory that the characteristic 
impedance of a loss-free cable is 

Zz L 
cae 

It can be shown that the expression for the inductance of a coaxial cable is given by 


Ho b 
L=—In{ - 
sein( 2) 


As shown in Engineering application 13.4, the expression for the capacitance is 
_ 278,€o 


1 


13.3.1 
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Substituting for L and C in the equation for Z) 


b 
Poin (2) 
z De \ Gi _ | [eX in (2) 
2TE,Eq fin ( ”) 27Y E,€9 a 
a 


For the cable defined in Engineering application 13.4, and substituting for the per- 
meability of free space, 4) = 47 x 10’, 


T 1.5 x 1073 
Fo / at) (ee ) =f 
nV 1.55 x 8.85 x 10-2 \051 x 10 


Use of a dummy variable 


Consider the following integrals, J, and J,: 


1 1 
nef ea n= [ea 
0 0 


Then, 


a Py fl Ht 
ela) 
a al ae il 
=[3],-G)-9=3 


So clearly J; = J,. The value of J, does not depend upon ¢, and the value of J, does not 
depend upon x. In general, 


b b 
t=[ foa=f pou 


Because the value of J is the same, regardless of what the integrating variable may be, 
we say x and ¢ are dummy variables. Indeed we could write 


b b b 
i= figae= finar= [foray 


Then z, r and y are dummy variables. 


. i : 7/3 Tt 
Evaluate the following integrals: (e) [ sint dt (f) / sin(t + 3) dt 
0 0 
3 4 
1 n/2 2 
3 
(a) / xdx — (b) eo @ | cos 3¢ dt (h) / cos 7tt dt 
0 1 


1 1 12 
(c) / 2 dx (d) / e* dx (i) ‘| tanx dx 
0 -1 1 
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Evaluate the following integrals: 


1 
@ fo P+ia 
0 
wf ta 
0 1+2 


2 
(c) / 3e2* — 2e3* dx 
1 
2 
(d) / (x + 1)(@+ 2) dx 
1 
2 
(e) i 2 sin 4t dt 
0 
" t 
4 =| dt 
(f) [ cos 5 
Evaluate the following integrals: 


@ fe + 0.5t — 6 dt 
an 
© [are 
(c) [ oe — 36 * dx 
1 
2 
(d) [ 3 sin(4t — 7) + Scos(3t + 1/2) dt 
0 
1 
© | 2 tant dt 


15 1 
o qoat 
1+ x2 V4—x2 


Calculate the area between f(t) = cost and the ¢ axis 
as t varies from 


Tt 
to — 
(@) Oto; 


(b) Oto = 

ip 
2 
3n 

(c) Pa (d) Oto7 


Calculate the total area between f(t) = cos 2r and the 
t axis as t varies from 


Solutions 


1 


(a) 20 
(d) 2.3504 
1 


(g) ~3 


(b) 1.3863 
(e) 0.5 


(c) 2 
(f) —1.9800 


(h) 0 (i) 0.3995 


(b) (c) —184.75 


T 
4 
0 


aly ele 


(ec) 0.5728 (f) 8 


10 


11 


12 


13 


fa) Oto~ = (b) = to = =) Oto 
a ri 4°95 (e 5 


Calculate the area enclosed by the curves y = x? and 
yH=x 


Calculate the area enclosed by the graphs of the 
functions y = 1? + 5 and y = 6. 


Evaluate the following definite integrals: 
1.5 1 1 1 
(a) =F = + —— dt 
1 ¢ e  sint 


4 § 
b —— dx 
© ff 8 + 3x2 


1 
(c) / sinhx dx 
-1 
1 
(d) / coshx dx 
-1 
Calculate the area between y = 2 tant and the ¢ axis 


for-l <t< 1.4. 


Find the area between y = sint, y = cost and the y 
axis, fort > 0. 


The velocity, v, of a particle is given by 
v=(1+01)? 


Find the distance travelled by the particle from t = 1 
to t = 4; that is, evaluate i v dt. 


Evaluate the area under the function x = 1/t for 
1<t< 10. 


Evaluate 


2 
(a) / gitas 
3 


™ 
© | sinx cos x dx 
0 


i 
(b) [ (e)? de 
0 


2 
(d) / sinh?x dx 
1 


65 


@=—, (b) 2.275 (c) —0.639 
(d) 0.9255  (e)_ 1.2313 (f) 1.1175 
(a) 0.7071 (b) 1 
(c) 0.7071 (d) 2 
(a) . (b) : (c) I 

g 2 


1 

6 

4 

3 

(a) 1.0839 (b) 0.6468 
(c) 0 (d) 2.3504 
4.7756 
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0.4142 

39 

2.3026 

(a) —3.6202 (b) 3.1945 
(c) 0 (d) 5.4158 


REVIEW EXERCISES 13 


1 


3 


Find the following integrals: 


(a) [oe tra (b) [Gtatre 


() fatraa-o dz @) [¥- Gu 
(e) eae + 4x3 dx 
Given 

dy 


2 ‘ 
— =x +sinx+cos2x+ 1 
x x 


find an expression for y(x). 


Find the following integrals: 
(a) / 3e* + - dx 

(b) [oc +e*)(l—e™*) dx 
(c) / 2e™ + 1 dt 

(d) feo +e") dx 

(e) i: det — e*! dt 

Find the integrals 

(a) / sin 2x + cos 2x dx 


(b) [eosin — cost dt 


(c) [0o(5) dt 
(d) i sin(7 — z) + cos(m — 2z) dz 


(e) [rane +7) dt 


(f) [rina r2sn(2) dt 


5 


Find the following integrals: 


(a) [ coseecar + 7) dt 
(b) fox(5 ap ) dx 
2 
(c) feo) dt 
(d) [remeo(3 _ ) dy 


(e) / ; cot(m — 2z) dz 
(f) / : sec(2t — 7) dt 


Find the following integrals: 
4 1 
(a) ————. du (b) ———— _ dv 
V1—v? 2/1 — v2 


1 1 
5h a 
() lem (d) ae 


2 1 
(e) [se f) f= 
‘ V1 — 412 : V 36 — 9x2 


3 
>= dx 
(g) i Pa 
The speed, v(t), of a particle is given by 
v(t) =t+e’ 


(a) Find the distance travelled by the particle. 


(b) Calculate the distance travelled between t = 1 
and t = 3. 


The capacitance of a capacitor is 0.1 F. The current, 
i(t), through the capacitor is given by 


i(t) = 50 sin mt 


Derive an expression for the voltage across the 
capacitor. 
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9 By using suitable trigonometric identities find 
(a) ; sin? 2t + cos? 2r dt 


(b) / sin 2t cos 2t dt 


1 
(c) / a dt 
cos 2 4 
(d) i ane 
ar 4 
(@) / cone" 


10 By expressing 


2x +x+4+2 
xetx 
as its partial fractions, find 


= 
we4+x 


11 Evaluate the following definite integrals: 
1 
(a) [ Of +7 dt 
(b) ie = dx 
1 ris 
(c) ih T=t4-7 @ 
0 
2 
(d) / (z+ 1)(z4+2) dz 
0 
4 
(e) / Vx dx 
l 


-ly_ 
i / 
-2 


12 Evaluate the following definite integrals: 


1 1 
(a) / ext dx (b) / et 
0 —l 
2 1 
(c) , (®—1)* dz = (d) / e 2 + e* 
1 0 


0 
(e) / x+e* dx 
-1 


—e'+1dt 


13 Evaluate the following integrals: 


7/2 
(a) i 2 sin 3t dt 
0 


™ 


(b) sin 2t — cos 2t dt 
7/2 


7/4 
(c) i tant +t dt 
0 


14 


15 


16 


17 


18 


19 


20 


21 


7/4 ; t t 

@ | sin 3 + cos 3 dt 
0.1 

(e) J tan 3¢ dt 
0 


Evaluate the following definite integrals: 


2re/k 
(a) / sin kt dt 
0 


2n/k 
(b) / cos kt dt 
0 


where k is a constant. 


Evaluate the following definite integrals: 


0.5 
(a) ‘ cosec(2x + 1) dx 
0 
0.1 
(b) i} sec 3t dt 


7/4 
(c) [ tan(x + 7t) dx 
7/4 


Evaluate the following definite integrals: 
> 3 
(a) / >= a 
0 J9— x2 
: 8 
b dx 
©) [ 9+ x2 
oe | 
(c) / > de 
0 /8—2x2 


3 4] 
d —~ dt 
@) / 10 + 472 
3 


(a) Calculate the area enclosed by the curve y = x”, 
the x axis and x = 2. 


(b) Calculate the area enclosed by the curve y = x°, 


the y axis and y = 8. 


Find the total area between f(t) = t? —4 and thet 
axis on the following intervals: 
(a) [—4, —3] (b) [—3, -1] 


(c) [0, 3] (d) [-3, 3] 


Calculate the area enclosed by the curve 


y =x? — x — 6 and the x axis. 


Calculate the total area between y = x* — 3x — 4 and 
the x axis on the following intervals: 

(a) [-2, -1] (b) [—2, 1] 

(c) [2,4] (d) [2,5] 


Calculate the area enclosed by the curve y = x? and 
the line y = x. 


22 Calculate the area enclosed by y = x* + 4 and 


y= 12-27. 


Solutions 


5 

xX — (es 

2 
2In|t| +0? + 2t +c 


il! 
2 —~-—cosx+ a 


3 
3 (a) 


(b) 
(c) 
(d) 
(e) 
(f) 


3e* —3e *+e 
ete *+c 


—2t 
ait = ey 
2 
a cos 2x + § imei Cc 
2 2 


—2cost — sint +c 


cos(7t — z) 


8 ln +c 


1 
in(7t — 2z) 4 
5 sin( z+e 


In|sec(t + 7t)| +c 


: 3t—6 ‘ tb 
3 008 cos 3 c 


1 
5 (a) 3 In| cosec(3t + 7t) — cot(3t + 7)| +c 


(b) 2In «(5 Pe ) 2. wo(§ + ) 


(c) 2In ("Fee 
(d) 91n cone} - ) =e -) 


(e) 


1 
—q in| sin@e 22) +e 


+¢ 


+e 


23 


11 


12 


13 


14 
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455 


2. 
Calculate the area enclosed by y = sinx and y = = 
Tt 


1 
(f) 3 In | sec(2t — 7c) + tan(2t — 7t)| +c 


(a) Asin! uy +c 


(b) sin ne: 
— sin Uv Cc 
2 


(g) V3 tan7! (=) +e 


wo -e ie paar 


2 


500 
—— cos7i +c 
™ 


a 


(c) ~=In|cosec 2t — cot 2t| +c 
(d) =In|sin2t}+c 
(e) 5 In| See at) 6 


2In |x| + tan~! x +c 


es b) 0.1972 = 
@ 5 (b) 0. Clara 
38 14 
a = Os (f) 2.3863 
(a) 17.2933 (b) 3.2765 (c) 15.2630 
(d) 3.6269 (e) 0.1321 
(a) : (b) —1 (c) 0.6550 
(d) 0.9176 (e) 0.0152 
(a) 0 (b) 0 
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15 (a) 0.5238 (b) 0.1015 
16 (a) 2.1892 (b) 0.4290 

(c) 0.3702 (d) 0.0825 
17 (a) 4 (b) 12 


25 23 
18 (a) 3 (b) 4 (c) a: 


(c) 0 


(dd) > 


22 


23 


17 by Of i= 
@) | irs Pee Tae 


0.4292 


61 
(d) . 


apr Cee 
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i’ INTRODUCTION 


The previous chapter showed us how to integrate functions which matched the list of 
standard integrals given in Table 13.1. Clearly, it is impossible to list all possible func- 
tions in the table and so some general techniques are required. Integration techniques 
may be classified as analytical, that is exact, or numerical, that is approximate. We will 


now study three analytical techniques: 
(1) integration by parts; 
(2) integration by substitution; 


(3) integration using partial fractions. 


i974 6INTEGRATION BY PARTS 


This technique is used to integrate a product, and is derived from the product rule for 
differentiation. Let u and v be functions of x. Then the product rule of differentiation 


states: 


du 4 du 
v+u 
dx dx 


<w) = 


Rearranging we have 


du 
dx 


du d 
u— = —(uv)—v 
dx dx 
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Integrating this equation yields 


d d d 
uP de= f Sow) dr~ fv tar 
dx dk de 


Recognizing that integration and differentiation are inverse processes allows 


d(uv) 
[ee 


to be simplified to wv. Hence, 


fo()anm- fo(S) 


This is the formula for integration by parts. 


Example 14.1 Find f sind 


Solution We recognize the integrand as a product of the functions x and sinx. Let u = x, 


7 =sinx. Then i =1, v=—cosx. Using the integration by parts formula we get 


[ ssinxdr = x05) = [ cos)1 ds 


= —xcosx + sinx +c 
When dealing with definite integrals the corresponding formula for integration by parts 
is 
b b 
dv d 
[ a ax = twit — f vo( ) ax 
a \dx a \dx 


Example 14.2 Evaluate 


2 
i xe” dx 
0 


Solution We let 


d du ; 
“= an —=e 
. dx 
Then 

d 

= 1 and v=e 
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Using the integration by parts formula for definite integrals we have 


2 2 
[ xer= beg [ e*- Idx 
0 0 


= 2e” — [e*]} 
= 2e” —[e? — 1] 
=e+1 


Sometimes integration by parts needs to be used twice, as the next example illustrates. 


Example 14.3 Evaluate 


2 
/ ve dx 
0 


Solution We let 


— d —=e 
u=x an oF 
Then 
du 
—=2 and v=e 
di x 


Using the integration by parts formula we have 


2 2 
/ xe dx = [xe] - / 2xe* dx 
0 0 


2 
= 4e*—2 / xe" dx 
0 
2 
Now / xe” dx has been evaluated using integration by parts in Example 14.2. So 
0 


2 
i! xe’ dx = 4e? — 2[e* + 1] = 2e* — 2 = 12.78 
0 


The next example illustrates a case in which the integral to be found reappears after 
repeated application of integration by parts. 


Example 14.4 Find 


: e’ sint dt 


460 Chapter 14 Techniques of integration 


Solution 


Example 14.5 


Solution 


We let 
du 
u=e' and —=sint 
dt 
and so 
du 
—=e and v = —cost 
dt 


Applying integration by parts yields 


fe sint dt = -e' cost — f (cost) dr +c 


= -e'cost + fe costar +c 


We now apply integration by parts to fe‘ cost dr. We let 


v 
u=e' and — =cost 
dt 
Then 
du ; . 
—-=6 and v = sint 
dt 


So 
[ecoseat =e’ sint — / e’ sint dt 
Substituting Equation (14.2) into Equation (14.1) yields 
/ e’ sint dt = —e’ cost + e’ sint — / e’ sintdt +c 
Rearranging the equation gives 
2 | e’ sint dt = —e’ cost + e’ sint +c 


from which we see that 


a —e'cost+e'sint+c 
e sint dt = 


2 


Evaluate 


2 
/ x"e* dx 
0 


forn = 3,4, 5. 


(14.1) 


(14.2) 


The integral may be evaluated by using integration by parts repeatedly. However, this 
is slow and cumbersome. Instead it is useful to develop a reduction formula as is now 


illustrated. d d 
v ; u 
Let u = x" and — = e*. Then oe = nx"! andv =e". 
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Using integration by parts we have 


2 2 
- xe dx = [x"e* 2 — / nx"—'e* dx 
0 0 


2 
= 2" — nf x" e* dx 
0 


Writing 
2 
Le / vet dx 
0 
we see that 
2 
Lin free 
0 
Hence 
I, = 2" —nl,_, (14.3) 


Equation (14.3) is called a reduction formula. 
2 


We have already evaluated J,, that is i xe* dx in Example 14.2, and found 
0 


L=et+1 


Using the reduction formula with n = 2 gives 


2 
/ xe" dx = 1, = 2’e* — 21, 
0 
= 4e” — 2(e? + 1) 
= 2e°-2 
Note that this is in agreement with Example 14.3. 
With n = 3 the reduction formula yields 
2 
i xe" dx = I, = 23e* — 31, 
0 
= 8e” — 3(2e? — 2) 
= 2e" +6 
With n = 4 we have 
2 
/ x‘e* dx = I, = 2c? — 41, 
0 
= 16e” — 4(2e” + 6) 
= 8e” — 24 
With n = 5 we have 
2 
i xe" dx = 1, = 2°e* — 5/, 
0 
= 32e” — 5(8e” — 24) 
= 120 — 8e” 
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EXERCISES 14.2 


Use integration by parts to find the following: 
(a) - xsin(2x)dx — (b) / te! dt 
_ fv 
(c) [ scosxax (d) [2»s0(3) du 
©) / = de 
a 
Use integration by parts to find 
(a) / tint dt 
(b) / Int dt 
(©) f inca (n 4-1) 
(d) ; t sin(at + b) dt a, b constants 
(e) j tet t dt a, b constants 
Evaluate the following definite integrals: 
1 7/2 
(a) / x cos 2x dx (b) [ x sin 2x dx 
0 0 
1 3 
(c) / te’ dt (d) / ? Intdt 
=i 1 
2 
©) J at 
Find 
(a) / Pen dt 
Solutions 
sin2x  xcos2x 
(a) = 


4 2 


(b) o(; - 5) +c 


(c) cosx+xsinx+c 


(d) ssin(5) —4v co( 3] +e 


(e) -—e*(x+1)+c 


(b) / t? cos 3t dt 


(c) ie sin . dt 
2 
Evaluate the following definite integrals: 
2 
(a) i re! dt 
0 
l 
(b) / ?? sint dt 
-1 
l 
(c) i 1? cos 3t dt 
0 


Obtain a reduction formula for 


L,= / te! at n, k constants 


Hence find [Pe ar, f Pe dt and fe dt. 


Use integration by parts twice to obtain a reduction 


formula for 
7/2 
I= / t” sint dt 
0 
7/2 7/2 
Hence find i P sint dt, / 1 sint dt 
0 0 
7/2 
and / t’ sint dt. 
0 


Use integration by parts to find 


7/2 ; 
/ e** cos x dx 
0 


(b) tInt—t+c 
(Int)e?+! prtl 
+c 
n+1 (n+ 1)? 


sin(at+b)  tcos(at +b) 
2 ee 
a a 


t 1 
(e) ar! = 4 +C 


(c) 


(d) 


3 


5 
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(a) 0.1006 — (b) 0.7854 ~—(c):1.9488 mK fee x ; e* (97 — 6t + 2) 
— -1) > 
(d) 6.9986 (e) 0.4542 " k k” 27 
e! (272 — 2t +1) e* (943 — 91? + 6t — 2) 
a ae 27 ; 
e! (2714 — 3613 + 361? — 241 + 8) 
5 ane ee ee 81 
(b) stcos t + 3737) 5m t+e : oy 
7 rf = Th 2 = (n aa 1)I,-2 
t t 
(c) 8¢ an(5) 27 —8) co(5) +e 1.4022, 2.3963, 4.5084 
8 4.2281 


(a) 12.7781 (b) 0 (c) —0.1834 


14.3 


Example 14.6 


Solution 


Example 14.7 


Solution 


INTEGRATION BY SUBSTITUTION 


This technique is the integral equivalent of the chain rule. It is best illustrated by 
examples. 


Find / (3x4 1)"" dx 


d d 
Let z = 3x + 1, so that ;. = 3, that is dx = ma Writing the integral in terms of z, it 
becomes 


rl 1 172" 1 (3x4. 15°7 
2.77 q == 2.74 es (are ae 
[2ze 5 | ete a\37)/7°=3 37 1S 


3 
Evaluate / t sin(t”) dr. 
2 


du 
Let v = f* so ae = 2r, that is 


dt = — dv 
2t 


When changing the integral from one in terms of ¢ to one in terms of v, the limits must 


also be changed. When ¢ = 2, v = 4; whent = 3, v = 9. Hence, the integral becomes 


sind 
[= veal cos v]} =5I- cos 9 + cos 4] = 0.129 


Sometimes the substitution can involve a trigonometric function. 
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2. 
Example 14.8 Evaluate / sint cos’ t dt. 
1 


: dz : eee 
Solution Put z=cosf so that — = —sinf, that is sint dt = —dz. Whent = 1, z = cos 1; when 


t = 2,z = cos2. Hence 


cos2 


2 cos 2 23 
/ sint cos? dt = ~ f Zaz =-|5| 
1 cos | 3 eel 


cos? 1 — cos? 2 


= ————— = 0.0766 
3 
elanx 
Example 14.9 Find ; 57 ox 
cos? x 
: dz 2 
Solution Put z= tanx. Then — = sec’ x, dz= . Hence, 
dx cos? x 
elanx 
f= ar dx = feare +coxe™*+¢ 
cos 


dx 
to be integrated. 


Integration by substitution allows functions of the form 


2, 
1 
Example 14.10 Find f ax 
wP+x+2 


. d . 
Solution Put z= x? +.x+2, then = — 3x? +1, that is dz = (3x2 + 1) dx. Hence, 


3x7 +1 d 
[ere [ Sei tenin eta) te 
x +x4+2 Z 


Example 14.11 Find [UE as 


: d d 
Solution Putz=/f. Then a a. that is 
dx dx 
d 
dz = df dx. 
dx 
Hence, 
df/dx 
[+ fl dx = i =In|z| +c =In|fl +c 
and so 


[ Fee miise 


1 
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The result of Example 14.11 is particularly important. 


[ FAaamipste 


4 342 
3t° + 2t 
Example 14.12 Evaluate f Aaa Sel t 
2 P+P+1 


Solution The numerator is the derivative of the denominator and so 


4 377 +2 a ri 
[ Fee = nie +P + UT = In81 — In 13 = 1.83 


Example 14.13 Find 
@ fsa 
a ——— 
5x -7 
t 
b ——_ dt 
© fsa 
el/?2 
(c) / rer dt 
Solution The integrands are rewritten so that the numerator is the derivative of the denominator. 
@ f gant f Ayers Gine-714 
set eee ee “ 


o) | ’ a=>f sa dt 
r+ 2 e+ 


=5in+li+e 


eff? rl /2 
—= _ i 


EXERCISES 14.3 


Use the given substitutions to find the following 2 Evaluate the following definite integrals: 
integrals: 2 m/2 

(a) / (2t +3)! dt (b) ; sin 2t cos* 2t dt 
(a) fore nra. z=4x+4+1 1 0 


1 2, 
i 312e" dr d i J4 + 3x dx 
(b) [Psine? + an, z=Ptl (c) [ e (d) : Xx 
(c) oer dt, z= ’ f sin( /%) - 
e eae 
1 Ve 


d tag a t=1- 
(@) [i 2) i 3. Find the area between y = x(3x2 + 2)4 and the x axis 


5 . from x = Otox = 1. 
(e) cos t(sin> ft) df, Z=sint 
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4 Find 5 Evaluate the following: 
a / 4x+1 ai (b) <sS 2 sin 2x 1 2 m/2 
a) 2 ; 
ee eS er al (a) [ (4 3a) dx (b) [ sint./cos t dt 
(c) [x dx (d) Ix = dt [ 34x [ evi 
= > dx (d — dt 
9— 2 2+ ©} areaqt® Of F 
1 
— dt 2 
(©) len (e) / xsin(m — x”) dx 
0 
Solutions 
8 3 
Ba) oe ze #8 tO be 4 (a) In(2x?+x+3)+ce 
(b) —In(cos2x +7) +c 
2 3 
—Ie-t : an _ ,)4/3 
(c) —2e" +e (d) ra ye te (c) —Fin(9 — 2x) +e 
1. 
(c) zsin’t te (d) pin + Ite 
2 (a) 3.3588 x 10° (b) 0.2 (c) 1.7183 (e) Indnr) +c 
(d) 5.2495 (e) 0.7687 5 (a) 0.5 (b) 0.6667 (c) 0.3769 


(a INTEGRATION USING PARTIAL FRACTIONS 


The technique of expressing a rational function as the sum of its partial fractions has 
been covered in Section 1.7. Some expressions which at first sight look impossible to 
integrate may in fact be integrated when expressed as their partial fractions. 


Example 14.14 Find 


1 

(a) / pia dx 

(b) / 13x —4 Ag 

6x2 —x —2 
Solution (a) First express the integrand in partial fractions: 
1 1 A Bx+C 
etx x@+l) x w+) 
Then, 


1 =A(x* + 1) +. x(Bx + C) 
Equating the constant terms: 1 =A so thatA = 1. 
Equating the coefficients of x: 0 = C so that C = 0. 
Equating the coefficients of x*: 0 = A + B and hence B = —1. 


1 


Then, 


1 
dx 
I= 


13x-—4 
6x2 —x-2 


(b) 


[=- 


14.4 Integration using partial fractions 


1 x 
x? +1 


x 


1 x 
=In|x|— =Inj? + 1)+c=In [+e 
bl — 5 Ink? +1 oF 
13x -4 
(2x + 1)(x — 2) 
/ 2 + 2 dx i tial fracti 
=, —— + —_ using partial fractions 
Ox+1 3x—2 aa 
3 2 2 3 
=5/ dee / a 
2J 2x4+1 3) 3x-—2 


3 2 
= 5 Inox +1) + 5 In[3x—2| +e 


— 27° +3t-1 


dt. 


1 43 
Example 14.15 Evaluate f 
0 2r? + 1 


Solution Using partial fractions we may write 


4 — 27? +3r—1 


2r27+1 
Hence, 
{= —2P+3t-—1 
0 2r+1 


EXERCISES 14.4 


By writing the integrand as its partial fractions find 


0 [5 


er 3 4 


8x + 10 


ic) an 
20° +3r+3 4 


Oa 2+ 1) 
© fee eee ae 


: t 
2t — 1+ ——_ 
i + oP 


1 1 
= [1-14 513] -[o-o+ jini] = 0.275 


1 
dt = [@—e+ Fine +1] 


Evaluate the following integrals: 


Bb 4 
(a) iy lady 
) : 3x +5 
. (x + 1)(%+ 2) 
(c) i 3x 

1. 2xe+ a 


4 1 
of X 


pee 


x2 4+2x-1 
ofa @+DG-1) 


467 


1 


0 
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4x +7 


Wa eng and the x 
4x* + 8x4+3 


Find the area between y = 


axis from x = Otox = 1. 


Use partial fractions to find 
3 
3x +2 
(a) / 5 dx 
2 x -1 


Solutions 


(a) 3Inx—2Inw~+1)+c 
(b) 2In¢@+1)—-—In@—-1)+c 


1 
(c) 3 In(2x + 3) + 5 In(2x + l)+ec 


(d) In@@+1)+ we 8 =F 


(e) 2In(x+1)—- 2 +c 
x 


(a) 2.1587 
(b) 1.7918 


() / t+3 
era ee 


e peeve, 
p+t 


6t +3 
d dt 
© {oo Sst+2 


(c) —0.09971 
(d) 0.1823 
(e) 0.9769 


1.2456 
(a) 1.8767 


2 
b) In(@ + 1) — —— 
(b) In@ + 1) she 


1 
(c) 3tan! r+ In? +1) +Int +e 
(d) 5In@¢ —2) —21In(2t— 1) +c 
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1 


Use the given substitution to find the following 
integrals: 


1 
(a) [ or+2y! ar z=9+2 
0 


5 
(b) / (—t + 1)° dt z=-—t+1 
3 


3 
(c) [aera z=4x—-1 
6 
(d) [via z=3t+1 
(e) for-n"y z=9y-2 
2 3 
a ad =2 
(f) [ (0 +5)6 z y=2z+5 


(g) [esine®yar z=P 
(h) / Pert! ay 


0.5 
(i) / sin(2r)eS) dr = z = cos(2F) 
0 


z=041 


T™ 
Qj) / sin t cos” t dt z= cost 
0 


(k) [os tv sint dt z= sint 


2 Use integration by parts to find 


7/2 7/2 
(a) / e~ cos x dx (b) / e-* sinx dx 
0 0 


Find 


using 
(a) integration by parts 
(b) the substitution z = Int. 


The integral J, is given by 


7/2 
I =} sin” 6 dO 
0 


(a) State I,» 
(b) Show 
n—-1 
[= i) 


n n= 
n 


(c) Evaluate J, /,, J, and J,. 


Evaluate 


2 


1 


2 


3 


7/3 
(b) i sint cost dt 


3x —7 


Evaluate 
13 
: 2 
(a) [ a elena t dt 
(b) i 4t2e +11 +22)! at 


(c) [ sin? wt + cos? wt + w dt 
0 


@ constant 


fa) - Late a, 
1 


t(1 7+) ° 
(e) - (t +e!) sint dt 


3 
eae 
© f 5a 
1 2x+4x 


1+ 2e 


Calculate the area under y(x) = 


tox = 3. 


Solutions 


(a) 2.8819 x 10? 
(b) 2322 
(c) —1.2 x 10*° 


2 
(d) 5 t +13? +4¢ 


(Sy — 218 
6 
(f) 


9.092 x 1075 
1 
(g) — zoos?) +c 


er tl 
h 
(h) 3 


(i) 0.5009 
‘ 2 
0 3 


(k) 5 (sin t) 


+c 


3/2 4 


(a) 4.2281 (b) 9.4563 


(Int)? 
2 


+e 


2x 


+ e2r 


from x = 1 
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Evaluate the following integrals: 
(a) fo +0.1)* dr 
(b) fo +x) sinxdx 


2, 
(c) / xsin(1 + x) dx 
1 


dt 


6 ¢ 
@ | 
3 Vr+1 
1 
(©) [=a 


5 1 
— a 
of ica 


Find 


@ | 
(c) ean o 


(ec) [ame 


xInx 
Find / xe" dr. 


Zsint cost 4 
1+sin?¢ 


cost ai 
10+ sint 


Oy eee 


(d) / ive aT dt 


(a) I, 


7/2 
= [ sin”? 6 d0 
0 


™ 


(c) =, 1, 0.7854, 0.6667 


a 


(a) —Injeti[te 


(b) 8 


(c) 0.2657 
1 
In |x — 2| 


(d) 5 21n |x — 3| + 
(e) Injew+1]/ +c 

(a) 1.5778 (b) 317.3 
(d) 1.0149 (e) 1.2105 


3.8805 


(c) m1 +o) 
(f) 0.9730 


(—2t +0.1)° 
(a) 10 


(b) —(1+x)cosx+sinx+c 
(c) 0.7957 
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5 
In|x —4| +c 
2 
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(d) 2.9205 (c) Indi +Int) +c 
1 _ 
(e) In|t| — Inj + 1] + = +e (d) —In(l+e“)+ce 
(f) 0.3066 (e) In(@vInx) +¢ 
: 5 cal a ea 
9 (a) In(sint+10)+c 10 € = uae 


(b) In(sin?t +1) +c 


= ree ae a 


Contents 
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INTRODUCTION 


Currents and voltages often vary with time. Engineers may wish to know the average 
value of such a current or voltage over some particular time interval. The average value 
of a time-varying function is defined in terms of an integral. An associated quantity is 
the root mean square (1.m.s.) value of a function. The r.m.s. value of a current is used in 
the calculation of the power dissipated by a resistor. 


AVERAGE VALUE OF A FUNCTION 


Suppose f(t) is a function defined on a < t < b. The area, A, under f is given by 


a= fdt 


A rectangle with base spanning the interval [a, b] and height h has an area of h(b — a). 
Suppose the height, h, is chosen so that the area under f and the area of the rectangle are 
equal. This means 


b 
h(b—a) -| fdt 
Io fat 
b-—a 


Then / is called the average value of the function across the interval [a, b] and is illus- 
trated in Figure 15.1: 


h= 


> F dt 


average value = 
b-—a 
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Example 15.1 


Solution 


fO 


Figure 15.1 
The area under the curve from t = a tot = band the 
a bt area of the rectangle are equal. 


Find the average value of f(t) = t* across 


(a) [1,3] 
(b) [2,5] 


(a) average value = 


3-1] 34 3 
fed ipeP 
(b) average value = a3 = sls, = 13 


Example 15.1 shows that if the interval of integration changes then the average value of 
a function can change. 


Engineering application 15.1 


Saw-tooth waveform 


Recall from Engineering application 2.2 that engineers frequently make use of the 
saw-tooth waveform. Consider the saw-tooth waveform shown in Figure 15.2. 


Figure 15.2 
A saw-tooth waveform. 


Calculate the average value of this waveform over a complete period. 
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Solution 
We first need to obtain an equation for the waveform. We choose the interval 
OXF BD, 

The general equation for a straight line is 


v=mt+c 
When ¢t = 0 then v = 0. So 

0=O0+c 

c— 0) 
When ¢t = 2 then v = 5. So 

5 = m(2) 

iS 25) 
Hence 

D = 250 


The average value is given by 


1? 1 [2.50277 
sf ed 
US Lp an 


2 
ll 


Engineering application 15.2 


A thyristor firing circuit 


Figure 15.3 shows a simple circuit to control the voltage across a load resistor, R,. 
This circuit has many uses, one of which is to adjust the level of lighting in a room. 
The circuit has an a.c. power supply with peak voltage, V,. The main control element 
is the thyristor. This device is similar in many ways to a diode. It has a very high 
resistance when it is reverse biased and a low resistance when it is forward biased. 
However, unlike a diode, this low resistance depends on the thyristor being “switched 
on’ by the application of a gate current. The point at which the thyristor is switched 
on can be varied by varying the resistor, Rg. Figure 15.4 shows a typical waveform 
of the voltage, uv, , across the load resistor. 

The point at which the thyristor is turned on in each cycle is characterized by 
the quantity wT, where 0 < a < 0.25 and T is the period of the waveform. This 
restriction on @ reflects the fact that if the thyristor has not turned on when the supply 
voltage has peaked in the forward direction then it will never turn on. 

Calculate the average value of the waveform over a period and comment on the 
result. 
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Figure 15.3 
A thyristor firing circuit. 


Figure 15.4 
Load voltage waveform. 


Solution 
The average value of load voltage is 


yes ie eae 2a 
= v, dt = = V, sin{ —— }dt 
dk Tans fe 
Veer Te Ne 
8 | -cos( ie = 5, (1 + cos 27) 


If w« = O, then the average value is V,/7t, the maximum value for this circuit. If 
a = 0.25, then the average value is V,/27t which illustrates that delaying the turning 
on of the thyristor reduces the average value of the load voltage. 


1 


EXERCISES 15.2 


Calculate the average value of the given functions os 
across the specified interval: 

(a) f(t) = 1+¢ across [0, 2] 

(b) f(x) = 2x — 1 across [—1, 1] 

(c) f(t) =? across [0, 1] 

(d) f(t) =? across [0, 2] 

(e) f(z) =22 +zacross [1, 3] 

Calculate the average value of the given functions 

over the specified interval: 

(a) f(x) =x across [1, 3] 

(b) f(x) = : across [1, 2] 4 


(c) f= a/t across [0, 2] 
(d) f(z) =2 — Lacross [—1, 1] 


(ec) f= 5 across [—3, —2] 


Calculate the average value of the following: 


(a) f(t) = sint across [o, ;| 


(b) f(t) = sint across [0, 7t] 


(c) f(t) = sinat across [0, 7t] 
(d) f(t) = cost across [o | 


(e) f(t) = cost across [0, 7] 
(f) f(t) = cos at across [0, 71] 
(g) f(t) = sin@t + cos wt across [0, 1] 


Calculate the average value of the following 
functions: 


(a) f(t) = Vt+ 1 across [0, 3] 
(b) f(t) =e! across [—1, 1] 
(c) f(t) =1+e! across [—1, 1] 


15.3 Root mean square value of afunction 475 


Solutions 
1 4 19 1 2 
1 (a) 2 (b) -1 (c) 3 (d) 3 (e) 3 (c) —-[1 —cos(7w)] (d) — 
TID TT 
2 (a) 10 (b) 0.6931 (c) 0.9428 (e) 0 go 
TID 
(d) —-1 (e) : is 1+ sinw—cosw 
3 2 iS 14 : 
ae a 4 (a) os (b) 1.1752 (c) 2.1752 


MET R007 MEAN SQUARE VALUE OF A FUNCTION 


If f(t) is defined on [a, b], the root mean square (r.m.s.) value is 


LEM) at 
b-a 


rm.s. = 


Example 15.2 Find the rm.s. value of f(t) = ¢? across [1, 3]. 


; fe (P)2 dt / frrde | [s/sp [242 
lution = = = = ,/—=492 
Solutio r.M.s. =i 4 7 5 ia 


Example 15.3 Calculate the r.m.s. value of f(t) = A sint across [0, 271]. 


| & = t dt 
Solution rm.s. = 4{ ——____ 
a Af = 


Thus the r.m.s. value is 0.707 x the amplitude. 
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Example 15.4 Calculate the r.m.s. value of f(t) = A sin(@t + @) across [0, 27t/a]. 


Solution 


| fy A? sin? (wt + §) dt 
r.m.s. = In/o 


A2w 27t/w 
= =f 1 — cos 2(a@t + @) dt 
Ant 0 


_ Aol sin2(ot + ¢) 7” 
\ 4n 2w 4 


[2 ( _ sin2Qrn+ 9) 


4n \ w 2w 20 


Now sin2(27 + @) = sin(47 + 2@) and since sin(t + @) has period 27t we see that 
sin(47 + 2%) = sin 2¢. Hence, 


A2o 27 A2 A 
rm.s. = = = = 0.707A 
4n ow 2 x/2, 


Note that sin(@t + @) has period 27t/w. The result of Example 15.4 illustrates a general 
result: 


The r.m.s. value of any sinusoidal waveform taken across an interval of length one 
period is 


0.707 x amplitude of the waveform 


Root mean square value is an effective measure of the energy transfer capability of a 
time-varying electrical current. To see why this is so, consider the following Engineering 
application. 


Engineering application 15.3 


Average power developed across a resistor by a time-varying 
current 


Consider a current i(t) which develops a power p(t) in a load resistor R. This current 
flows from time t = ¢, to time t = ¢,. Let P,,, be the average power dissipated by the 
resistor during the time interval Ev B\. We require that total energy transfer, FE, be 
the same in both cases. So we have 


Ep eer = [roe 


fi 
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Now 


p(t) = (i(t))’R 
and so 
Pa) = | Pre 


If we now consider the average power dissipated by the resistor to be the result of an 
effective current /,,, then we have 


uy) 
EIA) i / PR dt 


Mi 


U) 
Lael — 1) = f i dt 
t 


1 


ds 
Te = i) 
€ 
f,—t, 
iE 2 
F Ae dt 
eff a = t, 


We see that the equivalent direct current is the rm.s. value of the time-varying 
current. 


Engineering application 15.4 


Average value and r.m.s. value of a periodic waveform 


Consider the periodic waveform shown in Figure 15.5. 
The current i(t) is 


A@) S20 0<t< 10, period T = 10 


(a) Calculate the average value of the current over a complete period. 
(b) Calculate the r.m.s. value of the current over a complete period. 


i(t) 
20 


Figure 15.5 


Waveform for Engineering application 
0 10 20 30 t 15.4, 
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(a) As the waveform is periodic, we need only consider the interval 0 < t < 10. 


1 1 1 10 
l,== it) dt = — 20e' dt 
aw rf iw Tide ae 


1 = 
= jo! 20¢ ale 


1 
= 70 (720 a! 4. Oe) 


20 —10 
= Flt — 271) = 2 x 0.99995 = 2.000 A 


(b) Loge = 
Here 
LU = 105 ae) —20e" 

First we evaluate 


b 10 
/ Pdt= i 400e-~ dt 
t 0 


1 


400 
i =) @o es e°) 
= 200(1 — e~”’) = 200.00 
So 
he /200.00 
eff 10 =, 0 
= 4.4721 A 
Calculate the r.m.s. values of the functions in Calculate the r.m.s. values of the functions in 
Question | in Exercises 15.2. Question 3 in Exercises 15.2. 
Calculate the r.m.s. values of the functions in Calculate the r.m.s. values of the functions in 
Question 2 in Exercises 15.2. Question 4 in Exercises 15.2. 
(a) 2.0817 (b) 1.5275 (c) 0.4472 (a) 0.7071 
(d) 1.7889 (e) 6.9666 (b) 0.7071 
(a) 12.4957 (b) 0.7071 (c) 1 1 sin7tw cos Tw 


(d) 1.0690 (e) 0.1712 2 27a 


(d) 0.7071 
(e) 0.7071 


1 sin7tw cos tw 
—_ + siamo 


(f) 


27 
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(a) 1.5811 (b) 1.3466 = (c) 2.2724 


REVIEW EXERCISES 15 


1 


Find the average value of the following functions 
across the specified interval: 
(a) f(t) =3-—t across [0, 4] 
(b) f(t) = 1? — 2 across [1, 3] 
1 
(c) f(t) =t+ : across [1, 4] 
(d) f(t) = Vt + 1 across [0, 4] 
(e) f(t) = 27/3 across [0, 1] 
Calculate the average value of the following: 
: Tt 
(a) f(t) = 2sin 2t across [o ;| 
: Tl 
(b) f(t) = Asin 4t across [o | 
(c) f(t) = sint + cost across [0, 7t] 
(d) f(t) = cos = ee 
f(t) = cos 5 across | 0, 5 
(e) f(t) = sintcost across [0, 7] 
Solutions 
7 
(a) 1 (b) 3 (c) 2.9621 
7 3 
d) — ca 
(d) 5 (e) 5 
4 2 
(a) — (bo) 0 (c) = 
7 T 
(d) 0.9003 (e) 0 
k 
-1 
(a) a5 , (b) 0.1663 
(c) 2.8410 (d) 3.6269 
(e) 4.1945 
A? + B? 
(a) average = 0, r.m.s. = 


2 


Calculate the average value of the following 
functions: 


(a) f(t) =Ae™ across [0, 1] 

(b) f(t) = = across [0, 2] 

(c) f(t) =3—e across [1, 3] 
(d) f(t) =e! +e across [0, 2] 
(e) f(t) =t+e' across [0, 2] 


Find the average and r.m.s. values of 
Acost + Bsint across 


(a) [0, 27t] 
(b) [0, 7] 


Find the r.m.s. values of the functions 
in Question 1. 


Find the r.m.s. values of the functions in 
Question 2. 


Find the r.m.s. values of the functions 
in Question 3. 


(b) average = EO ee Ae 
™ 2 

(a) 1.5275 (b) 3.2965 (c) 3.0414 
(d) 2.3805 (e) 0.6547 
(a) 1.4142 (b) = (c) 1 
(d) 0.9046 (e) 0.3536 

e2k _ 
(a) A Ik (b) 0.2887 
(c) 2.8423 (d) 3.9554 
(e) 4.8085 


ys 
1s = 


= a 
Bie a 
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INTRODUCTION 


This chapter examines some further topics in integration. Orthogonal functions are intro- 
duced in Section 16.2. These functions are used extensively in Fourier analysis 
(see Chapter 23). Some integrals have one or two infinite limits of integration, or have 
an integrand which becomes infinite at particular points in the interval of integration. 
Such integrals are termed ‘improper’ and require special treatment. They are used ex- 
tensively in the theory of Laplace and Fourier transforms. The Dirac delta function, 5(¢), 
has been introduced in Chapter 2. The integral properties are examined in Section 16.4. 
The chapter concludes with the integration of piecewise continuous functions and the 


integration of vectors. 
ORTHOGONAL FUNCTIONS 


Two functions f(x) and g(x) are said to be orthogonal over the interval [a, b] if 


b 
il f(x)g(x) dx = 0 


To show that two functions are orthogonal we must demonstrate that the integral of their 


product over the interval of interest is zero. 
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Example 16.1 Show that f(x) = x and g(x) = x — 1 are orthogonal on o, lt 


3/2 3/2 3 2743/2 9 9 
Solution / x(x—1)dx = vr —xdx= ogee a |) 
0 0 3 2 Jo 8 8 


3 
Hence f and g are orthogonal over the interval o. Al 


Clearly functions may be orthogonal over one interval but not orthogonal over others. 
For example, 


1 
/ x(x — 1)dx 40 
0 


and so x and x — | are not orthogonal over [0, 1]. 


Example 16.2 Show f(t) = 1, g(t) = sint and A(t) = cost are mutually orthogonal over [—7, 71]. 
Solution We are required to show that any pair of functions is orthogonal over [—7,, 71]. 
™ 
/ 1 sint dt = [—cost]™,, = —cos7 + cos(—7) 


~(-l)+(-1) =0 


vad 
/ lcostdrt = [sint]”_ = sin — sin(—7) = 0 


TT 


Using the trigonometric identity sin2A = 2 sinA cos A, we can write 


a Te WP cos(2t) ]™ 
sint cost dt = = sin(2t) dt = — 
= Bes: 4 “pe 


_ cos(27t) —cos(—27t) _ 
i = 


Hence the functions 1, sint, cost form an orthogonal set over [—7, 7t]. 


0 


The set of Example 16.2 may be extended to 


{1, sint, cost, sin(2r), cos(2t), sin(3t), cos(3t), ..., sin(nt), cos(nt)} neN 


Example 16.3 Verify that {1, sint, cost, sin(2r), cos(2r), ...} forms an orthogonal set over [—7t, 71]. 


Solution Suppose n,m € N. We must show that all combinations of 1, sinnf, sin mt, cos nt and 
cos mt are orthogonal. 


[ 1 sin(nt) dt = —| at cos(n7t) + cos(—nr7t) = 
= n i 


n 


™ 
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In a similar manner, it is easy to show 


TT 
/ 1 cos(nt) dt = 0 


T™ 


Also, using the trigonometric identities in Section 3.6 


7 i TL 
i cos(nt) sin(mt) dt = 5 / sin(n + m)t — sin(n — m)t dt 


TT —T7 


We have seen that ce sinnt dt = O for any n € N. Noting that (n + m) € N and 
(n —m) €N, we see that 


TT 
/ sin(n + m)t — sin(n — m)t dt = 0 
—™ 


It is left as an exercise for the reader to show that 


7 
/ sin nt sin mt dt = 0 nA~m 


Tt. 


“me 
/ cos nt cos mt dt = 0 nx~m 


TT 


The functions thus form an orthogonal set across [—71, 71]. 


The result of Example 16.3 can be extended: 


{1, sint, cost, sin 2f, cos 2r, .. .} is an orthogonal set over any interval of length 27t 


More generally: 


_ (2nt 2mt , | aca Ant 
1, sin{ —— }, cos| —— ], sin{ —— ]}, cos{ —— ],... 
vw T T Th 


is an orthogonal set over any interval of length 7. In particular, the set is orthogonal 


ih 
over [0, 7] and | —~, — }. 
OL 8) 


These results are used extensively in Fourier analysis. 


Example 16.4 Find 


(a) [stan neZ n#0 


0) [costa neZ n#0 
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Solution (a) We use the trigonometric identity 
9 1 — cos 2nt 
sin” nt = ————— 
2 
to get 


™ 1 — cos 2nt 


ae 
sin’ (nt) dt = / 
if s 


using the orthogonal properties of cos(nt). 


-dt=7 


[3 
aD 


™ i 
(b) i cos? (nt) dt = / 1 — sin? (nt) dt 
—7 —7 
=2n-Tmt=T7 


It is a simple extension to show that integrating sin’ (nt) or cos?(nt) over any interval 
of length 27t yields the same result, namely 7t. It is also possible to extend the result of 


2ntt 


Example 16.4 to show 
T/2 T/2 
/ sin?( ) dt = / 
—T/2 —T/2 


2nttt 
Finally, integrating sin ( : 


T 
the same result, that is ri 


EXERCISES 16.2 


1. Show f(x) = x? and g(x) = | — x are orthogonal 
0 4 
across | 0, = |. 
3 4 
1 
2 Show f(x) = - and g(x) = x? are orthogonal over 5 
[—k, k]. 
3 (a) Show f(t) = 1 —tand g(t) = 1++¢ are 
orthogonal over [0, V3). 
Solutions 
3 (b) [-V3, 0] 


f 16.3 IMPROPER INTEGRALS 


cos'( 
) and cos'( 


2nTtt 


d@=— neZ n#0 


) 


2ntt 


) over any interval of length T gives 


(b) Find another interval over which f(t) and g(t) are 
orthogonal. 


Show f(t) =e! and g(t) = 1 — e~! are orthogonal 
across [—1, 1]. 


Show f(x) = /x and g(x) = 1 — ./x are orthogonal 


on [0 — 


There are two cases when evaluation of an integral needs special care: 


(1) one, or both, of the limits of an integral are infinite; 


(2) the integrand becomes infinite at one, or more, points of the interval of integration. 
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If either (1) or (2) is true the integral is called an improper integral. Evaluation of 
improper integrals involves the use of limits. 


Example 16.5 Evaluate [ ait 
2 


7 1] 
Solution , z= l-7| 
~ t ae 


1 
Example 16.6 Evaluate / e* dx. 


—cCO 


1 ex 1 
Solution : eax =|F| 


2x 
e 
We need to evaluate lim —. This limit is 0. So, 
x>—00 2 


1 _ e2t 1 e2 
dx=|}—| =—-0=3.69 
ix IS |.. 2 


Engineering application 16.1 


Capacitors in series 


Engineers are often called upon to simplify an electronic circuit in order to make 
it easier to analyse. One of the arrangements frequently met is that of two or more 
capacitors connected together in series. It is useful to be able to replace this con- 
figuration by a single capacitor with a capacitance value equivalent to that of the 
original capacitors in series. Derive an expression for the equivalent capacitance of 
two capacitors connected together in series (see Figure 16.1). 


Solution 
In Engineering application 13.2 we obtained an expression for the voltage across a 
capacitor. This was 


[i 
v=— 
Cc i 
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= This can be written as a definite integral to give the voltage expression across the 
: iHe4I capacitor at a general point in time, ft. The expression is 
Cc Cc ita 
: ——— / idt 
= _ E 
Vv] 1) ao 


Now consider the situation depicted in Figure 16.1. Writing an equation for each of 


Figure 16.1 : : 

the capacitors gives 
Two capacitors ' : 4 ; 
connected in series. v} vee v, idt 


aCe. ao ae 


By Kirchhoff’s voltage law, v = v, + v, and so 


=f ate fl i dt ete iL i dt 
) 1 = iu — — 1 
C, —oo G =e) C; C; oo 


Therefore, the two capacitors can be replaced by an equivalent capacitance, C given 


by 
1_1,1_446 
ene aun celia een 
so that 
= C,C, 
O46, 


The result is easily generalised for more than two capacitors. For example, for three 
capacitors we have 


a een! 
CAC PGE 
so that 


=, C,C,C; 
7 60, 4+6,C,4+ CC, 


You may wish to prove this result. 


ao 62 1 
Example 16.7 Evaluate f —— — -dt. 
3 2t+i1 tf 


=< 2 1 
Solution / — 7 de = [In |2t + 1] — In|el]5° 
3 


2t+1 
2t+1|]~ 1 
t |d, t 


7 
=In2—-In-=I1n 
3 
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Example 16.8 


Solution 


Example 16.9 


Solution 


Example 16.10 


Solution 


Example 16.11 


Solution 


foe) 
Evaluate , sint dt. 
I 


[oe] 
/ sint dt = [—cost]{° 
1 


Now lim,_,,(— cost) does not exist, that is the function cost does not approach a limit 
as t — oo, and so the integral cannot be evaluated. We say the integral diverges. 


1 
1 
Evaluate / — dx 
0 vx 


1 
The integrand, Wes becomes infinite when x = 0, which is in the interval of integration. 
x 


1 
1 
The point x = 0 is ‘removed’ from the interval. We consider / — dx where } is 
b 


slightly greater than 0, and then let b > 0*. Now, 
1 
1 
J qear= eva 
Then, 


Ya ra 
[ Jere im | Fo = jim 2 —2vb) =2 


The improper integral exists and has value 2. 


2 
1 
Determine whether the integral / — dx exists or not. 
0 x 
=f 
As in Example 16.9 the integrand is not defined at x = 0, so we consider : — dx for 
bx 
b > Oand then let b > O°. 
=i 
/ — dx = [In |x|] = In2—Inb 
bx 
So, 


2] 
iim( f - ar) = lim(In2 — Inb) 
b>0 bp & b>0 


Since lim, _,9 In b does not exist the integral diverges. 


2 
1 

Evaluate / — dx if possible. 
-_1 % 


We ‘remove’ es point x = 0 where the integrand becomes ae and consider two 


integrals: i. — dx where b is slightly smaller than 0, and ie — dx where c is slightly 


larger than 0. If these integrals exist as b > O07 andc —> Ot then ie — dx converges. If 
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2 
either of the integrals fails to converge then — dx diverges. Now, 
_1 x 


a | 
/ — dx = In|b| — In|—1] = In|D| 
_1 x 


b 
1 
lim | —dx = lim (n|d}) 
b>07- a} & b->07- 
2 


This limit fails to exist and so — dx diverges. 
_1 x 


Engineering application 16.2 


Energy stored in a capacitor 


A capacitor provides a useful means of storing energy. This energy can be discharged 
by connecting the capacitor in series with a load resistor and closing a switch. The 
stored electrical energy is converted into heat energy as a result of electrical cur- 
rent flowing through the resistor. Consider the circuit shown in Figure 16.2 which 
consists of a capacitor, C F, connected in series with a resistor with value R Q and 
isolated by means of a switch, S. We wish to calculate the amount of energy stored in 
the capacitor. The switch is closed at t = 0 and a current, 7, flows in the circuit. We 
have already seen in Chapter 2 that for such a case the time-varying voltage across 
the capacitor decays exponentially and is given by 


v = Ve t/RC 
So, using Ohm’s law 
vi Ve t/RE 
=i 
S i 


oa 


Now the effect of closing the switch is to allow the energy stored in the capacitor to 
be dissipated in the resistor. Therefore, if the total energy dissipated in the resistor 
is calculated then this will allow the energy stored in the capacitor to be obtained. 
However, the energy dissipation rate, that is power dissipated, is not a constant for 
the resistor but depends on the current flowing through it. The total energy dissipated, 
E, is given by 


E =[ Pow 
0 


where P(t) is the power dissipated in the resistor at time ¢. This equation has been 
discussed in Engineering application 13.3. Now, 


i 


Figure 16.2 
The capacitor is discharged by closing the switch. 


: RV2 en 2/RC 2. @-21/RC 
‘ R R 
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loo) y2 e72t/RC y2 lo) 
E= if dt = / eFC dt 
0 0 


R R 
at VERC —2t/RC]© 
=a Dip [e lk 
Now 
ince. 
t—oo 


and so the energy stored in the capacitor is given by 
= CVs 
co! 


E 


Example 16.12 Find 


lo) 
/ e” sint dt s>0 
0 


Si 


‘ ae F : du : 
Solution Using integration by parts, with u = e~” and a sint, we have 


lo,e) lo, <) 
= : = lo, @) = 
/ e~ sint dt = [—e™ cos¢], -sf e~" cost dt 
0 0 


Consider the first term on the r.h.s. We need to evaluate [-e* cos t| ast — oo and when 
t = 0. Note that —e~ cost — Oast — oo because we are given that s is positive. When 
t = 0, —e ” cost evaluates to —1, and so 


CO [oe 
/ e“ sintdt = 1— sf e “cost dt 
0 0 


Integrating by parts for a second time yields 
/ e “sintdt =1—s {le sint]> + sf ma sine ar} 
0 0 
=1- ef e ” sint dt 
0 


because (i sin a evaluates to zero at both limits. At this stage the reader might sus- 
pect that we have gone around in a circle and still need to evaluate the original integral. 
However, some algebraic manipulation yields the required result. We have 


CO CO 
/ e-™ sint dt +s? / e sintdt = 1 
0 0 


a+s) [ e sintdt = 1 
0 


/ ales 1 
e “ sintdt = 5 
0 I+s 


EXERCISES 16.3 


1. Evaluate, if possible, 


CO 
(b) i oe dt k is aconstant, k > 0 


Solutions 


1 
1 (a) l (b) k (c) does not exist 
(d) 1 (e) does not exist 


2 (a) does not exist (b) does not exist 


3 


16.4 Integral properties of the delta function 


Evaluate the following integrals where possible: 
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(a) is > ax (b) [ Eats ade 
a —— 
go x-2 go x-l x—2 


2 1 oe) 
(c) / rn) / sin 3t dt 
0 x1 0 


3 
(e) / xe* dx 


Find 


oe) 
i e' cost dt s>0 
0 


(c) does not exist (d) does not exist 
(e) 2e 

Ss 
stl 


INTEGRAL PROPERTIES OF THE DELTA FUNCTION 


The delta function, 6(¢ —d), was introduced in Chapter 2. The function is defined to be a 
rectangle whose area is | in the limit as the base length tends to 0 and as the height tends 
to infinity. Sometimes we need to integrate the delta function. In particular, we consider 


the improper integral 


[ v0-a 


The integral gives the area under the function and this is defined to be 1. Hence, 


[s0-ayar=t 


CO 


In Chapter 21 we need to consider the improper integral 


[ f()d(t — d) dt 


where f(t) is some known function of time. The delta function 5(¢ — d) is zero every- 
where except at t = d. Whent = d, then f(t) has a value f(d). Hence, 


ie f(@)d(t — d) dt =). Sf (d)d(t — d) dt = fla) [3a —aya 
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lee) 


since f(d) is a constant. But / d(t — d) dt = 1 and hence 


—o0o 


/ f(t)8¢ — d) dt = f(a) 
[o0-aa=i 


i f(t)d(t — d) dt = fd) 


The result 
/ Md —d) dt = fd) 


is known as the sifting property of the delta function. By multiplying a function, f(t), 
by 5(¢ — d) and integrating from —oo to oo we sift from the function the value f(d). 


Example 16.13 Evaluate the following integrals: 


(a) [P00 -2a (b) [ese-na 
= 66 0 


Solution (a) We use 
/ f@)d(t — d) dt = f(d) 
with f(t) = ft? and d = 2. Hence 


[ P00-na= say =2=4 


oO 


(b) We note that the expression e’5(t — 1) is 0 everywhere except at t = 1. Hence 


[eve =id= i. e'8(t — 1) dt 
0 = 


lo, <) 


Using 
/ f(t)d(t — d) dt = f(d) 
with f(t) =e’ andd = 1 gives 


[ ese- 1)dt = f(1) =e'=e 
0 


EXERCISES 16.4 


1. Evaluate (c) / e'5(t + 3) dt 
(a) / e!8(t) dt ~ 
— af t75(t —3) dt 


—oo 


w | e!3(t — 4) dr 
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oe é(t—1 0 
© | Ss © | e'5(t +3) dt 


2 


(f) [- e 8 (t) dt 


© f- e*$(t — a) dt 


10 

@ [ x5(x — 2) dx 
0 
1 

(e) / x75(x + 2) dx 
-1 


3 Evaluate the following: 


(h) ie e K-49) §(¢ — a) dt (a) / t(sin 2t)d(t — 3) dt 
2 Evaluate the following integrals: (b) i d(¢+1) —d(t—1)dt 
(a) / (sint)5(t — 2) dt (c) [ 8(t —d) dt 
(b) / e ‘6(t+1)dt (d) [+ — d) dt 
Solutions 
1 (a) 1 (b) 54.5982 (d) 8 
(c) 4.9787 x 107-2 (d) 36 (e) 0 
1 1 
> a i 1 3 (a) —0.8382 
(b) —1 
2 (a) 0.9093 
(b) 2.7183 (c) lifd > 0, 0 otherwise 
(c) 20.0855 (d) 1if—a<d <a, 0 otherwise 


INTEGRATION OF PIECEWISE CONTINUOUS FUNCTIONS 


Integration of piecewise continuous functions is illustrated in Example 16.14. If a dis- 
continuity occurs within the limits of integration then the interval is divided into sub- 
intervals so that the integrand is continuous on each sub-interval. 


Example 16.14 


Solution 


Given 
fo= 2 O0<t<l 
|? let <3 
evaluate f f(t) dt. 


The function, f(t), is piecewise continuous with a discontinuity at f = 1. The function 
is illustrated in Figure 16.3. The discontinuity occurs within the limits of integration. 
We split the interval of integration at the discontinuity thus: 


3: 1 3 
/ fayar= f fevers [ f(t) dt 
0 0 1 
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fO 


Figure 16.3 


2 0<t<K<l 
1 3 t The function f(t) =| casi 


Pr l<t <3. 


On the intervals (0, 1) and (1, 3), f(t) is continuous, so 


3 1 3 
[ro=f 2d¢+ | Pdr 
0 0 1 
3 


-PLEl-eb--% 


ie Given 5 6 
(c) : g(t) de (d) ' g(t) dt 


fMO=42t 1l<t<2 
? 2%@<t<3 
evaluate eI Given u(t) is the unit step function, evaluate 


ul 1.5 
(a) i fdr (b) / far : 
-1 —0.5 (a) i u(t) dt 
2.5 3 7 
(©) i fa (a) / fat 
0 —1 


2 Given 


4.5 
() / g(t) dt 
3:5 


2 
w | u(t) dt 
—3 


4 
3t 0<1<3 (c) [rues van 
r= [15-2 3<t<4 > 
: dd (a) / tu(t) dt 
evaluate =| 
ss ae 
(a) / g(t) dt (b) / g(t) dt (e) [ e“ u(t — 3) dt k constant 
0 2 
i (a) 6 (b) 5.75 By (a) 4 (b) 2 
() 85417 (d)_:15.3333 (c) 10 (d) 2 
4k 3k 
2 @ 6 (b) 15.5. () 14 apes 


(d) 33.5 (e) 6.75 
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| 16.6 INTEGRATION OF VECTORS 


If a vector depends upon time f, it is often necessary to integrate it with respect to time. 
Recall that i, j and k are constant vectors and are treated thus in any integration. Hence 
the integral 


I= [roi + g(t)j + h()k) dt 


is simply evaluated as three scalar integrals, and so 


I= ([ ras (/ doa) é (/ wou) 


Example 16.15 If r = 3ri + #?j + (1 + 20k, evaluate [) rdr. 


1 1 1 1 
Solution [r-([ wat) (| eayix(| 14200) 
0 0 0 0 


327', FAT, wie. “ae «he 
0 0 0 


EXERCISES 16.6 


1. Givenr = 3 sinti—costj+ (2 —1)k, evaluate i rdf. acca ‘ ri 
2 Given v = i— 3j+k, evaluate (a) [ adr (b) / adr (c) / adr 
1 2 4 Let aand b be two three-dimensional vectors. Is the 
(a) ) vdt (b) [ vdt following true? 

: 7) 2) 7) 

3 The vector, a, is defined by / adt x [ bdt = i: a x bdt 
1 1 1 
a=ri+e’ jtc¢k Recall that x denotes the vector product. 
Solutions 

7) 61+ 1.348k (b) 6.3331 + 0.0855j + 2.5k 


2 (a) i-3j+k (b) 2i — 6j + 2k (c) 21i+ 0.3496j + 7.5k 
3 (a) 0.333i + 0.632j + 0.5k 4 no 
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REVIEW EXERCISES 16 


1 


Show f(x) = x" and g(x) = x” are orthogonal on 
[—k, k] if n + m is an odd number. 


Show f(t) = sint and g(t) = cost are orthogonal on 
[a, a+ nm]. 


Show f(t) = sinht and g(t) = cosht are orthogonal 
over [—k, k]. 


Determine the values of k for which i tk dt exists. 


Evaluate if possible 


(a) / ae dt 
a 

(b) / e 1000r dt 
0 1 oa 

(c) [ see 

(d) / u(t)e ! dt 


| 
(e) [=e 


(Note that u(t) is the unit step function.) 


Find the values of k for which 5 e* dt exists. 


Evaluate 


@ f ( + 1)6(t — 1) dt 
w) | te’S(t — 2) dt 


(c) / (0? +1 +2)6(t — 1) dt 


% §(t+2) 
(d) [. aaa dt 


(e) i b(t + 1)d(t + 2) dt 
(a) Evaluate 


/ (7 + 1)8(2r) dt 
[Hint: substitute z = 2t.] 
(b) Show 
sO 1 
/ f(t)d (nt) dt = 7,4 O): 


n>0O 


Evaluate 


@ f (1 +18 — 2) dr 
0 


o | 18(1 +t) dt 
(c) iy (1 +1)8(—t) dt 
0 
@ | 5(t —6) +5(¢ +6) dt 


4k 
(e) / b(t +k) + d(t + 3k) + 6(¢ + 5k) dt 
—2k 
k>0 


The function g(t) is piecewise continuous and 
defined by 


w= 2t O<t<l 
ae Pepe 
Evaluate 
1 
(a) / g(t) dr 
0 
LS 
(b) / g(t) dt 
0 
17 
(c) i g(t) dt 
0.5 
2 
(d) i g(t) dt 
i ae 
(e) / g(t) dt 
1.3 
Given 
l+t -l<t<3 
f@= yt-1 3<t<4 
0 otherwise 
evaluate 


2 

(a) [foo 
4 

(b) [ tow 


5 
(c) [ f(t) dt 
0 
(d) [ feosa-241 


(e) 7 fut) dt 


12 Given 
? —2<t<2 
hth=%{P 2<t<3 
4 3<t<5 

evaluate 


2 
(a) i h(t) dt 
-1 


2.5 

o f h(t) dt 
0 
4 

(c) i h(t) + dt 
2 
4 

(d) / h(t + 1) dt 
0 


2 
(e) / A(2t) dt 
-1 


13 Ifa=ri— 2rj + 3rk, find {| adr. 


Solutions 


4 k<-l 


5 (a) does not exist 
(b) does not exist 


(c) does not exist 


(d) 1 
(e) does not exist 
6 k<0 
7 (a) 2 (b) 14.7781 (c) 4 
(d) 0.2 (e) 0 
8 (5 
9 (a) 3 (b) -l (© 1 
(d) 1 (e) 1 
10 (a) 1 (b) : (c) 2.85 


(d) 3 (e) 0.6 


Review exercises 16 


14 Given 


15 


11 


12 


13 


14 


15 


v(t) = 21+ (3 —2?)j +k 
find 


1 
(a) i vdt 
0 
2 
(b) i vdt 
1 
2 
(c) / vdt 
0 


Evaluate the following integrals: 


3 
(a) / \t| de 
-1 


2 
(b) / |t + 2| de 
3 


2 
(c) [ |3¢ — 1| dr 
0 


(a) 4.5 (b) 10.5 (c) 10 

(d) 3 (e) 10 

(a) 3 

(b) 8.4323 

(c) 22.25 

(d) 26.5833 

(e) 12.7917 

0.5i—j + 1.5k 

(a) i+ 5 +k 

3" 4 

15 


2 
b) 3i+ ~j+ —k 
(b) i+ d+ 4 
», 10, 
(c) eS ais 


13 
(a) 5 (b) 8.5 (c) cs 
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INTRODUCTION 


x 


e 
Many functions, for example sin x* and —, cannot be integrated analytically. Integra- 
x 


tion of such functions must be performed numerically. This section outlines two simple 
numerical techniques — the trapezium rule and Simpson’s rule. More sophisticated ones 
exist and there are many excellent software packages available which implement these 
methods. 


TRAPEZIUM RULE 


We wish to find the area under y(x), from x = a to x = b, that is we wish to evaluate 
Pe y dx. The required area is divided into n strips, each of width h. Note that the width, 


b— 
h, of each strip is given by h = a Each strip is then approximated by a trapezium. 
n 


1 
A typical trapezium is shown in Figure 17.1. The area of the trapezium is hi +Yi44]- 


Summing the areas of all the trapezia will yield an approximation to the total area: 


; h h h 
area of trapezia = 5% sea) se 5% ao) aoe 5 Ont a) 


h 
= 700+ 2M Lp ya tt eye) 


Example 17.1 Use the trapezium rule to estimate il 


Solution 

Table 17.1 

x y=e) 

0.5 1.2840 = yo 
0.7 1.6323 = y, 
0.9 2.2479 = y, 
1.1 3.3535 = y, 
1.3 5.4195 = y, 
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a Xi Xia b x Figure 17.1 
Each strip is approximated by a trapezium. 


If the number of strips is increased, that is h is decreased, then the accuracy of the ap- 
proximation is increased. 


13 
2. ‘é 
e“? dx using 
0.5 


(a) astrip width of 0.2 (b) astrip width of 0.1. 


(a) Table 17.1 lists values of x and corresponding values of e), 


We note that h = 0.2. Using the trapezium rule we find 


0.2 
sum of areas of trapezia = rR aa + 2(1.6323) + 2(2.2479) 


+2(3.3535) + 5.4195} 
= 2.117 
Table 17.2 
Hence 
1.3 ; x y=e™) 
/ e™) dx & 2.117 

05 Xq = 0.5 1.2840 = yo 
x, = 0.6 1.4333 = y, 
Xy = 0.7 1.6323 = y, 
xX; = 0.8 1.8965 = y3 
x4 = 0.9 2.2479 = yy 
Xs = 1.0 2.7183 = yz 
Xe = ll 3.3535 = Ye 
x7 = 122 4.2207 = yy 
Xg = 1.3 5.4195 = ye 


(b) Table 17.2 lists x values and corresponding values of e), 
Using Table 17.2, we find 


0.1 
sum of areas of trapezia = yee + 2(1.4333) + 2(1.6323) +--- 


+2(4.2207) + 5.4195} 
= 2.085 


Pe 


13 
Hence [ e dx & 2.085. 
0 


Dividing the interval [0.5, 1.3] into strips of width 0.1 results in a more accurate estimate 
than using strips of width 0.2. 
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Engineering application 17.1 


Distance travelled by a rocket 


A rocket is released and travels at a variable speed v. A motion sensor on the rocket 
measures this speed and the value is sampled by an onboard computer at | second 
intervals. The computer is required to calculate the distance travelled by the rocket 
and relay the value to a ground station at regular intervals. Table 17.3 records values 
of the measurements taken by the computer during the first 10 seconds of flight. 
Assuming that the computer uses the trapezium rule to estimate the distance travelled 
by the rocket, calculate the value that the computer will relay to the ground station 
after 10 seconds. 


Table 17.3 

Time (s) Speed (m s~') 
0 10.1 
1 17.2 
2 24.4 
3 29) 
4 34.6 
5 41.2 
6 50.9 
Wl 57.8 
8 60.3 
9 61.2 

10 62.1 

Solution 


d 
We know that v = ~ where v = speed and s = distance travelled in time t. So 


t 
= || v dt 
0 


We estimate the value of this integral using the trapezium rule. We choose a strip of 
width h = | as this is the time interval at which data is collected by the computer. 
Therefore 


10 
1 
s= / udt ~ Buel + 2(17.2 + 24.4 + 29.2 + 34.6 
0 
Sele OE eo OU. sole) Ome 
1 
= =(825. 
56 ) 
= 412.9 


The distance travelled after 10 seconds is approximately 412.9 m. 


17.2.1 Error due to the trapezium rule 


Suppose we wish to estimate i jf dx using the trapezium rule. The interval [a, b] is 
—a 


divided into n equal strips, each of width h = 


Example 17.2 


Solution 
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The difference between the estimated value of the integral and the true value of the 
integral is the error. So 


error = estimated value — true value 


We are able to find the maximum value of |error|. Firstly we calculate the second deriva- 
tive of f, that is f”. Suppose that | f”| is never greater than some value, M, throughout 
the interval [a, b], that is 


If" | <M for all x values on [a, b] 


We say that M is an upper bound for | f”| on [a, b]. Then the error due to the trapezium 
tule is such that 
(b—a) 
ey 


—a) 


b 
The expression VE!) By is an upper bound for the error. Note that the error 


| error | < i’M 


depends upon h”: if the strip width, h, is halved the error reduces by a factor of 4; if 
h is divided by 10 the error is divided by 100. 


Find an upper bound for the error in the estimates calculated in Example 17.1. Hence 
find upper and lower bounds for the true value of es e*) dx. 


We have f =e“. Then 
fi =2xe™ and  f” =2e" (1 + 2x’) 


We note that f” is increasing on [0.5, 1.3] and so its maximum value is obtained at 
x = 1.3. Thus the maximum value of f” on [0.5, 1.3] is 2e-9” [1 + 2(1.3)?]. Noting 
that 2e'3”[1 + 2(1.3)2] = 47.47 we see that 48 is an upper bound for f” on [0.5, 1.3], 
that is M = 48. 

We note that a = 0.5, b = 1.3. Thus 


(0.8)h? (48) 
12 
(a) In Example 17.1(a), A = 0.2 and so 


| error | < = 3.2n 
| error | < 3.2(0.2)” = 0.128 
Thus an upper bound for the error is 0.128. We have 
—0.128 < error < 0.128 
The estimated value of the integral is 2.117 and so 
2.117 — 0.128 < true value of integral < 2.117 + 0.128 


that is 


1.3 
1.989 < / e™) dx < 2.245 
0. 


iS 


An upper bound for pe e) dx is 2.245 and a lower bound is 1.989. 
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(b) In Example 17.1(b), 1 = 0.1 and so 
| error | < 3.2(0.1)7 = 0.032 
Hence an upper bound for the error is 0.032. Now 
—0.032 < error < 0.032 


The estimated value of the integral is 2.085 and so 


13 
2.085 — 0.032 < / e™) dx < 2.085 + 0.032 
0.5 
that is 
13 


2.053 < / e™) dx < 2.117 
0. 


iS 


EXERCISES 17.2 


1 Estimate the following definite integrals using the 
trapezium rule: 


132! 3% 
1 e 
ee = w | —dx use five strips 
w | sin(t~ ) dt use h = 0.2 1 x 


Solutions 


1 (a) 0.3139 (b) 0.5467 


Technical Computing Exercises 17.2 


1 (a) Plot the second derivative of f(t) = sin(t?) for (b) Repeat (a) with f(x) = et forl <x < 1.2. 
0 <t < 1. Use your graph to find an upper bound 
for f” (t) for 0 < t < 1. Hence find an upper 
bound for the error in Question 1(a) in 
Exercises 17.2. State upper and lower bounds for 
the integral given in the question. 


Hence find an upper bound for the error in 
Question 1(b) in Exercises 17.2. State upper and 
lower bounds for the integral given in the 
question. 


SIMPSON’S RULE 


In the trapezium rule the curve y(x) is approximated by a series of straight line segments. 
In Simpson’s rule the curve is approximated by a series of quadratic curves as shown in 
Figure 17.2. The area is divided into an even number of strips of equal width. Consider 
the first pair of strips. A quadratic curve is fitted through the points A, B and C. Another 
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quadratic curve is fitted through the points C, D and E. After some analysis an expression 
for approximating the area is found. 


Simpson’s rule states: 


h 
BREE Ol eS 3) ERS sar ee oe a yam iio iy) 


Figure 17.2 h 

In Simpson’s rule an = 300 Se ea 02) ree) 
even number of strips 
is used. The curve is 
approximated by 
quadratic curves. 


where n is an even number. 


Example 17.3 Estimate ie e*) dx using Simpson’s rule with (a) four strips, (b) eight strips. 
Solution (a) We use Table 17.1 and note that # = 0.2. Using Simpson’s rule we have 
: 0.2 
estimated value = gr eee ra CL Oe eo) Paes rah 


= 2.0762 


Using Simpson’s rule we have found i. e™) dx & 2.0762. 
(b) We use Table 17.2 and note that = 0.1: 


0.1 
estimated value = =q 28) + 4(1.4333 + 1.8965 + 2.7183 + 4.2207) 


+ 2(1.6323 + 2.2479 + 3.3535) + 5.4195} 
= 2.0749 


Using Simpson’s rule we have found ie e&) dx & 2.0749. 


Example 17.4 Estimate i. /1 + x3 dx using Simpson’s rule with 10 strips. 
Solution With 10 strips h = 0.1. Using Table 17.4, we find 


0.1 
estimated value ~ Be eee aan Oey he SOU Le aT Ie ean re ee) 


+2(1.6517 + 1.9349 + 2.2574 + 2.6138) + 3.000} 
= 2.130 


In some cases the numerical values are not derived from a function but from actual 
measurements. Numerical methods can still be applied in an identical manner. 
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Example 17.5 


Solution 


Table 17.4 Table 17.5 

Using Simpson’s rule to estimate Measurements used to 
f s 1+.x3 dx. estimate the integral. 
x y=V14+x% t Measurement (f) 
S10 1.4142 = yg 0 4 

4214 1.5268 = y, 1 47 
Set 1.6517 = y, 2 4.9 

= 1a 1.7880 = y3 3 5.3 

x, = 14 1.9349 = yy 4 6.0 
~=15 2.0917 = ys 5 5.3 

xg = 1.6 2.2574 = Ye 6 5.9 

x7 = 1.7 2.4317 = yy 

xg = 1.8 2.6138 = ye 

Xy = 1.9 2.8034 = yo 

X19 = 2.0 3.0000 = y19 


Measurements of a variable, f, were made at | second intervals and are given in 
Table 17.5. Estimate [” f dr using 


(a) the trapezium rule 
(b) Simpson’s rule. 


(a) The sum of the areas of the trapezia is 
3l4 + 2(4.74+4.9+5.3 + 6.0 + 5.3) + 5.9] = 31.2 

(b) The area has been divided into six strips and so Simpson’s rule can be applied: 
approximate value of integral = 5(4 +4(4.74+5.3 + 5.3) 


+ 2(4.9 + 6.0) + 5.9] = 31.0 


Engineering application 17.2 


Energy dissipation in a resistor 


A resistor is being used to dissipate energy from a variable d.c. supply. A calculation 
is needed of how much energy has been dissipated over a period of time. Table 17.6 
contains values of current, /, through the resistor, and voltage, V, across the resistor 
for the first 100 seconds since electrical power was first applied. Calculate the energy 
dissipation during this time period using Simpson’s rule with a step interval of 10 
seconds. 


Solution 
The energy dissipated, E, is given by 


t 
E= | pa 
0 


17.3.1 
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Table 17.6 Table 17.7 


Time(s) Voltage (V) Current (A) Time(s) Voltage(V) Current(A) Power (W) 


0 50.5 10.1 0 50.5 10.1 510.05 
10 101.0 20.2 10 101.0 20.2 2040.20 
20 67.5 13.5 20 67.5 135) 911.25 
30 80.5 16.1 30 80.5 16.1 1296.05 
40 92.0 18.4 40 92.0 18.4 1692.80 
50 96.0 12 50 96.0 IP) 2 1843.20 
60 78.5 15.7 60 78.5 15.7 1232.45 
70 82.0 16.4 70 82.0 16.4 1344.80 
80 90.5 18.1 80 90.5 18.1 1638.05 
90 107.0 21.4 90 107.0 21.4 2289.80 


100 86.0 2 100 86.0 fe? 1479.20 


where P is the power. Also P = IV, and so 


t 
B= || IV dt 
0 


We first need to evaluate P as shown in Table 17.7. 
Then using Simpson’s rule with h = 10 we have 


100 10 
Ee / IV dt © a elle + 4(2040.20 + 1296.05 + 1843.20 + 1344.80 
0 


+ 2289.80) + 2(911.25 + 1692.80 + 1232.45 + 1638.05) 
+ 1479.20} 
= 160648.5 J 
= 160.649 kJ 
The energy dissipated is therefore approximately 160.6 kJ. 


Error due to Simpson’s rule 


Simpson’s rule provides an estimated value of a definite integral. The difference between 
the estimated value and the true (exact) value is the error. Just as with the trapezium rule, 
we can calculate an upper bound for this error. 

We need to calculate the fourth derivative of f, that is f. Suppose | f| is never 
greater than some value, M, throughout the interval [a, b], that is 


yl<M for all x values on [a, b] 


Clearly M is an upper bound for || on [a, b]. The error due to Simpson’s rule is such 
that 


(b—a)h'M 


| error | < 
180 


Note that the error is proportional to h*. 
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Example 17.6 Find an upper bound for the error in the estimates calculated in Example 17.3. Hence 
fi 13 (2) 
nd upper and lower bounds for i. ev dx, 
Solution Here we have f = em), Calculating the fourth derivative gives 
FO =4e (4x4 + 127 +3) 


We seek an upper bound for | f | on [0.5, 1.3]. We note that f increases as x increases 
and so its maximum value in the interval occurs when x = 1.3: 


f (1.3) = 752.319 < 753 


Hence 753 is an upper bound for | f|. 
In this example a = 0.5 and b = 1.3 and so 


(0.8)h4 (753) 
180 
(a) Here h = 0.2 and so 


| error | < = 3.347h* 


| error | < 3.347(0.2)* = 0.0054 
An upper bound for the error is 0.0054. Now 
—0.0054 < error < 0.0054 


The estimated value of the integral is 2.0762 and so 


1 


3 
2.0762 — 0.0054 < / e) dx < 2.0762 + 0.0054 


0.5 
that is 
i3 
2.0708 < / e*) dx < 2.0816 
0.5 


(b) Here h = 0.1 and so 
| error | < 3.347(0.1)* = 3.347 x 1074 
An upper bound for the error is 3.347 x 10~*. Now 
—3.347 x 10+ < error < 3.347 x 1074 


Noting that the estimated value of the integral is 2.0749 we have 


1; 


3 
2.0749 — 3.347 x 10-*< / e™ dx < 2.0749 + 3.347 x 1074 


0.5 


that is 


1.3 
2.0746 < / e™) dx < 2.0752 


0.5 


EXERCISES 17.3 


1. Estimate the values of the following integrals using 
Simpson’s rule: 


3 
(a) / Ince +1)dx use 10 strips 
2 


2.6 
(b) te! dt use eight strips 
1 


Solutions 


1 (a) 2.7955 (b) 15.1164 


2 (a) trapezium rule: 1.5900, Simpson’s rule: 1.5681 


Technical Computing Exercises 17.3 


1 (a) Plot the fourth derivative of f(x) = 
InQ? + 1) for2 <x <3. Use your graph 


to find an upper bound for f (x) for 2 < x <3. 


Hence find an upper bound for the error in 
Question 1(a) in Exercises 17.3. State upper and 
lower bounds for the integral given in the 
question. 
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2 Evaluate, using the trapezium rule and Simpson’s 
rule, 


1 
(a) [ (? +1)°/? dx use four strips 
0 


1.6 gin 2t 
(b) / dt 
1 t 


use six strips 


(b) trapezium rule: 0.2464, Simpson’s rule: 
0.2460 


(b) Repeat (a) with f(t) = J/te’ forl <t < 2.6. 
Hence find an upper bound for the error in Ques- 
tion 1(b) in Exercises 17.3. State lower and upper 
bounds for the integral given in the question. 


2 Use MATLAB® or a similar technical computing 
language to find upper and lower bounds for the 
integrals in Question 2 in Exercises 17.3. 


REVIEW EXERCISES 17 


1 If f(t) =VP? +1 find {7 f@) dr using 
(a) the trapezium rule with h = 0.25 


(b) Simpson’s rule using eight strips. 


2. Estimate the following definite integrals using the 
trapezium rule with six strips: 


4 0.6 
(a) / Vx +1 dx (b) [ sin(t?) dt 
1 0 


0.8 @) 0.3 3 
dt d —— dx 
© fie (d) [ =e) 


0.2 
2:5 ef 7 
—d 
(e) / : 


3 Estimate the definite integrals in Question 2 using 
Simpson’s rule with six strips. 


4 Estimate the following definite integrals using the 
trapezium rule with eight strips: 


4 
(a) / cos(4/f) dt 
0 
6 
(b) / @? +13 det 
2 
3 
(c) / eV dx 
1 
0.8 
(d) 7 tan(f?) dt 
0 


5 
(e) i; In? + 1) dx 
3 
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a (a) 1.8111 (c) 0.806 43 
(b) 1.8101 (a) 0.448 49 

2 @ 129113 (e) 5.17879 
(b) 0.07227 @ «) 081058 
(c) 0.808 60 ‘eyes 
(d) 0.448 45 
(e) 5.19384 (c) 8.27686 

8) @ 1287181 aaeed 
(b) 0.071334 (e) 5.63032 


language to find upper bounds for the errors in these questions. 


i Use MATLAB® or a similar technical computing | upper and lower bounds for the integrals given in 
Questions | to 4 in Review exercises 17. Hence find 


Aer 5 mis 
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INTRODUCTION 


Often the value of a function and the values of its derivatives are known at a particular 
point and from this information it is desired to obtain values of the function around that 
point. The Taylor polynomials and Taylor series allow engineers to make such estimates. 
One application of this is in obtaining linearized models of non-linear systems. The great 
advantage of a linear model is that it is much easier to analyse than a non-linear one. It is 
possible to make use of the principle of superposition: this allows the effects of multiple 
inputs to a system to be considered separately, and the resultant output to be obtained by 
summing the individual outputs. 

Often a system may contain only a few components that are non-linear. By linearizing 
these it is then possible to produce a linear model for the system. We saw an example of 
this when we analysed a fluid system in Engineering application 10.6. Although electri- 
cal systems are often linear, mechanical, thermal and fluid systems, or systems contain- 
ing a mixture of these, are likely to contain some non-linear components. Unfortunately 
it may not be possible to obtain a sufficiently accurate linear model for every non-linear 
system as we shall see in this chapter. 

Taylor polynomials of higher degree can be found which approximate to a given func- 
tion. This is dealt with in Sections 18.3 and 18.4. The difference between a given function 
and the corresponding Taylor polynomial is covered in Section 18.5. The chapter closes 
with a treatment of Taylor and Maclaurin series. 
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O a x 


Figure 18.1 

Graphical representation 
of a first-order Taylor 
polynomial. 


LINEARIZATION USING FIRST-ORDER TAYLOR 
POLYNOMIALS 


Suppose we know that y is a function of x and we know the values of y and y’ when 
x = a, that is y(a) and y’(a) are known. We can use y(a) and y'(a) to determine a linear 
polynomial which approximates to y(x). Let this polynomial be 


P(x) = cp + e\x 

We choose the constants c, and c, so that 
P\ (a) = y(a) 
pi (a) = y'(@) 


that is, the values of p, and its first derivative evaluated at x = a match the values of y 
and its first derivative evaluated at x = a. Then, 


p\(a) = ya) =co +c, 

P| (a) = y (a) =Cc; 
Solving for cy and c, yields 

co =yla)—ay(a) oe, = y'(a) 
Thus, 

Pp, (x) = y(a) — ay'(a) + y'(a)x 


P(x) = y(a) + (a(x — a) 
Pp, (x) is the first-order Taylor polynomial generated by y at x = a. 


The function, y(x), is often referred to as the generating function. Note that p, (x) and 
its first derivative evaluated at x = a agree with y(x) and its first derivative evaluated at 
x= a. 

First-order Taylor polynomials can also be viewed from a graphical perspective. 
Figure 18.1 shows the function, y(x), and a tangent at Q where x = a. Let the equa- 
tion of the tangent at x = a be 


p(x) =mx+c 


The gradient of the tangent is, by definition, the derivative of y at x = a, that is y'(a). 
So, 


p(x) =y(ax+e 
The tangent passes through the point (a, y(a)), and so 
y(a) = y'(ajat+e 
that is c = y(a) — y (a)a. The equation of the tangent is thus 
P(x) = y'(a)x + y(a) — y'(aja 
p(x) = y(a) + (a) — a) 


Example 18.1 


Solution 


Example 18.2 


Solution 


18.2.1 
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This is the first-order Taylor polynomial. We see that the first-order Taylor polynomial 
is simply the equation of the tangent to y(x) where x = a. 

Clearly, for values of x near to x = a the value of p, (x) will be near to y(x); p, (x) isa 
linear approximation to y(x). In the neighbourhood of x = a, p, (x) closely approximates 
y(x), but being linear is a much easier function to deal with. 


A function, y, and its first derivative are evaluated at x = 2. 
y2)=1 y (2) =3 


(a) State the first-order Taylor polynomial generated by y at x = 2. 
(b) Estimate y(2.5). 


(a) py) = yQ2) +y(2)@— 2) =14+3@—-—2) =—-5 + 3x 
(b) We use the first-order Taylor polynomial to estimate y(2.5): 


p, (2.5) = —5 + 3(2.5) = 2.5 
Hence, y(2.5) © 2.5. 


Find a linear approximation to y(t) = f° near t = 3. 


We require the equation of the tangent to y = 7? at t = 3, that is the first-order Taylor 
polynomial about t = 3. Note that y(3) = 9 and y'(3) = 6. 


P(t) = ya) + (a(t — a) = y(3) + ¥'(3)(t — 3) 
= 9+ 6(t — 3) 
= 6t —9 


Att = 3, p,(t) and y(t) have an identical value. Near to t = 3, p,(t) and y(t) have 
similar values, for example p, (2.8) = 7.8, y(2.8) = 7.84. 


Linearization 


It is a frequent requirement in engineering to obtain a linear mathematical model of a 
system which is basically non-linear. Mathematically and computationally linear models 
are far easier to deal with than non-linear models. The main reason for this is that linear 
models obey the principle of superposition. It follows that if the application, separately, 
of inputs u,(t) and u,(t) to the system produces outputs y, (¢) and y,(¢), respectively, 
then the application of an input uw, (t) + u,(t) will produce an output y, (t) + y,(¢). This 
is only true for linear systems. 

The value of this principle is that the effect of several inputs to a system can be cal- 
culated merely by adding together the effects of the individual inputs. This allows the 
effect of simple individual inputs to the system to be analysed and then combined to 
evaluate the effect of more complicated combinations of inputs. A few examples will 
help clarify these points. 


510 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series 


Engineering application 18.1 


A d.c. electrical network 


Consider the d.c. network of Figure 18.2. This network is a linear system. This is 
because the voltage/current characteristic of a resistor is linear provided a certain 
voltage is not exceeded. Recall Ohm’s law which is given by 


We = UR 


where V is the voltage across the resistor, / is the current through the resistor and R 
is the resistance. This makes the analysis of the network relatively easy. The voltage 
sources, V,, V,, V;, V,, can be thought of as the inputs to the system. It is possible to 
analyse the effect of each of these sources separately, for example the voltage drop 
across the resistor, R,, resulting from the voltage source V,, and then combine these 
effects to obtain the total effect on the system. The voltage drop across R, when all 
sources are considered would be the sum of each of the voltage drops due to the 
individual sources V,, V,, V; and V4. 


Figure 18.2 
A d.c. electrical network. 


Engineering application 18.2 


A gravity feed water supply 


Consider the water supply network of Figure 18.3. The network consists of three 
source reservoirs and a series of connecting pipes. Water is taken from the network at 
two points, S, and S,. In a practical network, reservoirs are usually several kilometres 
away from the points at which water is taken from the network and so the effect of 
pressure drops along the pipes is significant. For this reason many networks require 
pumps to boost the pressure. 

The main problem with analysing this network is that it is non-linear. This is be- 
cause the relationship between pressure drop along a pipe and water flow through a 
pipe is not linear: a doubling of pressure does not lead to a doubling of flow. For this 


Figure 18.3 
A water supply network. 
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reason, it is not possible to use the principle of superposition when analysing the 
network. For example, if the effect of the pressure at S, for a given flow rate was 
calculated separately for each of the inputs to the system — the reservoirs R,, R,, R, — 
then the effect of all the reservoirs could not be obtained by adding the individual 
effects. It is not possible to obtain a linear model for this system except under very 
restrictive conditions and so the analysis of water networks is very complicated. 


Having demonstrated the value of linear models it is worth analysing how and when 
a non-linear system can be linearized. The first thing to note is that many systems may 
contain a mixture of linear and non-linear components and so it is only necessary to lin- 
earize certain parts of the system. A system of this type has been studied in Engineering 
application 10.6. Therefore linearization involves deciding which components of a sys- 
tem are non-linear, deciding whether it would be valid to linearize the components and, 
if so, then obtaining linearized models of the components. 

Consider again Figure 18.1. Imagine it illustrates a component characteristic. The 
actual component characteristic is unimportant for the purposes of this discussion. For 
instance, it could be the pressure/flow relationship of a valve or the voltage/current 
relationship of an electronic device. The main factor in deciding whether a valid linear 
model can be obtained is the range of values over which the component is required to 
operate. If an operating point Q were chosen and deviations from this operating point 
were small then it is clear from Figure 18.1 that a linear model — corresponding to the 
tangent to the curve at point Q — would be an appropriate model. 

Obtaining a linear model is relatively straightforward. It consists of calculating the 
first-order Taylor polynomial centred around the operating point Q. This is given by 


Pi (x) = y(a) + (a(x — a) 
Then p, (x) is used as the linearized model of the component with characteristic y(x). It 
is valid provided that it is only used for values of x such that |x — a| is sufficiently small. 
As stated, p, (x) is also the equation of the tangent to the curve at point Q. The range of 
values for which the model is valid depends on the curvature of the characteristic and 
the accuracy required. 


Engineering application 18.3 


Power dissipation in a resistor 


The power dissipated in a resistor varies with the current. Derive a linear model for 
this power variation valid for an operating point of 0.5 A. The resistor has a resistance 
of 10 Q. 


Solution 

For a resistor 
Pie iEIR 

where 
P = power dissipated (W) 
I = current (A) 


R= resistance (Q2). 
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The first-order polynomial, valid around an operating point J = 0.5, is 
p, (1) = P(0.5) + P’(0.5)(I — 0.5) 

Now P(0)5) = (05) 10 = 25. P (0:5) — 20'S) (10) —_ 10, and sa 
jo UD) = 23 46 OW = Os) = WOl— 2.5) 


It is interesting to compare this linear approximation with the true curve for values 
of J around the operating point. Table 18.1 shows some typical values. Figure 18.4 
shows a graph of the power dissipated in the resistor, P, against the current flowing 
in the resistor, 7. Notice that the linear approximation is quite good when close to the 
operating point but deteriorates further away. 


Table 18.1 


A comparison of linear approximations with true values. 


I (A) True value of P (W) Approximate value of P (W) 
0.5 2 mS) 

0.499 2.49001 2.49 

0.501 2.51001 Pei 

0.49 2.401 2.4 

0.51 2.601 2.6 

0.4 1.6 RS) 

0.6 3.6 3,5) 

1.0 10 WES 

P 


Operating point 


‘ Pid] 10125 


2 
Figure 18.4 
0 We see that the tangential approximation is 
0.2 04 06 08 1 1 good when close to the operating point. 


EXERCISES 18.2 


1. Calculate the first-order Taylor polynomial generated 3 Calculate the first-order Taylor polynomial generated 
by y(x) = e* about by y(x) = cos x about 
(a) x=0 (b) x =2 (c) x= 3 (a)x =0 

2 Calculate the first-order Taylor polynomial generated (b)x = 1 
by y(x) = sin x about (c) x = —0.5 


(a) x=0 (b+) x=1 (©) x= -0.5 


4 (a) Finda linear approximation, p, (t), to h(t) = eb 
about t = 2. 


(b) Evaluate h(2.3) and pj (2.3). 


Solutions 


1 (a) py) =x4+1 
(b) py) =e (x- 1) 
(c) py) =eFr+4) 


2 (a) py(x) =x 
(b) p, (x) = 0.5403x + 0.3012 
Pp, (x) = 0.8776x — 0.0406 


3 (a) py) =1 
(b) p; (x) = —0.8415x + 1.3818 
P(x) = 0.4794x + 1.1173 


= 
a 
YS 


= 
fe) 
ae] 
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1 
Find a linear approximation, p, (t), to R(t) = ; 
about t = 0.5. 
Evaluate R(0.7) and p, (0.7). 


p(t) = 12r — 16 
h(2.3) = 12.167, p, (2.3) = 11.6 
p,(t) = —4t +4 
R(0.7) = 1.4286, p, (0.7) = 1.2 


| 18.3 SECOND-ORDER TAYLOR POLYNOMIALS 


Suppose that in addition to y(a) and y'(a), we also have a value of y’(a). With this 
information a second-order Taylor polynomial can be found, which provides a quadratic 


approximation to y(x). Let 


p(x) = Cy tex + cx" 


We require 
P2(a) = y(a) 
P,(a) = y'(a) 
P3(a) = y"(a) 


that is, the polynomial and its first two derivatives evaluated at x = a match the function 
and its first two derivatives evaluated at x = a. Hence 


p,(a) = cy) +¢,a+c,@ = y(a) 
p>(a) = c, + 2c,a = y'(a) 


p3(a) = 2c, = y"(a) 


Solving for cy, c, and c, yields 


y"(a) 
Cy => 5) 


c, = y'(a) — ay"(a) 


(18.1) 
(18.2) 
(18.3) 


from Equation (18.3) 
from Equation (18.2) 


2 
Cy = ya) — ay'(a) + sy @ from Equation (18.1) 
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Hence, 


2. 
p(x) = y(a) — ay'(a) + 53" @) 


y oe 


+ {y'(a) — ay"(a)}x + 

Finally, 
(i ” (x = a) 
[Bl C) SW) ae! ONCE = @) yy (O)-—— 


P(x) is the second-order Taylor polynomial generated by y about x = a. 


Example 18.3 Given y(1) = 0, y(1) = 1, y’(1) = —2, estimate 
(a) y(1.5) 
(b) y(2) 
(c) y(0.5) 


using the second-order Taylor polynomial. 


Solution The second-order Taylor polynomial is p, (x): 
, ” (~- LF 

PO=DFyYOE— Dey) a 

Gaip_ 
5 = 
We use p,(x) as an approximation to y(x). 


=x—-1-2 = 1= GH 1S ay 4-2 


(a) The value of y(1.5) is approximated by p, (1.5): 
y(1.5) © p, (1.5) = 0.25 

(b) The value of y(2) is approximated by p, (2): 
y(2) © p,(2) =0 

(c) The value of y(0.5) is approximated by p, (0.5): 
y(0.5) © p,(0.5) = —0.75 


Example 18.4 (a) Calculate the second-order Taylor polynomial, p,(x), generated by 
y(x) = x +27 — 6 about x = 2. 


(b) Verify that y(2) = p,(2), y'(2) = p,(2) and y"(2) = p5(2). 
(c) Compare y(2.1) and p,(2.1). 


Solution (a) We need to calculate y(2), y’(2) and y’”(2). Now 
y(x) =x +27 —6,y (x) = 3x + 2x, y"(x) = Ox +2 
and so 


y(2) = 6, y'(2) = 16, y"(2) = 14 
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The required second-order Taylor polynomial, p,(x), is thus given by 


’ ” G2)" 
Px) = y2)+ yQ)@—-2)+y (2) — 


@=2y 


= 6+ 1l6(x—-2)4+ 14 
= 6+ 16x — 32+ 707 — 4x +4) 

= 7x" — 12x +2 

(b) Using (a) we can see that 

py (x) = 7x° — 12x +2, p(x) = 14x — 12, p(x) = 14 


and so 
p,(2) = 6, p,(2) = 16, p(2) = 14 
Hence 
y(2) = p,(2), y'(2) = p(2), y"(2) = p5(2) 
(c) We have 


y(2.1) = (2.1)? + (2.1)? — 6 = 7.671 
py(2.1) = 7(2.1)? — 12(2.1) + 2 = 7.67 


Clearly there is a very close agreement between values of y(x) and p,(x) near to 
x=2. 


Engineering application 18.4 


Quadratic approximation to a diode characteristic 


In Engineering application 10.5 we derived a linear approximation to a diode char- 
acteristic suitable for small signal variations around an operating point. Sometimes 
it is not possible to use a linear approximation because the variations are too large 
to maintain sufficient accuracy. Even so, an approximate model may be desirable. 
In general, a higher order Taylor polynomial will give a more accurate model than 
that of a lower order polynomial. We will consider a quadratic model for a diode 
characteristic. The V — J characteristic of typical diode at room temperature can be 
modelled by the equation 


Ay) ie. il) 
Given an operating point, V,, the second-order Taylor polynomial is 
(V —V,)° 


PAV) = 1 EEE VY Vi) 1 (V,) 5 


Now 
TV) = 40l 6°" 7 (Vy = 16001 6" 
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3 


4 


SO 


Vay 
p,(V) = 1(e™ — 1) + 407, e%™”(V — V,) + 16007, ce” ae 
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y 


The coefficients need to be calculated only once. After that the calculation of a current 
value only involves evaluating a quadratic. 


EXERCISES 18.3 


(a) Obtain the second-order Taylor polynomial, 


P(x), generated by y(x) = 3x* + 1 about x = 2. 


(b) Verify that y(2) = p)(2), y'(2) = p4(2) and 
y"(2) = p5(2). 

(c) Evaluate p,(1.8) and y(1.8). 

(a) Calculate the second-order Taylor polynomial, 
P(x), generated by y(x) = sin x about x = 0. 


(b) Calculate the second-order Taylor polynomial, 
P2(Xx), generated by y(x) = cos x about x = 0. 


(c) Compare your results from (a) and (b) with the 


small-angle approximations given in Section 6.5. 


A function, y(x), is such that y(—1) = 3, y’(—1) =2 

and y"(—1) = —2. 

(a) State the second-order Taylor polynomial 
generated by y about x = —1. 

(b) Estimate y(—0.9). 


A function, y(x), satisfies the equation 


ysytx y(l)=2 

(a) Estimate y(1.3) using a first-order Taylor 
polynomial. 

(b) By differentiating the equation with respect to x, 
obtain an expression for y”. Hence evaluate 
y'(1). 

(c) Estimate y(1.3) using a second-order Taylor 
polynomial. 


Solutions 


(a) po(x) = 72x? — 192x + 145 
(c) po (1.8) = 32.68, y(1.8) = 32.4928 


2 


(b) py(x) =1- a 


(a) py) =x 

(@) prix) =? +4 

(b) p (—0.9) = 3.19. This is an approximation to 
y(—0.9). 


A function, x(t), satisfies the equation 


x=x+VJt4+1 x(0) =2 
(a) Estimate x(0.2) using a first-order Taylor 
polynomial. 


(b) Differentiate the equation w.r.t. t and hence 
obtain an expression for x. 


(c) Estimate x(0.2) using a second-order Taylor 
polynomial. 
A function, h(t), is defined by 
h(t) = sin 2t + cos 3t 


Obtain the second-order Taylor polynomial generated 
by A(t) about t = 0. 


The functions y, (x), yz (x) and y3 (x) are defined by 
y, (x) = Ae*, yn (x) = Bx +Cx, 
3 (x) = Vy (x) + Yo (x) 
(a) Obtain a second-order Taylor polynomial for 
y, (x) about x = 0. 
(b) Obtain a second-order Taylor polynomial for 
y (x) about x = 0. 
(c) Obtain a second-order Taylor polynomial for 
y3(x) about x = 0. 


(d) Can you draw any conclusions from your 
answers to (a), (b) and (c)? 


(a) 3.5 
(b) y" = 2yy' +1, y"(1) = 21 
(c) 4.445 
(a) 2.6 
(b) #=X4 : 
2/t+1 
(c) 2.67 
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mm p(t) =1+2t-4.517 (b) Cx 
2 


2 AAs 
7 (a) (rere) tO) Be AR te 


Technical Computing Exercises 18.3 


Many technical computing languages have the 2 (a) 
capability of producing a Taylor polynomial of a 

function. In some languages this is available as a 

default whereas in others, such as MATLAB®, this is (b) Draw y(x) and p,(x) for —2 <x <2. 
offered via a toolbox function which may need to be 
loaded or purchased separately. If the software you 
are using does not have the capability to produce the py(x), generated by y(x) = on(2) aboutsc= 3: 
Taylor series automatically you may choose to do this x 

part manually and then plot the results using the 


Calculate the second-order Taylor polynomial, 
P2(x), generated by y(x) = sinx about x = 0. 


3 (a) Calculate the second-order Taylor polynomial, 


technical computing language. (b) Draw y(x) and p5(x) forl <x <5. 
1 (a) Calculate the second-order Taylor polynomial, 4 (a) Calculate the second-order Taylor polynomial, 
P(x), generated by y(x) = x° about x = 0. P(x), generated by y(x) = e°°S* about x = 0. 
(b) Draw y(x) and p,(x) for -—2 <x < 2. (b) Draw y(x) and p,(x) for -—2 <x <2. 


TAYLOR POLYNOMIALS OF THE nTH ORDER 


If we know y and its first n derivatives evaluated at x = a, that is y(a), y'(a), 
y"(a), ..., y (a), then the nth-order Taylor polynomial, p, (x), may be written as 


2 
Oo 


_ ERS 
Py) =y(a) + y'@a—a) +y"@) = = 7 


(.—= n 
Heo oe 
n!} 


This provides an approximation to y(x). The polynomial and its first n derivatives eval- 
uated at x = a match the values of y(x) and its first n derivatives evaluated at x = a, 


that is 
DP, (a) = ya) 
pi, (a) = y (a) 


pi (a) = y"(a) 


pria=y"'@ 
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Example 18.5 Given y(0) = 1, y((0) = 1, y’(0) = 1, y9 (0) = —1, y” (0) = 2, obtain a fourth-order 
Taylor polynomial generated by y about x = 0. Estimate y(0.2). 


Solution In this example a = 0 and hence 


2 3 4 
= / "(Q)~ GB) (yy ~ 4) yy* 
Pa) = yO) +YOx+Y OT ty Oty OF 


zs <x 
=l4+x+2—-— 


2 6 12 
The Taylor polynomial can be used to estimate y(0.2): 


0.2)? (0.2)3 — (0.2)4 
p,(0.2) =1+0245 ~ ae = +02 = 1.2188 


y(0.2) © 1.2188 


Example 18.6 (a) Calculate the first-, second-, third- and fourth-order Taylor polynomials generated 
by y(x) = e* about x = 0. 


(b) Plot y = e* and the Taylor polynomials for —2 < x < 2. 


Solution (a) We have 


y@) =y &) =O) Hy" He'G) Se" 


and 


y(0) = y'(0) = yy’) =y 0) =v) = 1 


Thus the Taylor polynomials, p, (x), p,(x), p;(x) and p,(x), are given by 
P(x) = yO) +y(O)x = 14x 


x x2 

Px(x) = y(O) +y¥O)x + "OF =1l4+x+ oi 
; bin @ es 3 er . 
P= Ot ORt VO 7 + yO) = ltat + 


2 3 4 
as ' nay B)(q\* 4) (qy* 
P4(x) = yO) +y (O)x + y > +y OF +y OF 


x? ra xt 
=4 aoe diceae ed me 
are rae ay. 


(b) The graphs of y = e* and the Taylor polynomials are shown in Figure 18.5. Note 


that the Taylor polynomials become better and better approximations to e* as the 
order of the polynomial increases. 


Example 18.7 Given that y satisfies the equation 
yO +2y=x" (18.4) 
and also the conditions 


yO)=1 y0)=2 
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YX), Pi(X) 
or NW BUN A 4 C 
yx), pj (x) 
CorN WwW HM AW ~] C 


—2.0-1.5-1.0-0.5 0 0.5 1.0 1.5 2.0 —2.0-1.5-1.0-0.5 0 0.5 1.0 1.5 2.0 
x x 
(a) (b) 


Figure 18.5 
A graph of y = e* and four Taylor polynomials. 


use a third-order Taylor polynomial to estimate y(0.5). 
Solution To write down the third-order Taylor polynomial about x = 0 we require y(0), y’(0), 
y’(0) and y®) (0). From Equation (18.4), 
y"QX) =x" + fy @P — 2y@) (18.5) 
So, 
y'" (0) = 0+ {y'(0)} — 2y@) =2 
To find y® (0), Equation (18.5) is differentiated w.r.t. x: 
y x) = 2x + 2y'@)y"(x) — 2y'@) 
y (0) = 2y'(O)y" (0) — 2y'(0) = 4 
The Taylor polynomial may now be written as 
x? x 
P3(x) = yO) + y'(O)x + y"(0) zt y (0) ra 


2x3 
=1+2xtx° += 


We use p,(x) as an approximation to y(x): 
p3(0.5) = 1+ 140.25 + 0.0833 = 2.333 
that is, 


y(0.5) © 2.333 


EXERCISES 18.4 


1. A function, y(x), has y(0) = 3, y’(0) = 1, 2 A function, h(t), has h(2) = 1, h'(2) = 4, 
y’(0) = —1and y®) (0) = 2. h’ (2) = —2, AO (2) = 1 and h“ (2) = 3. 
(a) Obtain a third-order Taylor polynomial, p, (x), (a) Obtain a fourth-order Taylor polynomial, p4(t), 
generated by y(x) about x = 0. generated by h(t) about t = 2. 


(b) Estimate y(0.2). (b) Estimate h(1.8). 
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8) Given y(x) = sinx, obtain the third-, fourth- and (a) Obtain a third-order Taylor polynomial generated 
fifth-order Taylor polynomials generated by y(x) by y about x = 1. 
about x = 0. (b) Estimate y(1.3) using (a). 

4 Given y(x) = cos.x, obtain the third-, fourth- and (c) Obtain a fourth-order Taylor polynomial 
fifth-order Taylor polynomials generated by y(x) generated by y about x = 1. 
about x = 0. 


(d) Estimate y(1.3) using (c). 
5 (a) Given y(x) = sin(kx), k a constant, obtain the 


third-, fourth- and fifth-order Taylor polynomials 9 A function, y(x), satisfies the equation 
generated by y(x) about x = 0. 
(b) Write down the third-, fourth- and fifth-order y" + ¥ =x yO) = 1, y (0) =—1 
Taylor polynomials generated by y = sin(—x) 
about x = 0. (a) Estimate y(0.25) using a third-order Taylor 
. . polynomial. 

6 (a) Given y(x) = cos(kx), k a constant, obtain the 
third-, fourth- and fifth-order Taylor polynomials (b) Estimate y(0.25) using a fourth-order Taylor 
generated by y(x) about x = 0. polynomial. 


(b) Write down the third-, fourth- and fifth-order 
Taylor polynomials generated by y = cos(2x) 
about x = 0. 


10 A function, y(x), has y(1) = 3, Y(1) = 6, y’(1) = 1 
and y3) (1) = -1. 


7 If p,,(x) is the nth-order Taylor polynomial generated 
by y(x) about x = 0, show that p, (kx) is the nth-order 
Taylor polynomial generated by y(kx) about x = 0. 


(a) Estimate y(1.2) using a third-order Taylor 
polynomial. 


(b) Estimate y’ (1.2) using an appropriate 


8 The function, y(x), satisfies the equation second-order Taylor polynomial. 


yl =yt 3x2 y(1) =1,y(1) =2 [Hint: define a new variable, z, given by z = y’.] 
Solutions 
x x3 
m (a) 3+x- a Ss (b) 3.1827 5 (a) Py) = 
4 3 3,3 
t 5t 31 kx 
B® 5-5 a : Pax) = kx — Se, 
b wl BB 3 kb 5 
(b) 0.1589 bh =e - 2, x 
3 3! 5! 
3 p3(xy)=x-—, x3 
a (b) p(x) =-x+ >, 
a 3! 
P4(x) =x 31 x3 
pa) = x45, 
3 5 : 
x x 
PACE a x3 x 
‘ Ps(xX) = —x+ 31 3 
Mm p3@)-1-7, 232 
2 4 6 (a) P3 (x) =1- a? 
x ! 
Pa@=1— oF + ap x2 kd 
2 EN aa 
Ps (x) =1 oT + a e , k2x2 kAx4 
Ps\X)= ay ah 
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(b) p(x) = 1 — 22’, 5xt xe 
His case ig er ga ag 
pa) =] 1 2° 3 
3 (d) 1.8194 
4 
ps(x) =1—2x2 + 2x" 9 (a) 0.7240 (b) 0.7240 
3 
43 1 10 (a) 4.2187 (b) 6.18 
8 (a) 2x* + 2x 
3 3 
(b) 1.816 


Technical Computing Exercises 18.4 


1 
1. Draw y(x) = — for 1 
x 


the fourth-order Taylor polynomial generated by y(x) 


about x = 3. 


<x < 8. On the same axes draw 3 Draw y(x) = Inx for 0.5 < x < 10. On the same axes, 
draw the third-, fourth- and fifth-order Taylor 
polynomials generated by y(x) about x = 1. 


2 Draw y = tanx for —1 < x < 1. Using the same axes, 
draw the fifth-order Taylor polynomial generated by 


y(x) about x = 0. 


TAYLOR’S FORMULA AND THE REMAINDER TERM 


So far we have found Taylor polynomials of orders 1, 2, 3, 4 and so on. Example 18.6 
suggests that the generating function, y(x), and the Taylor polynomials are in close agree- 
ment for values of x near to the point where x = a. It is reasonable to ask: 


“How accurately do Taylor polynomials generated by y(x) at x = a approximate to 
y at values of x other than a?’ 

‘If more and more terms are used in the Taylor polynomial will this produce a better 
and better approximation to y?’ 


To answer these questions we introduce Taylor’s formula and the remainder term. 
Suppose p,,(x) is an nth-order Taylor polynomial generated by y(x) about x = a. 
Then Taylor’s formula states: 


y(x) = p, (x) + RB, (&) 


where R,, (x) is called the remainder of order n and is given by 


yaa 
(n+ 1)! 


remainder of ordern = R, (x) = 


for some number c between a and x. 


The remainder of order 7 is also called the error term. The error term in effect gives 
the difference between the function, y(x), and the Taylor polynomial generated by y(x). 
For a Taylor polynomial to be a close approximation to the generating function requires 
the error term to be small. 
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Example 18.8 Calculate the error term of order 5 due to y(x) = e* generating a Taylor polynomial 
about x = 0. 


Solution Heren =5 andson+1 = 6. In this example a = 0. We see that y = e* and so 
YQ="¥ Oss =p =y" oe! 
and so y®)(c) = e°. The remainder term of order 5, R;(x), 1s given by 
e&x® 


R5(x) = 6! 


for some number c between 0 and x 


Example 18.9 (a) Calculate the fourth-order Taylor polynomial, p,(x), generated by y(x) = sin 2x 
about x = 0. 


(b) State the fourth-order error term, R, (x). 
(c) Calculate an upper bound for this error term given |x| < 1. 
(d) Compare y(0.5) and p,(0.5). 


Solution (a) y(x) = sin 2x, y(0) =0 
y (x) =2cos 2x, y (0) =2 
y"(x) = —4 sin 2x, y"(0) =0 
y®) (x) = —8 cos 2x, y®)(0) = —8 
y (x) = 16 sin 2x, y(0) = 0 


The fourth-order Taylor polynomial, p,(x), is 


) ” x (3) * (4) a 
Pale) =O yy Oey Oye ry Oty OF 


8x3 
a a Cea 


43 
= ee 


3 


(b) We note that y© (x) = 32 cos 2x and so y) (c) = 32 cos(2c). The error term, R, (x), 
is given by 


5 ae 
R,@) = POR 


__ 32cos(2c)x° 
- 120 
4 : : 
= 15 cos(2c)x° where c is a number between 0 and x 
(c) In order to calculate an upper bound for this error term we note that | cos(2c)| < 1 
for any value of c. Hence an upper bound for R, (x) is given by 
4x0 


R < |— 
| 4 (x)| 15 


1 
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5 
4 
We know |x| < 1, and so = is never greater than 1s Hence an upper bound for 


4 
the error term is 15° If we use p,(x) to approximate y = sin 2x the error will be no 


4 
greater than 5 provided |x| < 1. 
(d) We let x = 0.5. 
y(0.5) = sin 1 = 0.8415 


p4(0.5) = 2(0.5) — $(0.5) = 0.8333 


The difference between y(0.5) and p,(0.5) can never be greater than an upper bound 
of the error term evaluated at x = 0.5. This is verified numerically. 


y(0.5) — p,(0.5) = 0.8415 — 0.8333 = 0.0082 


and 
a 5 
IR,(0.5)| < 73(0.5)° = 0.0083 
EXERCISES 18.5 

The function, y(x), is given by y(x) = sinx. 4 (a) Find the third-order Taylor polynomial generated 
(a) Calculate the fifth-order Taylor polynomial by A(t) = ui about t = 2. 

generated by y about x = 0. f 
(b) Find an expression for the remainder term of iby Site ie eer erat 

order 5. (c) Find an upper bound for the error term given 
(c) State an upper bound for your expression in (b). I<t<4. 
Repeat Question 1 with y(x) = cos x. 5 The function y(x) = x° + x° is approximated by a 

: third-order Taylor polynomial about x = 1. 
. — ’ 
The function y(x) = e* may be approximated by the (a) Find an expression for the third-order error term. 
‘ ‘ x ‘ 
quadratic expression | + x + 2 Find an upper bound (b) Find an upper bound for the error term given 
for the error term given |x| < 0.5. O<x<2. 
Solutions 
ee x x4 
(a) x 31 =F 51 2 (a) 1 a =F n 
(sin c)x® (cos c)x® 

(b) ones Gas (b) a a 

where c lies between 0 and x where c lies between 0 and x 

x8 x® 

(c) 61 (c) 61 
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3) 3.435 x 10-2 (c) 0.5 
B 72 3¢ 5 (a) 5c(1 + 3c)(x — 1) for c between 1 and x 
B® -44+b- ys 
16 2 2 (b) 70 
¢—2)* . 
(b) 5 where c lies between 2 and t 


| 18.6 | TAYLOR AND MACLAURIN SERIES 


We have seen that y and its first n derivatives evaluated at x = a match p, (x) and its 
first n derivatives evaluated at x = a. We have studied the difference between y(x) and 
p, (x), that is the error term, R, (x). 

As more and more terms are included in the Taylor polynomial, we obtain an infinite 
series, called a Taylor series. We denote this infinite series by p(x). 


Taylor series: 


(x— a) 
2! 


2 
+y(a) 


_ aint 
p(x) = y(a) +y'(a)@—a) + y"(a) re 


(x — a)" 
n! 


See (a) ae 


For some Taylor series, the value of the series equals the value of the generating func- 
tion for every value of x. For example, the Taylor series for e*, sinx and cos x equal the 
values of e*, sin x and cos x for every value of x. However, some functions have a Taylor 
series which equals the function only for a limited range of x values. Example 18.14 
gives a case of a function which equals its Taylor series only when —1 < x < 1. 

To determine whether a Taylor series, p(x), is equal to its generating function, y(x), 
we need to examine the error term of order n, that is R, (x). We examine this error term 
as more and more terms are included in the Taylor polynomial, that is as n tends to 
infinity. If this error term approaches 0 as n increases, then the Taylor series equates to 
the generating function, y(x). Sometimes the error term approaches 0 as n increases for 
all values of x, sometimes it approaches 0 only when x lies in some specified interval, 
say, for example, (—1, 1). Hence we have: 


If R, (x) — 0 as n — ov for all values of x, then the Taylor series, p(x), and the 
generating function, y(x), are equal for all values of x. 

If R,, (x) — 0asn — oo for values of x in the interval (a, 6), then the Taylor 
series, p(x), and the generating function, y(x), are equal for x values on the interval 
(a, B). For values of x outside the interval (a, 6), the values of p(x) and y(x) are 
different. 


By examining the error terms associated with y = e*, y = sinx and y = cosx it is 
possible to show that these errors all approach 0 as n — oo for all values of x. Hence 
the functions y = e*, y = sinx and y = cos are all equal to their corresponding Taylor 
series for all values of x. 
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A special, commonly used, case of a Taylor series occurs when a = 0. This is known 
as the Maclaurin series. 


Maclaurin series: 


oa 
p(x) = y(0) + y/(O)x + yor = ee oO -+y” OF 


Example 18.10 Determine the Maclaurin series for y = e*. 


Solution In this example y(x) = e* and clearly y’(x) = e* too. Similarly, 
¥'@) =yP@) = =yP@) se 


for all values of n. Evaluating at x = 0 yields 


yO) =y'0) =y'@ =---=yPO=1 
and so 
x2 x x” 
PQalere ar 
3! n! 
As mentioned earlier, the series and the generating function are equal for all values of x. 
Hence, 
Pe 
e'=1+x + Fy 1+ 3y 31 alee 
for all values of x, that is 
: an = x" 
© La nl 


Example 18.11 Obtain the Maclaurin series for y(x) = sinx. 


Solution y(x) = sinx, y(0) = 
y' (x) = cos x, y(0) =1 
y"(x) = —sinx, y'O)= 
y® (x) = —cosx, yO) ==1 
y (x) = sinx, y® (0) =0 


0 3) (Q)x3 
y" (O)x* 4 (O)x 


p(x) = yO) +y'(O)x + 7 31 


3). 7! 


_ OT (= Lie! 
(2i+ 1)! 
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Since the generating function and series are equal for all values of x we have 
( 1)'x 2i+1 3 5 7 


; x x x 
snsm DG ae Rt at 


Example 18.12 Obtain the Maclaurin series for y(x) = cos x. 


Solution y(x) = cos x, y(0)=1 
y (x) = — sinx, y(0) = 
y" (x) =—cosx, y’0)=- 
y®) (x) = sinx, y)(0) =0 
y (x) = cos x, yO) =1 


and so on. Therefore 


PQ) = 0) +¥Ox+y" OS - OX, ce or 


From Examples 18.10, 18.11 and 18.12 we note three important Maclaurin series: 


4 


e=l+x+ — aoe +. for all values of x 

: . xed 

UE oe a a for all values of x 
exw x 

CS ng = a for all values of x 


Example 18.13 Find the Maclaurin series for the following functions: 
(a)y=e® (b)y=sin3x  (c)y=cos (5) 


Solution We use the previously stated series. 


(a) We note that 


2 3 4 


&=14+245 a ae 


18.6 Taylor and Maclaurin series 


Substituting z = 2x we obtain 


2 3 4 
=14%4 (2x) (2x) (2x) 


2! 3! 4! 
4 i) 2 A 
=i) 04072" 42> & 
3 3 
(b) We note that 
7 a ee 
ame E— aT st aI 


By putting z = 3x we obtain 
(3x)° | Gx)? Gx)’ 


sin 3x = 3x 
3! 5! 7! 
9 81 243 
—3 3 5 a 
*— 9% + 79" — 5607 T 
(c) We note that 
2 A 76 
i ae 
Putting z = , we obtain 
x ee G@/j2)* _ (/2)° 
cos (5) = 1 aT 61 
2 x x6 


8 7 384 46080 = 


Example 18.14 Determine the Maclaurin series for y(x) = Ea 
x 


Solution The value of y and its derivatives at x = 0 are found. 


ya) =, y) =1 
I@oO= aw y@)=- 
y'(x) = oe y’(0) =2! 
y"(a) = qa y"(0) =- 
yO") = a yO) = (-1)"2! 


Hence using the formula for the Maclaurin series we find 


x2 x n 


p(x) = 1-12) +215 - 31 + iy 
3! n! 


=1l-x4+7—-P4x4—---4(-1)"4+--- 
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: . 1 
It can be shown that this series converges to i for |x| < 1. Hence, 
x 


1 a 
i+% 


x+x Hees lH)" for |x| < 1 
0 


1 
For values of x outside (—1, 1) the values of T+ and 2 (—1)"x" are simply not equal; 
x 


try evaluating the l.h.s. and r.h.s. with, say, x = —2. 


Example 18.15 Find the Taylor series for y(x) = e~ about x = 1. 


Solution y=e™, yd) =e" 
y =-e%, y'(1) =-e7! 
y" =e, yd) =e"! 


and so on. Hence, 
1-1? _,@-1) 
nny 
x— 1)? -—1) 
@-1? @ ee 
2! 3! 


e*=e!-(e'!)@—I)t+e 


=efi- w+ 


e* =e! S°(-1)" (x _s 
, ! 


Example 18.16 Find the Maclaurin series for y(x) = x cos x. 


Solution The Maclaurin series, p(x), for cos x is 


 £ 
cosx= p@)=1- T+ a @ 
So the Maclaurin series for x cos x is xp(x), that is 
Sk 
XCOSX = xp(x) =x — aT = Ar 


An alternative form of Taylor series is often used in numerical analysis. We know 
that the Taylor series for y(x) generated about x = a is given by 


—a)y ae) 
y(x) = y(a) + y' (a) (x — a) + Or + ql die ig 
Replacing a by x) we obtain 
j ‘ (x — Xy)? @) (x — xX)? 
y(x) = YX) + Y' Hq) (K — XH) + Co oer eal +y rT cee 


If we now let x — x) = A, we see that 


/ ” h? hs 
YO +A) = VO) FY Hoh + y" Gp) 5, + ¥%) 3, foe 


To interpret this form of Taylor series we refer to Figure 18.6. 
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y@) 
Figure 18.6 
The value y(xp + /) can be estimated using values of y 
% Xoth - and its derivatives at x = Xp. 


If y and its derivatives are known when x = xp, then the Taylor series can be used to 
find y at anearby point, where x = x) +h. This form of Taylor series is used in numerical 
methods of solving differential equations. 


Engineering application 18.5 


Electric field of an electrostatic dipole 


Engineers often need to calculate the electrical field due to several stationary electric 
charges. These are known as electrostatic problems. One of the simplest electrostatic 
configurations is that of two charges of opposite polarity separated by a distance, d. 
This arrangement is known as an electrostatic dipole. It is illustrated in Figure 18.7 
for charges +Q and —Q. 

The origin of the x axis is located midway between the two charges so that the 
charge —Q has coordinate —d/2 and the charge +Q has coordinate d/2. 


d Figure 18.7 


—O——__o——— x Electrostatic dipole with two point charges of —Q 
and +Q coulombs respectively. 


We wish to calculate the combined electric field of the two charges as a function 
of the distance along the x axis. 
The electric field, E, of a single charge is given by 


__4 
Ane r? 


where q is the charge in coulombs, €) is the permittivity of free space (a constant 
of approximately 8.85 x 10~!? Fm’), and r is the distance from the charge to the 
measuring point. 

The electric field at point x due to the left-hand charge is 


=u 


dw 
4 i 
TE (x+ >) 


and for the right-hand charge the electric field at point x is 
0 


Erigut = rae 
ATE, (« = 5) 


Evgrr = 
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The total electric field, E,, can be obtained by adding the two contributions together 


=Q Q 
Ey = Evppy + Egicnr = 


= ae dw 
(<9) (9 
Do 2 


It is possible to gain further insight into the properties of the electrostatic dipole by 


carrying out a power series expansion. To prepare the equation we take a factor of 
1/x* outside of the brackets 


1 1 
Q = 


eee dy ay 
FIPS! | ie ee ieee 
fo% ( =) ( +5) 


Now consider the Maclaurin series of 


aa Following the process explained in 
—a 
the previous examples, the first five terms are calculated: 


1 
HO) Gea” y@) =1 
D) 
y(@)= ane y (0) = 2! 
3! 
y'(a) = =e y’(0) = 3! 
A! 
5! 
ya) = Geo. y (0) = 5! 
1 az a ae 
——— ! ! ! ! sien es 2 3 
H GigE eet ae Pp a Bietas is 
DOE sp 602, 
Following a similar process for the Maclaurin series of 1/(1 + a)”, 
1 
HO) Sean y(0) =1 
—2! 
y(a) = esa y (0) = —2! 
3! 
¥@Qe Gta)” iV @) = 3h 
—4! 
me Oe 
5! 
y (a) = ailmenye? y (0) =5! 


oe 


ee Ae a eye 

= —2!)a {!— + (—4!)— = ap oes 

(Caren 2! 3! A! 
=1-2a+ 3a? —4a3+5a4+--:. 


These two expansions may be used to approximate the total electric field by sub- 
stituting a = d/(2x). 


we obtain 
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The total electric field is thus 
E, = eeu [C+ 2a + 3a? + 40° + 5a* + ---) 
Arte yx 
— (1 — 2a + 30? — 4a? + 5a*+---)] 
Q 3 
if, = — Idi ee 
“ Tage ae, 
Providing x is much larger than d, then d/(2x), that is @ is small, and the higher 
order terms in the series become increasingly small. Thus we can approximate the 
field using only the first term, 
Q Oe Od 


Ate yx? big a 


 Amejx? 2x 2re 533 
It can now be seen that the total electric field of the dipole decays as x~? rather than 
x~*, as would be the case for a single point charge. 


E, = 


EXERCISES 18.6 


Use the Maclaurin series for sin x to write down the 5 (a) Find the Maclaurin series for 
Maclaurin series for sin 5x. y(x) = x? + sinx. 


Use the Maclaurin series for cos x to write down the 
Maclaurin series for cos 3x. 


Use the Maclaurin series for e* to write down the 


: . 1 
Maclaurin series for a 
e 


Find the Taylor series for y(x) = ./x about x = 1. 


(b) Deduce the Maclaurin series for 
y(x) = x" + sinx for any positive integer n. 


6 (a) Obtain the Maclaurin series for y(x) = xe*. 


(b) State the range of values of x for which y(x) 
equals its Maclaurin series. 


7 Find the Taylor series for y(x) = x + e* about x = 1. 


8 Find the Maclaurin series for yx) = In(1 + x). 


Solutions 
5x)3 5x)> ao Sat 
x Sr @ P4x-F45-7 
2 4 6 3 4 5 
1 (3x) (3x) (3x) — Mo x4 24 x x x 
2! 4! 6! 2° 3!) (4! 
1 x x x4 (b) For all values of x 
Xx t 
2 3! ‘ BS 2 = 48x 
x-1 (x — 1) (x — 1) 
I+ 7 46 a G1? Gal? Gai), 
Sq 1)" . - 30 ap 
3 3 5 7 8 x = x4 12 
- Xx 
(a) x+x2 ats ae a re ae 
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REVIEW EXERCISES 18 


1 A function, f(t), and its first and second derivatives (a) Calculate y’(1), y’(1) and y Sa). 
are evaluated at ¢ = 3. Find the second-order Taylor (b) State the third-order Taylor polynomial generated 
polynomial generated by f about t = 3. by y(x) about x = 1. 
f@G)=2 f'B) =f f"@) =] (c) Estimate y(1.25). 


2 A function, s(t), and its first derivative are evaluated 8 (a) Find the third-order Taylor polynomial generated 


att = 3; s(3) = 4, s/(3) =—1. by y(x) = e* about x = 1. 
(b) State the third-order error term. 


(a) Find the first-order Taylor polynomial generated 


by satt = 3. (c) Find an upper bound for the error term given 
(b) Estimate s(2.9) and s(3.2). Ix] <1. 
(c) If additionally we know s”(3) = 1, use a 9 (a) Find a quadratic approximation to y(x) = sin? x 
second-order Taylor polynomial to estimate about x = 0. 
s(2.9) and (3.2). (b) State the remainder term of order 2. 
3 Obtain a linear approximation to (c) State an upper bound for the remainder term 
yt) = at* + bt? + ct? + dt +e around t = 1. given |x| < 0.5. 
4 Find a quadratic approximation to y = x near x = 2. 10 (a) Find a cubic approximation to y(x) = xcosx 
about x = 0. 


5 Find li imations t 
a cals (b) State the error term of order 3. 


ast = 
a Sl (c) State an upper bound for the error term given 


(b) w(t) = sin3¢ neart = 1 |x| < 0.25. 
=. At 4 = 
(c) v(t) =e! + sin 3¢ neart = 1 11 Given that y(x) = x2, 


The fi ti tisfies th ti 
g GON, yx) salieles the equasien (a) Calculate the Taylor series of y(x) about x = a. 


y +xy —3y=xr 41 (b) Calculate the Maclaurin series of y(x). 
— / —_— 
HOY 1 ¥O)=2 12 Find the Maclaurin expansion of y(x) = In(2 + x) up 
(a) Evaluate y’(0). to and including the term in x*. 
(b) Differentiate the equation. 13 By considering the Maclaurin expansions of sin(kx) 
(c) Evaluate y (0). and cos(kx), k constant, evaluate if possible 
(d) Write down a cubic approximation for y(x). (a) Ii sin(kx) 
a) lim 
(e) Estimate y(0.5). x30 =X 
. . . cos(kx) — 1 
7 Given that y satisfies the equation (b) lim ————_ 
x0 x 
dy y sin(kx) 
dx i 2 sd ae (c) 0 1 — cos(kx) 
Solutions 
2 
1 ona et 5 (a) et (b) —2.97¢+3.11 


(c) —0.25¢ + 3.11 
6 (a) 4 (b) y” +xy" — 2y! = 2x 


2 
(c) 4 (d) 1+ 2x 4+ 2x2 + ul 
2 


2 (a) -t+7  (b) 4.1,3.8 — (c) 4.105, 3.82 


3) (44a+3b4+2c+d)t —3a—2b-—ct+e 


7 


x? 2 10 
(a) 0,2, —2 (b) 3 + 2x° — 3x+ 3 
(c) 2.0573 
e 5x 8 

me i Te +} 
(b) aie where c lies between | and x 

2e , 
(c) a 
(a) x 

2 sin(2c)x3 


for c between 0 and x 


(b) — 
(c) 0.07 


10 


11 


12 


13 


Review exercises 18 


3 


(a) x— oT 
x4 

(b) (4sinc + ccosc) a for c between 0 and x 
(c) 2x 1074 
(a) x2 (b) x? 
in? x x a xe x4 2 
n2- Sat 

2 8 24 64 
(a) k (b) 0 (c) not defined 
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INTRODUCTION 


The solution of problems concerning the motion of objects, the flow of charged particles, 
heat transport, etc., often involves discussion of relations of the form 
are ae ee Of bp Gnd 
bales —+2x= or — = sin 
dr? dt do 
; . . dx , 
In the first equation, x might represent distance. For this case aT is the rate of change 
2 
: : . : x : 
of distance with respect to time f, that is speed, and a represents acceleration. In the 
t 


second equation, g might be charge and + the rate of flow of charge, that is current. 


These are examples of differential equations, so called because they are equations in- 
volving the derivatives of various quantities. Such equations arise out of situations in 
which change is occurring. To solve such a differential equation means to find the func- 
tion x(t) or g(t) when we are given the differential equation. Solutions to these equations 
may be analytical in that we can write down an answer in terms of common elementary 
functions such as e’, sint and so on. Alternatively, the problem may be so difficult that 
only numerical methods are available, which produce approximate solutions. 
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In engineering, differential equations are most commonly used to model dynamic 
systems. These are systems which change with time. Examples include an electronic 
circuit with time-dependent currents and voltages, a chemical production line in which 
pressures, tank levels, flow rates, etc., vary with time, and a semiconductor device in 
which hole and electron densities change with time. 


BASIC DEFINITIONS 


In order to solve a differential equation it is important to identify certain features. Recall 
from Chapter 2 that in a function such as y = x? +3x we say x is the independent variable 
and y is the dependent variable since the value of y depends upon the choice we have 
made for x. 
In a differential equation such as 
dy 
— — 2y = 3x 
Ae y 
x is the independent variable, and y is the dependent variable. 
Similarly, for the differential equation 


dx 
—+7x=e 
dt =ae 
t is the independent variable and x is the dependent variable. 
We see that the variable being differentiated is the dependent variable. 
Before classifying differential equations, we will derive one. 


Engineering application 19.1 


An RC charging circuit 


Consider the RC circuit of Figure 19.1. Suppose we wish to derive a differential 
equation which models the circuit so that we can determine the voltage across the 
capacitor at any time, t. Clearly there are two different cases corresponding to the 
switch being open and the switch being closed. We will concentrate on the latter and 
for convenience assume that the switch is closed at t = 0. From Kirchhoff’s voltage 
law we have 


Ug = Up + Ue that is Up = Vs — Ue 
where vg is the voltage of the supply, uv, is the voltage across the capacitor, and vp is 
the voltage across the resistor. Using Ohm’s law for the resistor then gives 
Us — Uc 
R 
where / is the current flowing in the circuit after the switch is closed. For the capacitor 


i= 


dig _ Us — Ue 
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19.2.1 


Example 19.1 


Solution 


19.2.2 


Figure 19.1 
An RC charging circuit. 
and hence 
du 
RC + Ue = Us 


This is the differential equation which models the variation in voltage across the ca- 
pacitor with time. Here vu, is the dependent variable and ¢ is the independent variable, 
and when we are required to solve this differential equation we must attempt to find 
Uc as a function of f. 


Differential equations which have features in common are often grouped together and 
given certain classifications and it is usually the case that appropriate methods of solution 
depend upon the classifications. Some important terminology is now given. 


Order 


The order of a differential equation is the order of its highest derivative. 


State the order of 
dy dy | 


(a) 4 b) = an)? 
Fd. ay x ( 7 


a 
(a) The highest derivative is al a second derivative. The order is therefore two. 


dx 
(b) The only derivative appearing is DT’ a first derivative. The order is therefore one. 


Linearity 


d 
Recall that in a differential equation such as * + 3y = x’, the independent variable is 
x and the dependent variable is y. 
A differential equation is said to be linear if: 
(1) the dependent variable and its derivatives occur to the first power only, 
(2) there are no products involving the dependent variable with its derivatives, and 
(3) there are no non-linear functions of the dependent variable such as sine, exponen- 
tial, etc. 
If an equation is not linear, then it is said to be non-linear. Note from (2) that a product 
d 
of terms involving the dependent variable such as i is non-linear. Note from (3) that 


the existence of terms such as y’, sin y and e” causes an equation to be non-linear. 


Example 19.2 


Solution 


19.2.3 


Example 19.3 


Solution 
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Note also that the conditions for linearity are conditions on the dependent variable. 
The linearity of a differential equation is not determined or affected by the presence of 
non-linear terms involving the independent variable. 

The distinction between a linear and a non-linear differential equation is important 
because the methods of solution depend upon whether an equation is linear or non- 
linear. Furthermore, it is usually the case that a linear differential equation is easier to 
solve. 


Decide whether or not the following equations are linear: 
(a) sins + 
a) sinx— 4 
dx - 


6) dee? 
a x= 
dt 

dy 
dx2 


Gi ape 
—— sny = 
ae y 


() 5 +y =0 


In (a), (c) and (d) the dependent variable is y, and the independent variable is x. In (b) 
the dependent variable is x and the independent variable is f. 
(a) This equation is linear. 


(b) This equation is linear. It does not matter that the term in ft, the independent variable, 
is raised to the power 3. 


(c) This equation is non-linear, the non-linearity arising through the term y’. 


(d) This equation is non-linear, the non-linearity arising through the term sin y. 


The solution of a differential equation 


The solution of a differential equation is a relationship between the dependent and in- 
dependent variables such that the differential equation is satisfied for all values of the 
independent variable over a specified domain. 


Verify that y = e” is a solution of the differential equation 


dy _ 
as y 

dy ; dy 7 
If y = e* then — = e’. For all values of x, we see that — = y and soy = e* isa 


solution. Note also that this equation is first order and linear. 


There are frequently many different functions which satisfy a differential equation; that 
is, there are many solutions. The general solution embraces all of these and all possible 
solutions can be obtained from it. 
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: dy 
Example 19.4 Verify that y = Ce" is a solution of a where C is any constant. 


Solution 


Example 19.5 


Solution 


d d 
If y = Ce’, then — Ce*. Therefore, for all values of x, ;. = y and the equation 


is satisfied for any constant C; C is called an arbitrary constant and by varying it, all 
possible solutions can be obtained. For example, by choosing C to be 1, we obtain the 


d 
solution of the previous example. In fact, y = Ce’ is the general solution of ;. =y. 


More generally, to determine C we require more information given in the form of a 
condition. For example, if we are told that, at x = 0, y = 4 then from y = Ce* we have 


4=C&=C 


so that C = 4. Therefore y = 4e* is the solution of the differential equation which 
additionally satisfies the condition y(0) = 4. This is called a particular solution. In 
general, application of conditions to the general solution yields the particular solution. 
To obtain a particular solution, the number of given independent conditions must be the 
same as the number of constants. 

Consider the following example. 


Consider the second-order differential equation 


dy 
aa t?=° 


The general solution of this equation can be shown to be 
y =Acosx+ Bsinx 


where A and B are arbitrary constants. 
Find the particular solution which satisfies the conditions 


(a) when x = 0, then y = 0, and 


37 
(b) when x = then y = 1. 


We note that because the general solution has two arbitrary constants, A and B, then two 
conditions are necessary to obtain a particular solution. Applying the first condition to 
the general solution gives 


0=Acos0+ BsinO 
=A 


Therefore A = 0, and the solution reduces to y = B sin x. Applying the second condition 
we find 


_ 3n 
1 = Bsin — 
2 


Example 19.6 


Solution 


EXERCISES 19.2 


d 
Verify that y = 3 sin 2x is a solution of 2 + 4y = 0. 


Verify that 3 e*, Ax e* 
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from which 
1 = B(-1) 
B=-1 


The particular solution becomes y = — sin x. 
Sometimes the conditions involve derivatives. 


For the differential equation of Example 19.5 find the particular solution which satisfies 
the conditions 


(a) when x = 0, then y = 0, and 
d 

(b) when x = 0, then > =5. 
dx 


Application of the first condition to the general solution y = A cos x + Bsinx gives 
0 =Acos0+ BsinO 
=A 


Therefore A = 0 and the solution becomes y = Bsin x. To apply the second condition 
we must differentiate y: 


es Bcosx 
dx 
Then applying the second condition we get 
5 = Bcos0 
=B 
so that B = 5. Finally the required particular solution is y = 5 sin x. 


In this example both conditions have been specified at x = 0, and are often referred 
to as initial conditions. 


4 Verify that y = Acosx + Bsinx satisfies the 


differential equation 


, Axe“ + Be*, where A, B are 


2 
constants, all satisfy the differential equation 3 +y=0 
@?y dy ; 
G2 ae +y=0 Verify also that y = A cosx and y = Bsinx each 
individually satisfy the equation. 
: = 72) . : d 
Verify that x = t* + Alnt + Bisa solution of 5 If y = Ae is the general solution of ;. = 2y, find 
ae dy the particular solution satisfying y(0) = 3. What is 
Te) + a a the particular solution satisfying ~ = 2 when x = 0? 
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Identify the dependent and independent variables of 
the following differential equations. Give the order of 
the equations and state which are linear. 


Show that x(t) = 7 cos 3t — 2 sin 2r is a solution of 


dx : 
— + 2x = —49cos 3t + 4sin 2t 
dr? 

The general solution of the equation 

a aren 

— —3— + 2x = Ois given 

dr? dt - J 
x=Ae + Be” 

Solutions 


5 yx) = 3e™, v(x) = e* 
(a) yis the dependent variable; x is the independent 
variable; first order, linear 


(b) y is the dependent variable; x is the independent 
variable; second order, non-linear 


10 


Find the particular solution which satisfies x = 3 and 


— =5whent = 0. 
at when 


Th faction ot © 20 a yi 
e general solution of —> — 2— = Ois 
g ae a 

y = Axe* + Be". Find the particular solution 


d 
satisfying y(0) = 0, 2 (0) = 1. 
dx 
. dx 
The general solution of az = Tw 


x =Ae™ + Be, where j* = —1. Verify that this 
is indeed a solution. What is the particular solution 


2x is 


dx 
satisfying x(0) = 0, a (0) = 1? Express the general 


solution and the particular solution in terms of 
trigonometric functions. 


(c) xis the dependent variable; t is the independent 
variable; third order, non-linear. 

x=el +207 

x 


y=xe 


particular solution: sin wt/w; general solution: 
(A + B) cos at + (A — B)j sin wt 


F 19.3 | FIRST-ORDER EQUATIONS: SIMPLE EQUATIONS 
AND SEPARATION OF VARIABLES 


19.3.1 Simple equations 


The simplest first-order equations to deal with are those of the form 


o = fo) 


where the r.h.s. is a function of the independent variable only. No special treatment is 
necessary and direct integration yields y as a function of x, that is 


y= f foyer 


Example 19.7 


Solution 


19.3.2 
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d 
Find the general solution of ;, = 3cos 2x. 


d 
Given that ac 3.cos 2x, then y = f 3cos2xdx = s sin 2x + C. This is the required 
dx 


general solution. 


i = ie) theny = f fo) de 


Separation of variables 


When the function f on the r.h.s. of the equation depends upon both independent and 
dependent variables the approach of Section 19.3.1 is not possible. However, first-order 
equations which can be written in the form 


dy _ 19.1 
a = f(x)g(y) (19.1) 


form an important class known as separable equations. For example, 


dy 
dx 


=3’e% 
is a separable equation for which 
f@=32 and = gy) =e” 


To obtain a solution we first divide both sides of Equation (19.1) by g(y) to give 


g(y) dx 


Integrating both sides with respect to x yields 


i 1 is / 1 d fr ) dx 
— — = — — x 
gy) dx ZO) > 


The equation is then said to be separated. If the last two integrals can be found, we obtain 
a relationship between y and x, although it is not always possible to write y explicitly in 
terms of x as the following examples will show. 


Separation of variables: 


d 
The solution of = = f(x)g(y) is found from 


il 
—= aly = dx 
ao” [re 
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Example 19.8 


Solution 


Example 19.9 


Solution 


Solve dy = a 
dx 


1 
Here f(x) = e™ and g(y) = —. Multiplication through by y yields 
y 


dy = 

— +-e- 
ax 

Integration of both sides with respect to x gives 


rus [eva 


so that 


2 
v = 
I me C 
5 ev + 


Note that the constants arising from the two integrals have been combined to give a 
single constant, C. Finally we can rearrange this expression to give y in terms of x: 


y =—2e*+2C 


that is, 
y=t+VJVD—2e* 
where D = 2C. 


It is important to stress that the constant of integration must be inserted at the stage at 
which the integration is actually carried out, and not simply added to the answer at the 
end. 


d 
Solve ;, = 3x’ e” subject to y(0) = 1. 


Here g(y) = e~’ and f(x) = 3x’. Separating the variables and integrating we find 


feos [ara 


so that e” = x3 + C. Imposing the initial condition y(0) = 1 we find 
e' = (0) +C 

so that C = e. Therefore, 
e&=xr+e 


Note that since the exponential function is always positive, the solution will be valid 
only for x* + e > 0. Taking natural logarithms gives the particular solution explicitly: 


y = In(v +e) 
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Example 19.10 Solve 


Solution 


Example 19.11 


Solution 


oe ai 
dt e+ 


Separating the variables and integrating we find 


[erie [esis 


Therefore, 
a a 
Sb eS EEG 
aa 


which is the general solution. Here we note that x has not been obtained explicitly in 
terms of t, although we have found a relationship between x and ¢ which satisfies the 
differential equation. To obtain the value of x at any given ¢ it would be necessary to 
solve the cubic equation. 


Sometimes, equations which are not immediately separable can be reduced to separable 
form by an appropriate substitution as the following example shows. 


By means of the substitution z = se solve the equation 
xe 


dy yoy 

S=i+t+4+1 19.2 

dx x? - x = ape) 

Ifz= ss then y = zx. Because the solution, y, is a function of x the variable z depends 
x 


d d 
upon x also. The product rule gives ;. = oe so that Equation (19.2) becomes 


hae =¢+z+1 
= 
that is, 
dz 2 
ony 1 
a, z+ 


This new equation has independent variable x and dependent variable z, and is separable. 
We find 


| dz -/[< 
2+1 J x 


so that tan~! z = In |x| + C. Writing C = In |D| we have 


tan”! z = In |x| + In|D| = In|Dx| 


so that z = tan(In|Dx|). Returning to the original variables we see that the general 
solution is 


y = zx = xtan(In |Dx|) 
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Engineering application 19.2 


RL circuit with step input 


Write down the differential equation governing the current, i, flowing in the RL circuit 
shown in Figure 19.2 when a step voltage of magnitude E is applied to the circuit at 
t = 0. Solve this differential equation to obtain i(t). Assume that when t = 0, i = 0. 


Solution 
Applying Kirchhoff’s voltage law and Ohm’s law to the circuit we find 
di 
Re =e fore yO 
dt 
that is, 


i 


EIR 
Closed 0.63E/R 
at t=0 
T t 
Figure 19.2 Figure 19.3 
A step voltage is applied to the circuit at Response of the circuit of Figure 19.2 to 
/ =O. a step input. 


Note in particular that in this equation L, E and R are constants and so the variables i 
and t have been separated. If the applied voltage, E, varied with time this would not 
have been the case since the I.h.s. would contain terms dependent upon tf. Integrating, 
we find 
iL, . Lb —-R i, ; 
Hl Cy — il — di = ——In(E —iR) =t+C 
E-iR R E-iR R 

To find the constant of integration, C, a condition is required. The physical condition 
i = 0 at t = 0 provides this. Applying i = 0 when t = 0 we find 


It 
Ca =f lhnvs 
R 
Substituting this value gives 
I . IL 
——In(E —iR) =t——InE 
R R 
Thus, 


STs In(E —iR)] = 
ale —In(E —iR)| =t 


1 


EXERCISES 19.3 


19.3 First-order equations: simple equations and separation of variables 


so that 
ib E 
—In — It 
R E-iR 
Then 
E Rt 
In = 
E-iR I, 
hence 
E 
: = ekt/L 
E-—iR 


Rearranging to obtain E gives 

E = (E —iR)e/+ 
hence 

B= Fe Rent! 
and so 

jhe — Bie) 
so 

UK Elen) 
Finally we have 


ae fe 
i= Al =e i) 
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The graph of this current against time is shown in Figure 19.2. We note that ast — oo, 


E 
i — —. The rate at which the current increases towards its final value depends upon 


the values of the components R and L. It is common to define a time constant, tT, for 


cake L 
the circuit. In this case t = — and the equation for the current can be written as 


i= =(1 —e/") 


The smaller the value of t, the more rapidly the current reaches its final value. It is 
possible to estimate a value of t from a laboratory test curve by noting that after one 


time constant, that is t = t, i has reached 1 — e~! © 0.63 of its final value. 


Find the general solution of the following equations: 


(a) 


(c) 


dy 
dx 


dy 
dx 


(b) 


dx 
—=5 
dt 
d 
Pe sey 
dt 


(e) 


(g) 


(@) 


(f) 


(h) 


) 
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dy 6sinx dx  9cos4t dy 
(k) ad — ; (1) rr = = ee (b) > = —ky, k constant 
dx  3cos2r +8 sin 4t (c) oe y 
(m) = dx 
dt x2 +x dy 
: : ; : (d) y— = sinx 
2. Find the particular solution of the following dx 
equations: dy _ 2 
rs i) tt 
a) — =3t, x(0) = 1 d 
@ & ” () P= 27 + yx 
“) dx 
dy 6x 4 
(b) — = —, yO) =1 dx f 
dx a (g) de =35 
dy — 3sint 0) =2 : 2 
(©) dt y ? Os 5 Find the general solutions of the following equations: 
dy e* dx dy x 
qd) —~=—,  y(0)=3 — =nxt b) === 
dx y Oa ~ Mary 
dx 4sint+6cos2t dx 
(e) = , x(0) =2 (c) t— =tanx 
dt x dt 
dx 2 
3 Find the general solution of — = Int. Find the (d) = = l 
t t 


t 
particular solution satisfying x(1) = 1. 


: : : . 6 Find the general solution of the equation 
4 Find the general solutions of the following equations: 


— = f(x — 2). Find the particular solution which 


d dt 
(a) . = kx, k constant satisfies x(0) — 5. 
Solutions 
1 (a) y=3x4+ec (b) x =S5t+e 3) tin|t} -—t+c,tIn|t|_-t+2 
(ce) y=x? +e (d) y=30 +c ke 
(ce) y= =e +e (f) x= "—tt+c 
3 4 (b)Ae™ 
= es eee : 
- a @ 5 4 CL nwe 
2 3 —2x 
(i) Sad¢te (i) > =04 (d) y? = 2(c — cos x) 
2 go 3g (e)y2 =x +4x+ce 
(k) + =c-—6cosx gd) ~—=-—sin4t+c x 
: _ 2 © 77a 
(m) aes > nde Buel? 7 a 
m = -sin cos t 
a 7° 3 . wie 
‘ (g) 675 +e 
Mm (a) x= 2? +1 b) yr =4e 41 , 
(a) 2 ( y 5 (a) x=Ael/2 (b) y=xrrte 
2 
es= 2—11-2e* 1+ Ar 
(c) 5 =5-—3cost (d) y =11-—2e @ 2=atw aie ~ : 
; 1—At 
(c) 5 = 3sin2r — 4cost + 6 M24 Ac®?2.243¢/2 
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Example 19.12 


Solution 
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FIRST-ORDER LINEAR EQUATIONS: USE OF AN 
INTEGRATING FACTOR 


In this section we develop a method for solving first-order linear differential equations. 


Exact equations 
Consider the differential equation 


dy _ 
a 


This can be solved very easily by simply integrating both sides to give 


y= [ore 


=x+¢ 


3x7 


where c is the constant of integration. An equation which can be solved by integrating 
both sides is said to be an exact equation. A more complicated example which, never- 
theless, can be solved in the same way is 


d 2 
—_ = 3x2 
Fe 
Integrating both sides we find 
x= i 3x7 dx 
=x +c 
and dividing through by x gives the general solution 
c 
yart- 
x 


The differential equation we have just solved is an exact equation. 


Solve the equation 
= (x’y) = cosx 


dx 


Integrating both sides we find 


ry = [cosas 
=sinx+c 
so that 
sin.x c 
=o 


Xx 
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Example 19.13 


Solution 


Consider again the differential equation of the previous example: 
d 
an y) = cosx 
Using the product rule for differentiation we can expand the |.h.s. as follows: 
do 2 dy 
= =~ 42 
Oey =F tay 
Doing this, the differential equation can be written in the equivalent form 
d 
ve + 2xy = cosx 


Suppose we had posed the question in this form. A method of solving this equation 
would be to recognize that the equation is exact and that the l.h.s. could be written as 


: (x°y) 
— (xy). 
dx " 
It is easy to recognize an exact equation because it will always take the form 


O26 pet) 
be thy =F 


where jz is some function of x. That is, the coefficient of y is the derivative of the coef- 


d d 
ficient of — When this is the case the |.h.s. can be written ah (Ly). 


The following equations are exact. Note in each case that the coefficient of y is the 


d 
derivative of the coefficient of a Solve them. 
dy 
a 304 3 2 = et 
rae 
(b) cosx® — (sinx)y = 1 
cosx— — (sinx)y = 

de y 

(a) The equation can be written 


d 3 2% 
an y)=e 
and so, upon integrating, 


xy = ‘| e*dx 


et 


=a +¢ 

so that 
e*  ¢ 
738" 8 


(b) The equation can be written 


: =1 
= ((c0s.x) y) = 


1 


1 


EXERCISES 19.4. 


Each of the following 


d 
(a) ee + 2xy = x 
Solutions 


' ~F,C¢ 
ae ee dae 


19.4.2 
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and so, upon integrating, 
(cosx)y=x+c 


so that 


Xx Cc 


COS X 


COS X 


1 
equations is exact. Solve them. (b) as dy = 2. =5x° 
wedx 3° 
dy 
x Serr ao 
(c) e (>+ 4 cos x 


(c) y=e*sinx+Ce~* 


5x6 
(b) y= +08 | 


A preliminary result involving separation of variables 

Consider the following differential equation for the dependent variable ju: 
du 
ao 


where P is some function of x only. Using separation of variables we have 


uP (19.3) 


and integrating both sides 
1 
—du = | Pdx 
bh 
Ing = i Pdx 


w=e if Pdx 
In this development the constant of integration has been omitted. The reason for this will 
be apparent in what follows. So the solution of Equation (19.3), for any function P(x), 
isu = ef Par, 

du 


1 
For example, if P(x) = —, then Equation (19.3) is an a 
x 


— and its solution is 
x 


p= ef (/xdx = elnx =x. 
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19.4.3 


19.4.4 


First-order linear equations 


First-order linear differential equations can always be written in the ‘standard’ form 


° + PQ)y = Q) (19.4) 


where P and Q are both functions of the independent variable, x, only. In some cases, 
either of these may be simply constants. 
An example of such an equation is 


d 
7 + 3xy = 7x 


Comparing this with the standard form in Equation (19.4), note that P(x) = 3x and 
Q(x) = 7x’. As a second example consider 


d 2 

ge = 4e” 

dx x 

2 . : : : 
Here P(x) = —-— (note in particular the minus sign) and Q(x) = 4e™. 
Finally, in the equation 

dy 

—~ _5y= si 

ae y = sinx 


note that P(x) is simply the constant —5. 
Variables other than y and x may be used. So, for example, 


dx . 
—+8tx = 3t —5t 
dt 


is a first-order linear equation in the form of Equation (19.4) but with independent vari- 
able t and dependent variable x. Here P(t) = 8t and Q(t) = 317 — 5t. 
In what follows it will be important that you can distinguish between the dependent 
and independent variables, and also that you are able to identify the functions P and Q. 
Equations such as these arise naturally when modelling many engineering applica- 
tions. For example, the equation which determines the current flow in a series RL circuit 
when the applied voltage takes the form of a ramp is given by 


R t 
This is a first-order linear equation in which P(t) is the constant Z and O(t) = i You 


will learn how to solve such equations in the following section. 


The integrating factor method 


All first-order linear equations, even when they are not exact, can be made exact by 
multiplying them through by a function known as an integrating factor. As we have 
seen, the solution then follows by performing an integration. For example, the linear 
equation 

dy 3 e* 

asx 2 
is not exact, but multiplying it through by x° produces the differential equation in 
Example 19.13(a) which is exact. 
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Consider again a first-order linear equation in standard form: 


dy 
—4+Py= 19. 
= tPy=9 pe 


where P and Q are functions of x only. We are aiming to solve this and express the 
dependent variable y in terms of the independent variable x. The aim is to multiply (19.5) 
through by a function jz to make the equation exact. That is, so that the I.h.s. can be 
written in the form 


uy) 
ay 
At this stage the function jz is not known. Multiplying (19.5) through by ju yields 


dy 
— 4+ Py = 19.6 
we + wPy = uO (19.6) 


d 
If the 1.h.s. is to equal a (jy) then we must have 
© (uy) = w+ uP 
qy UY) Sg + Py 
Expanding the I.h.s. using the product rule gives 


dy du dy 
5 = P 
Be ae a ad 


which simplifies to 


du 

=F) =P 

y ie Mery 
and consequently 


du 
— = uP 
a 


Using the result in Section 19.4.2 we see that this equation has solution 


w= e/ Pax 


The function jz is called an integrating factor and is a function of x only. With this 
choice of 2, the I.h.s. of (19.6) is the same as ch (yy) and hence (19.6) can be written 
< (uy) = nO 
ay sy) = 
This exact equation can be solved by integration to give 
by= / LO dx 


and consequently 


1 
y= = f uoas 
mm 
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Example 19.14 


Solution 


In summary: 


Given any first-order linear equation in standard form 


ee =Q 
Co. 


where P and Q are functions of x, the integrating factor jz is given by 


pel Pe 
and the solution of the equation is then obtained from 


wy = f woes 


It is important when working on a particular differential equation to rewrite the stan- 
dard formulae in the correct form before use. Essentially this means using the correct 
dependent and independent variables in the equations. For example, if x is the dependent 
variable and ¢ is the independent variable then the equations are as follows: 


The integrating factor for 
dx 
= 4p Pe = 
dt g 
where P and Q are functions of t, is given by 


pw=e [Pat 
and the solution of the equation is obtained from 


wx= f wow 


These results are illustrated in the examples which follow. 


d 
Solve the differential equation *. + a 1 using the integrating factor method. 
x 


Referring to the standard first-order linear equation 


dy P _ 
ce + P(x)y = Q(x) 


1 
we see that P(x) = — and Q(x) = 1. Using the previous formula for jz(x), we find 
x 


w(x) = ef W/ ee 
= elnx 


=X 
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Then from the first key point on page 552 with « = x and Q = | we have 


se 
ya fxdv= 54 


x  C 
and finally y = + — is the required general solution. 
x 


Engineering application 19.3 


RL circuit with ramp input 


Recall from Section 2.4.7 that a standard ramp function has value 0 fort < 0 anda 
value ct for t > 0. This is shown in Figure 2.46. A voltage ramp signal with a value 
c = | is applied to an RL circuit. The arrangement is shown in Figure 19.4. The 
differential equation governing the current flow, i(t), in this circuit is given by 
di 
iR+L— =t for 20 i(0)=0 

Show that this equation can be written in the form of Equation (19.4). Hence use the 
integrating factor method to find i(f). 


i 


v(t) 


R 
0 t 
L_ Figure 19.4 
A ramp input signal applied to a series RL 
circuit. 
Solution 
This equation can be written as 
di R t 
—+ -i=- for i 2S 
(0) ny Se 5 


which is a first-order linear equation. 
Note in this case that the independent variable is t and the dependent variable is i. 
In standard form, we have 


= P(t)hi = Q(t) 
rea ae 


R t 
where P(t) = Fi and Q(t) = re The integrating factor, ju, is given by 


= ef Pode 
= e/ (R/L) dt 


= eft /L 


lL 
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ie®t/E = xf retat 
lL, 


1 teRt/L ceaee 
DEORE if R/L 


teRt/LE It, 7 
= — atti 
ae ee 
where K is the constant of integration. The general solution is 
t iL 
i= ——-—+Ke */! 
R R 


When ¢ = 0, i = O. This gives the initial condition required to find K. Applying 
i(0) = 0 gives K = L/R’ so that the particular solution is 


In many engineering applications the terms transient response, steady-state response, 
zero-input response, and zero-state response arise through the solution of differential 
equations. These terms are explained in the following example. 


Engineering application 19.4 


RC circuit: zero-input response and zero-state response 


The differential equation which is used to model the variation in voltage across a 
capacitor, uv, (t), in a series RC circuit was derived in Engineering application 19.1: 


du 
RC—= + Ue = Us (t) 
where vu. (t) is the applied voltage. Suppose that this applied voltage takes the form 
Us(t) = V cos wt. Suppose also that when the switch is closed at t = 0, the initial 


voltage across the capacitor is Vo, that is v.(0) = Vo. 


(a) Show that this equation can be written in the form of Equation (19.4); that is, it 
is a first-order linear equation. 


(b) Use the integrating factor method to obtain the particular solution of this equation 
which satisfies the given initial condition. 


(c) Obtain the particular solution of the equation subject instead to the initial con- 
dition v, (0) = 0. It corresponds to the response of the system when there is no 
initial energy in the circuit. This solution is referred to as the zero-state response. 


(d) Obtain the solution of the equation in the case when the supply voltage is identi- 
cally zero, v,(t) = 0, subject to the given condition vu, (0) = Vo. This solution is 
often referred to as the zero-input response, and represents the response of the 
system when the input is zero. 
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(e) Show that the solution in (b) can be written as the sum of the zero-state response 
and the zero-input response. 


(f) Identify the transient and steady-state terms in the solution to part (e). 


Solution 
(a) The given differential equation can be rewritten as 
dug 1 1 
i Ug (t 
dior one: ros 
Comparing the form of this equation with (19.4) we see that it is a first-order 
linear equation, with independent variable t, and dependent variable v,, in which 


1 1 
AG) = RC and O(t) = Ro’s (t). 


The integrating factor is given by 


(b 


wm 


pe = ef A/RC)At — gt/(RC) 


It follows from the second key point on page 552 that 
V 
ey = Me / e! 8) cos wot dt 


This integral can be evaluated using integration by parts twice following the tech- 
nique in Example 14.4. You should verify that 


R2C2 et/(RO) 
i e/®O) cos wt dt = 


R2C2o? + 1 


COS =] 


sin wt 
[~ ot + RC 


+ constant of integration 


Then 
VRCet RO) 
C™ R2C2q? + 1 


cos wt 
RC 


ef /(RC),) 


[esinor + |+« 


from which 

es VRC 

= Cael 
This is the general solution of the differential equation. Applying the initial con- 
dition gives us a value for K. When t = 0, vp = Vo, so 


cos wt 
Ke t/RO) 
RC lf: 


Uc [o sin wt + 


TE lac|+* 
° R2?C2@? +1 | RC 
from which 
Pn ee 
9° RCw? +1 
Finally, 
ve= mga [osina+“SA] + (Yy- aae) em 


This is the particular solution which satisfies the given initial condition. It tells 
us the voltage across the capacitor as a function of time f. 
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(c) 


(d) 


(e) 


(f) 


To find the particular solution subject to the condition v.(0) = 0 we need only 
replace Vp by 0 in the particular solution obtained in part (b). Hence the zero-state 
response is 


VRC cos wt V 
vel) = ay “i 


RCo +1 RC |) RCw +1 

In the case of zero input we have v,(t) = 0. In part (b) we solved the equation 
with v,(t) = V cos wt. Hence, putting V = 0 in the solution to part (b) will yield 
the solution when v(t) = 0. So, 


Ue(t) = Vye FO) 


[e sin wt + 


Alternatively we can note that when v, (t) = 0 the original equation becomes 
duc 1 
= + Sp = W 
dr | RCC 
subject to v.(0) = Vo. This can be solved using separation of variables. So 


1 dug 1 
Uc dt RC 
Integration yields 
1 | 
— dite = — | = dt 
le RC 
if 
Inve = “Re +k (k a constant) 
Ve = en t/ ROK 
SCO) 
SA eee) 


where A is the constant e*. Applying the initial condition v.(0) = Vy we find 
VY, = A and so finally 


Uc(t) = Vo Qe®) 
as before. 


Inspection of part (b) shows that it is the sum of the zero-input response and the 
zero-state response: 


zero-input zero-state response 
response 
—_—— oo 
VRC COS wt V 
=V, eo t/(RC) i = —t/(RC) 
Uc (t) =Vo € + Cw 1 [e sin wt-+ RC | RCo 41 


In this example, the terms involving e~’/“*© tend to zero as t increases, and are 
termed transients. Once the system has settled down their contribution will not 
be important. The remaining terms represent the longer term behaviour of the 
system or the so-called steady state. 

Hence the transient terms are 
V 


—t/(RC) 
RCo? +1 


MG ee / RO) _ 


1 


1 
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and the steady-state terms are 


VRC 
RC? + 1 


[o sin wt + 


557 


COs =] 


RC 


EXERCISES 19.4.4 


Find the general solution of the following equations: 


dy dy 
(a) = +y=1 (b+) 7 +2y=6 


dx 


OF pac d 
©) 7 to=4 @ 


dy _ 
(e) a dee (f) 


So ByS2 
dx y 


dx 
— =3x-8 
dt 


Find the particular solution of the following 
equations: 


ae =7 0O)=1 
a) = — = 
‘y ; y 


ai =4 0) =2 
OW) -r=4, x0) = 
y(0) =2 
yl) =2 


dx 
Find the general solution of a 2x + 4t. What is 


the particular solution which satisfies x(1) = 2? 


Solutions 


(a) y=1l+ce* 
(b) y=3+ce 


2 
(c) x= q +ce™ 


) y=ce* — 


NN] WW] bh 


(e) y=ce™ — 


(f) ra tce 


7 3 —4x 
(a) rae aT has 
(b) x= 6e! —4 

8 3t 
y= ae 


(d) y=2 


10 


Find the general solution of ] +y=2x+5. 
Solve a =t—tx, x(0) =0. 
dt 
Use an integrating factor to obtain the general solution 
of iR+ i = sin wt, where R, L and w are constants. 


dy 4 
Solve x— +y=x". 
"ax _ 


Use an integrating factor to find the general solution 


dx 
of t— +x =3t. 
dt 


dx 
Find the general solution of - + 2xt = t. Find the 


particular solution satisfying the condition 
x(0) = —1. 


Find the general solution of 


t 


. e 
tx +3x= =z 
t 


14 5e2t-D 


2x+3+ce* 


l= et? /2 


L((R/L) sin wt — w cos at) —Rt/L 
+ ce 
R? + L2@? 
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19.5.1 


19.5.2 


SECOND-ORDER LINEAR EQUATIONS 


The general form of a second-order linear ordinary differential equation is 


2; 
ross +4 + rey = FO) (19.7) 


where p(x), g(x), r(x) and f(x) are functions of x only. 
An important relative of this equation is 


ay dy =) 19.8 
PX) T+ a@)a trey = (19.8) 


which is obtained from Equation (19.7) by ignoring the term which is independent of y. 
Equation (19.8) is said to be a homogeneous equation — all its terms contain y or its 
derivatives. Equation (19.7) is said to be inhomogeneous. 

For example, 


dy 

x 2 

is an inhomogeneous second-order linear equation in which p(x) = x”, g(x) = x, r(x) = 
x’ — | and f(x) = e*. The associated homogeneous equation is 


Py 


xT 
The following properties of linear equations are necessary for finding solutions of 
second-order linear equations. 


d 
+25 Ae lhy=e” 


d 
+x +00 l)by =0 


Property 1 


If y, (x) and y,(x) are any two linearly independent solutions of a second-order homo- 
geneous equation then the general solution, y,;(x), is 


Vy (x) = Ay, &) + By, (x) 


where A, B are constants. 

We see that the second-order linear ordinary differential equation has two arbitrary 
constants in its general solution. The functions y, (x) and y, (x) are linearly independent 
if one is not simply a multiple of the other. 


Property 2 


Let y,(x) be any solution of an inhomogeneous equation. Let y,,(x) be the general so- 
lution of the associated homogeneous equation. The general solution of the inhomoge- 
neous equation is then given by 


y(X) = Vy) + yp(x) 


In other words, to find the general solution of an inhomogeneous equation we must first 
find the general solution of the corresponding homogeneous problem, and then add to it 
any solution of the inhomogeneous equation. 

The function y,, (x) is known as the complementary function and y, (x) is called the 
particular integral. Clearly the complementary function of a homogeneous problem is 


Example 19.15 


Solution 


Example 19.16 


Solution 
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the same as its general solution; we shall often write y(x) for both. If conditions are given 
they are applied to the general solution of the inhomogeneous equation to determine any 
unknown constants. This yields the particular solution satisfying the given conditions. 


a 
Verify that y,(x) = x and y,(x) = | both satisfy 3 = 0. Write down the general 
solution of this equation and verify that this indeed satisfies the equation. 
2 2 


SD 2, cotta, sides © Oe SA 
de” * @ 2 >™ 


ie vibe and 
yi = ae 


dy, d’y, . dy 
then a 0 and ae > 0, so that y, satisfies qe 0. From Property 1, the general 
2 


: d°y : 
solution of —~ = 0 is 
de2 


Yy(x) = Ax + BCL) 


=Ax+B 
To verify that this satisfies the equation proceed as follows: 

d 

Dn _ 

dx 
a 

YH _ 0 
doe 


_. dy 
and so y,,(x) satisfies 42 = 0. 


Given 
dy 
ae +y=x (19.9) 
(a) Show that y, = Acosx + Bsinx is a solution of the corresponding homogeneous 
equation. 


(b) Verify that yp = x is a particular integral. 
(c) Verify that y,, + yp does indeed satisfy the inhomogeneous equation. 


(a) Ify, = Acosx + Bsinx, then 
Yy = —Asinx+Bcosx yy = —Acosx— Bsinx 
We see immediately that y,, + y;; = 0 so that y,, is a solution of the homogeneous 
equation. 


(b) If yp = x then y, = 1 and y; = O. Substitution into the inhomogeneous equation 
shows that y, satisfies Equation (19.9), that is yp = x is a particular integral. 


(c) Writing 
y=Acosx+Bsinx+x 
we have 


” 


y = —Asinx+ Bcosx+1 y’ = —Acosx — Bsinx 
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Substitution into the 1.h.s. of Equation (19.9) gives 
(—A cosx — Bsinx) + (Acosx+ Bsinx + x) 


which equals x, and so the complementary function plus the particular integral is 
indeed a solution of the inhomogeneous equation, as required by Property 2. 


MET] «constant coeFFiciENT EQUATIONS 


We now proceed to study in detail those second-order linear equations which have con- 
stant coefficients. The general form of such an equation is 


dy dy 
a2 ors +cy = f(x) (19.10) 
where a, b, c are constants. The homogeneous form of Equation (19.10) is 
fe (19.11) 
dx? dx 


Equations of this form arise in the analysis of circuits. Consider the following example. 


Engineering application 19.5 


The LCR circuit 


Write down the differential equation governing the current flowing in the series LCR 
circuit shown in Figure 19.5. 


Solution 
Using Kirchhoff’s voltage law and the individual laws for each component we find 


di il ; 
1h spuikep = || ja = 0@) 


dt G 
If this equation is now differentiated w.r.t. t we find 
vi di 1. du(t) 
a ik= 45S = 


iL 
dr? dt C dt 
This is an inhomogeneous second-order differential equation, with the inhomogene- 


3 ais U Byte 
ity arising from the term ie When the circuit components L, R and C are constants 


we have what is termed a linear time-invariant system, and the differential equation 
then has constant coefficients. 


Figure 19.5 
v(t) An LCR circuit. 


19.6.1 


Example 19.17 


Solution 


Example 19.18 
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A linear time-invariant system has components whose properties do not vary with 
time and as such can be modelled by a linear constant coefficient differential 
equation. 


Finding the complementary function 


As stated in Property 2 (Section 19.5.2), finding the general solution of ay”+by’+cy = f 
is a two-stage process. The first task is to determine the complementary function. This 
is the general solution of the corresponding homogeneous equation, that is ay” + by’ + 
cy = 0. We now focus attention on the solution of such equations. 


Verify that y, = e* and y, = e* both satisfy the constant coefficient homogeneous 
equation 
Py dy 
— —6— + 8y=0 19.12 
dx? dx as ( ) 


Write down the general solution of this equation. 


If y, =e, differentiation yields 


d 
a 4e* 
dx 

and similarly, 
a . 
i = 166" 


Substitution into Equation (19.12) gives 
16e* — 6(4e*) + 8e* =0 


so that y, = e* is indeed a solution. Similarly if y, = e?*, then 


dy, 
de 


d 
oy2 = 2e™ and 
dx 


= 4e*, Substitution into Equation (19.12) gives 


4e* — 6(2e*) + 8e* =0 


so that y, = e** is also a solution of Equation (19.12). Now e” and e** are linearly 
independent functions. So, from Property 1 we have 


Yy(x) = Ae™ + Be” 


as the general solution of Equation (19.12). 


Find values of k so that y = e“ is a solution of 


dy dy 
— -—-——-6y=0 
de dx 


Hence state the general solution. 
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Solution 


Example 19.19 


Solution 


d 
As suggested we try a solution of the form y = e*. Differentiating we find *. = ke*™ 


and 7 = k’ e. Substitution into the given equation yields 
ke — ke —6e* =0 
that is, 
(P —k—6)e“ =0 
The only way this equation can be satisfied for all values of x is if 
a ar (19.13) 
that is, 
(k —3)(kK+2) =0 


so that k = 3 or k = —2. That is to say, if y = e* is to be a solution of the differential 
equation k must be either 3 or —2. We therefore have found two solutions 
y, (x) =e* and = yy (x) =e ™ 


These two functions are linearly independent and we can therefore apply Property | to 
give the general solution: 


Yyx) = Ae* + Be™ 


Equation (19.13) for determining k is called the auxiliary equation. 


Find the auxiliary equation of the differential equation 


dy dy 


: k dy k dy 2k ss 

We try a solution of the form y = e“ so that — = ke™ and cP i k* e™. Substitution 
into the given differential equation yields 

ak’ e* + bke™ +ce™ =0 
that is, 

(ak’ + bk +c)e* =0 
Since this equation is to be satisfied for all values of x, then 

ak + bk +c=0 


is the required auxiliary equation. 


Example 19.20 


Solution 
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ee é dy dy 
The auxiliary equation of a2 + Dae + cy = Ois 


ak? + bk +c=0 


Solution of this quadratic equation gives the values of k which we seek. Clearly the 
nature of the roots will depend upon the values of a, b and c. If b? > 4ac the roots will 
be real and distinct. The two values of k thus obtained, k, and k,, will allow us to write 
down two independent solutions: 

y; (x) = eit ¥ (x) = ekot 


and so the general solution of the differential equation will be 


y(x) = Ae“? + Beb* 


If the auxiliary equation has real, distinct roots k, and k,, the complementary func- 
tion will be 


y(x) = Aek* + Be&* 


On the other hand, if b> = 4ac the two roots of the auxiliary equation will be equal 
and this method will therefore only yield one independent solution. In this case, special 
treatment is required. If b? < 4ac the two roots of the auxiliary equation will be complex, 
that is k, and k, will be complex numbers. The procedure for dealing with such cases 
will become apparent in the following examples. 


Find the general solution of 


dy dy 
—+3— -10y=0 
de trax 


Find the particular solution which satisfies the conditions y(0) = 1 and y’(O) = 1. 


: kx dy kx dy 2 kx HB : 4 
By letting y = e™, so that 7 ke™ and a k*e™, the auxiliary equation is found 
to be 
+ 3k—10=0 
Therefore, 


(k —2)(kK+5) =0 


so that k = 2 and k = —5. Thus there exist two solutions, y, = e” and y, = e~>*. From 
Property | we can write the general solution as 


y=A e* + Be 


564 Chapter 19 Ordinary differential equations | 


Example 19.21 


Solution 


To find the particular solution we must now impose the given conditions: 


y(0) = 1 gives 1=A+B 
y (0) =1 gives 1 = 2A —5B 


6 1 
from which A = 7 and B = 7 Finally, the required particular solution is 


y= Ser + pe, 


Find the general solution of 


dy 
oo 


k dy ki dy 2k “1 8 
As before, let y = e** so that a ke™ and qe ke. The auxiliary equation is 


easily found to be 


P+4=0 
that is 
P= -4 
so that 
= +2) 


that is, we have complex roots. The two independent solutions of the equation are thus 
1ojSee «ad 4G)" 
so that the general solution can be written in the form 
oa 2jx —2jx 
y(x) =Ae"+Be 


However, in cases such as this, it is usual to rewrite the solution in the following way. 
Recall from Chapter 9 that Euler’s relations give 


2j 


e* = cos 2x + j sin 2x 
and 

e 7 — cos 2x — j sin 2x 
so that 


y(x) = A(cos 2x + j sin 2x) + B(cos 2x — j sin 2x) 
If we now relabel the constants such that 

A+B=C and Aj —Bj =D 
we can write the general solution in the form 


y(x) = Ccos 2x + Dsin 2x 
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Engineering application 19.6 


Oscillating mass—spring system 


Consider the simple mechanical problem of a mass resting on a smooth frictionless 
table. A spring is attached to the mass and an adjacent anchor point as shown in 
Figure 19.6. The spring is capable of being compressed as well as stretched. When 
the spring is neither compressed nor stretched the mass is located at x = 0. 


x Figure 19.6 
0 Mass-spring system. 


If the mass is pulled in the x direction and let go, it will oscillate about the x = 0 
position. We wish to find the position of the mass, x, as a function of time, t. Differ- 
ential equations are needed to describe this problem fully. 

Newton’s second law states that if a force F is applied to a body of mass m then 
the motion of the body is governed by F = ma, where a is the acceleration. Applying 


: x i 
Newton’s second law, and noting that a = a’ we obtain 


The force, F', is provided by the spring. The force exerted by a spring is given by 
Hooke’s law, which states that the force is proportional to the extension or compres- 
sion of the spring, 


F = —kx 


where k is the spring constant for the spring in use. The minus sign is required so that 
when the spring is stretched (x > 0) the force is in the negative x direction. When 
the spring is compressed (x < 0) the force is in the positive x direction. 

If the table is sufficiently smooth and if there are no other external forces acting, 


56 
then Le = —kx. 


Therefore the differential equation that governs motion in the system is 


dx 
—+kx=0 
"ar a 
We write this as 
dys 
eA eS 1) 
dt? ot ane 


Let — = w”, giving 
m 


d’x > 
A 
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Using the technique illustrated in Example 19.21 we obtain 
x(t) = Acos at + Bsin wt 

where A, B are constants. Note this solution may also be expressed as a single wave 
x = Ccos(at + ¢) 


where C and ¢ are constants, using the technique described in Section 3.7.1, Com- 
bining waves. 

We note that this solution is sinusoidal and oscillates with time. It gives the po- 
sition of the mass at a given point in time. The constants C and @ depend on the 
position and velocity of the mass when it is released. These are known as the initial 
conditions of the differential equation (see Example 19.6 on page 539). It is intuitive 
that an equation that describes the position of the mass at a given time must take these 
into account. 

Mathematically describing or modelling systems like this one using differential 
equations is an extremely important discipline. Mathematical models of mechanical, 
electrical and other subsystems can be linked together and as a result whole systems 
can be accurately characterized. 


Example 19.22 Given ay’ + by’ +cy = 0, write down the auxiliary equation. If the roots of the auxiliary 
equation are complex and are denoted by 


k=a+fj k=a-§j 
show that the general solution is 


y(x) =e (Acos Bx + Bsin Bx) 


Solution Substitution of y = e“ into the differential equation yields 

(ak? + bk +c)e* =0 

and so 
ak’ + bk +c =0 

This is the auxiliary equation. If k, = a + Bj, k, = a — fj then the general solution is 
y= Ce@thix + De@e-hix 

where C and D are arbitrary constants. Using the laws of indices this is rewritten as 
y = Ce ef ® + De oF = ce (Ce* + De FR) 

Then, using Euler’s relations, we obtain 


y=e™(Ccos Bx + Cj sin Bx + Dcos Bx — Dj sin Bx) 
= e"{(C + D) cos Bx + (Cj — Dj) sin Bx} 


Example 19.23 


Solution 


Example 19.24 


Solution 
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Writing A = C + D and B = Cj — Dj, we find 
y=e™(Acos Bx + Bsin Bx) 


This is the required solution. 


If the auxiliary equation has complex roots a + /j and a — $j, then the complemen- 
tary function is 


y =e" (Acos Bx + Bsin Bx) 


Note that Example 19.21 is a special case of Example 19.22 witha = 0 and 6 = 2. 


Find the general solution of y” + 2y’ + 4y = 0. 


The auxiliary equation is k? + 2k + 4 = 0. This equation has complex roots given by 


_ —2+/4- 16 
Z 
—2+ V12j 
~ 3 
=-I+ V3j 
Using the result of Example 19.22 with a = —1 and B = V3 we find the general 


solution is 


y=e “(Acos J/3x + Bsin 13x) 


The auxiliary equation of ay” + by’ +cy = 0 is ak* +bk+c = 0. Suppose this equation 
has equal roots k = k,. Verify that y = xe“* is a solution of the differential equation. 


We have 
yaxe*® yoe*(1+kx) yy’ =eh*(kix+2k,) 
Substitution into the |.h.s. of the differential equation yields 
ei*{a(kjx + 2k,) + DU + kx) + cx} = e4*{(ak{ + bk, + c)x + 2ak, + b} 


But ak? + bk, +c = O since k, satisfies the auxiliary equation. Also, 


7 —b+ Jb’ — 4ac 
_ 2a 
P 4 b 
but since the roots are equal, then b“ — 4ac = 0 and hence k, = — an So 2ak, +b=0. 
a 


We conclude that y = xe“* is a solution of ay’ + by’ + cy = 0 when the roots of the 


auxiliary equation are equal. 
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If the auxiliary equation has two equal roots, k,, the complementary function is 


y =Ach* + Bret” 


Example 19.25 Obtain the general solution of the equation 


dy ody 
—++8—+4l6y=0 
dx? = dx =e 


Solution As before, a trial solution of the form y = e* yields an auxiliary equation: 
k + 8k+16=0 
This equation factorizes so that 
(k+4)(k+4) =0 


and we obtain equal roots, that is k = —4 (twice). If we proceed as before, writing 
y,@) = e*, y,(xX) = e~**, it is clear that the two solutions are not independent. To 
apply Property 1 we need to find a second independent solution. Using the result of 
Example 19.24 we conclude that, because the roots of the auxiliary equation are equal, 
the second independent solution is y, = xe~*”. The general solution is then 


y(x) =Ae* + Bre *™ 


EXERCISES 19.6.1 


1 Obtain the general solutions, that is the (h) aig i" dy 4+5y=0 
complementary functions, of the following dt2 dt 
homogeneous equations: dy dy 
i) —+—-2y=0 
(i) iu 
dy dy 2 
(a) 3-32 42y=0 «dt 
dx dx GQ) a2 + 9y =0 
dy dy 2, 
b) —4+7= = d d 
(c) dx sdk 0 d2x 
c) —= — + 6x = = 
dt? dt (I) a 16x = 0 
(d) se + ped ty=0 2 Find the auxiliary equation for the differential 
dt dt equation 
dy dy ?i di 1 
— —-4— +4y=0 L—+R—+-=i= 
ae ae ea oe 
dy dy Hence write down the complementary function. 
(f) mr +—+8y=0 : ; : 
dt dt 3. Find the complementary function of the equation 
a d 2 
(2) 5 -2>+y=0 2 ® Ly=0 


dx? “dx de? * dx 


1 


Solutions 


(a) y=Ae*+ Be 
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(k) y=A+Be* 


(b) y=Ae* + Be~™ () x=Ae¥+Be“* 


(c) x= Ae + Be i 
(d) y=Ae* + Bre 2 LR+RK+ a=9  i@) =Aeki + Beh! 


(ce) y= Ae + Bre? 


Xx where 


(f) y =e~- (A cos 2.781 + B sin 2.781) _pq.|RCR4E 

(g) y=Ae* + Bre® kyyky = 1, <— 

(h) y =e 9 (A cos 2.18¢ + B sin 2.181) 

(i) y=Ae + Bet 3 eX? ¢ cos 3x + Bsin =) 
G) y =Acos3x + Bsin 3x 2 2 


19.6.2 


Example 19.26 


Solution 


Finding a particular integral 


We stated in Property 2 (Section 19.5.2) that the general solution of an inhomogeneous 
equation is the sum of the complementary function and a particular integral. We have 
seen how to find the complementary function in the case of a constant coefficient equa- 
tion. We shall now deal with the problem of finding a particular integral. Recall that 
the particular integral is any solution of the inhomogeneous equation. There are a num- 
ber of advanced techniques available for finding such solutions but these are beyond 
the scope of this book. We shall adopt a simpler strategy. Since any solution will do 
we shall try to find such a solution by a combination of educated guesswork and trial 
and error. 


Find the general solution of the equation 


— 6y = e* (19.14) 


The complementary function for this equation has already been shown in Example 19.18 
to be 


yy =A e* + Be* 


We shall attempt to find a solution of the inhomogeneous problem by trying a function 
of the same form as that on the r-h.s. In particular, let us try yp(x) = a e**, where @ is a 
d'yp _ 
dx? 


d 
constant that we shall now determine. If y,(x) = a e** then = = 2a e* and 
4a e**. Substitution in Equation (19.14) gives 


4a e** — 2a e* — 6a e** = e* 


that is, 


2x 


—4ae* =e 
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Example 19.27 


Solution 


di 
so that y, will be a solution if a is chosen so that —4a@ = 1, thatisa = — ra Therefore the 


2x 


particular integral is yp(x) = — oe From Property 2 the general solution of the inhomo- 


geneous equation is found by summing this particular integral and the complementary 
function 


1 
y(x) —=Ae* +Be* _ 7 et 


Obtain a particular integral of the equation 


dy dy 

oe a 

In the last example, we found that a fruitful approach was to assume a solution in the 

same form as that on the r.h.s. Suppose we assume a solution yp(x) = ax and proceed 

to determine a. This approach will actually fail, but let us see why. If yp(x) = ax then 

dyp Pyp 

— =a and — 

dx dx? 
0 — 6a + 8ax = x 


= 0. Substitution into the differential equation yields 


and a ought now to be chosen so that this expression is true for all x. If we equate the 


coefficients of x we find 8a = 1 so that a = —, but with this value of a the constant 


terms are inconsistent. Clearly a particular integral of the form ax is not possible. The 
problem arises because differentiation of the term wx produces constant terms which are 
unbalanced on the r.h.s. So, we try a solution of the form 


ype) =ax+ B 


d a 
with a, 6 constants. Proceeding as before, = = a, at = 0. Substitution in the 
differential equation now gives 


0 —6a + 8(ax+ 6B) =x 
Equating coefficients of x we find 

8a = 1 (19.15) 
and equating constant terms we find 


—6a +868 =0 (19.16) 


1 
From Equation (19.15), a = 8 and then from Equation (19.16) 


1 
-6(5) +86 =0 


so that 
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that is, 
3 
P= 39 
: ; : : x. 3 
The required particular integral is yp(x) = 8 + a 


Experience leads to the trial solutions suggested in Table 19.1. 


Table 19.1 

Trial solutions to find the particular 
integral. 

FQ) Trial solution 
constant constant 
polynomial in x polynomial in x 
of degree r of degree r 

cos kx acos kx + bsinkx 
sin kx acos kx + bsinkx 
ae a ek 


Example 19.28 Find a particular integral for the equation 


a d 
2 — 6, + 8y = 3cosx 


Solution We shall try a solution of the form 
yp(x) = acosx + B sinx 


Differentiating, we find 


dyp : 

— = —asinx + Pcosx 
dx e 

a 

at = —acosx — B sinx 


Substitution into the differential equation gives 


(—a cosx — B sinx) — 6(—a@ sinx + 6 cosx) + 8(acosx + f sinx) 


= 3cosx 
Equating coefficients of cos x we find 

—a — 66 + 8a = 3 (19.17) 
while those of sin x give 


_f + 6a +88 =0 (19.18) 
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21 18 
Solving Equations (19.17) and (19.18) simultaneously we find a = 85 and 6 = ~ 35° 
so that the particular integral is 


« = 2! 18. 
= — COsSx — — SINX 
PENS fag Sr 


Engineering application 19.7 


An LC circuit with sinusoidal input 


The differential equation governing the flow of current in a series LC circuit when 
subject to an applied voltage u(t) = Vp sin wt is 
I tr _ Vi t 
— + ~i=oV,cosw 
ie : 
Derive this equation and then obtain its general solution. 


Solution 
Kirchhoff’s voltage law and the component laws give 
oe aP = fie = V) sinat 
di -C ‘ 


To avoid processes of differentiation and integration in the same equation let us dif- 
ferentiate this equation w.r.t. t. This yields 
pe SF s = wV, cos wt 
ie e : 
as required. a 
The homogeneous equation is ipo ae uy 0. Letting i = e” we find the aux- 


ade  € 
1 j 
iliary equation is Lk? + — = 0 so that k = +——. Therefore, using the result of 
y eq C Tai g 
Example 19.22, with a = 0 and 6 = ——, the complementary function is 
p Wide p ary 
t t 
i = Acos —— + Bsin —— 
WUC VLC 
To find a particular integral, try i = E cos wt + F sin wt, where E and F are constants. 
We find 
di 5 
— = —-WE sinwt + wF cos wt 
dt 
di 2 Dra ret 
@ = —w Ecosat — w F sinwt 


Substitution into the inhomogeneous equation yields 
1 
L(—@°E cos ot — w°F sin wt) + ce cos wt + F sinwt) = V.wcos wt 
Equating coefficients of sin wt gives 


yee 
ae 
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Equating coefficients of cos wt gives 
We 
=O) —— Vo 
C 0 


cv 
so that F = O and E = eee, It follows that the particular integral is 
ae 1—@?LC 
ee Finally, the general solution is 
1—@°LC 
eon t SRA t ie COVE 
COS — sin 
NATO SEE 1—@?LC 


The terms zero-input response and zero-state response were introduced in Section 19.4 
in connection with first-order equations. This terminology is identical when we deal with 
second-order equations as the following example illustrates. 


Engineering application 19.8 


Parallel RLC circuit 


Figure 19.7 shows a parallel RLC circuit which has a current source i, (t). The induc- 
tor current, 7, can be found by solving the second-order differential equation 
eed: 
LC— + =— +i=i,(t fort > 0 
P ap Farr ae s(t) 2 
It would be a useful exercise for you to derive this equation. Note that this equation 
is second order, and inhomogeneous due to the source term i, (¢). 


wo 


Suppose L = 10 H,R = 10 Q2, C = 0.1 F and i, (t) = e ~ and that the initial 


Figure 19.7 
A parallel RLC circuit. 


conditions are i = 1 and = = 2 won = 0, 


(a) Obtain the solution of this equation subject to the given initial conditions and 
hence state the inductor current i. 


(b) Obtain the zero-input response. This is the solution when i, (t) = 0. 

(c) Obtain the zero-state response. This is the solution of the inhomogeneous equa- 
tion subject to the conditions i = 0 and a 0 at t = 0. It corresponds to there 
being no initial energy in the circuit. 


(d) Show that the solution in (a) is the sum of the zero-input response and the zero- 
state response. 


574 
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Solution 

(a) With the given parameter values the differential equation becomes 
dain nedic sepa, 
erin 4° 


It is first necessary to find the complementary function. Letting i = e"’, the aux- 
iliary equation is k? + k + 1 = 0 which has complex solutions 


1 3 
ee 
2 D 
The complementary function is therefore 
3 3 
i=e? (4 cos se + Bsin er) 


For a particular integral we try a solution of the form i = we”. Substitution 
into the differential equation gives 


Age =e” Lae? = e* 
so that 
1 
Bye Ile that is a, 


Hence a particular integral is i = i e~, The general solution is the sum of the 
complementary function and the particular integral: 
V3 Jee Il 

j= e/*| Acos —t + Bsin —t | + —~e” 

1=e (Acos Fr +-asin S GaNG 
We now apply the initial conditions to find the constants A and B. Given i = | 
when ¢t = 0 means 
2 


1 
1=A+- that A == 
Bee so that 3 


di 
To apply the second condition we need to find = 


a =e? ae sin ve + eee sy 
dt 2) D D» 2) 


1 3 3 2 
= eve (4 cos ae + Bsin er) == ema 


di 
Given ~ = 2 when t = 0 means 


1 2 
yi Ee ()— 
aD 2! 3 
I) ff DD 
pee - 
y) Das 3 
pee 
2; 
aoe 


19.6.3 


Example 19.29 


Solution 
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——— 


V3 
= 2/3 


Finally, the required particular solution which gives the current through the 
inductor at any time is 
1 


2) 3 3 
i= en(: cos ae + 2/3 sin 3: + 5 a 


(b) The zero-input response is obtained by ignoring the source term i,(¢). From 
(a) we see that this is just the complementary function: 


3 3 
i=er2 (4 cos a + Bsin 1) 


Applying the given initial conditions to this solution gives A = | and B = 


Sle 


and so the zero-input response is 
3 5 3 
i=e | cos oe, + — sin es 
2 IB 2 


(c) The zero-state response of the inhomogeneous equation is found by applying the 


conditions i = 0 and - = 0 at t = 0 to the general solution already obtained in 


1 I 
(a). It is straightforward to show that A = a5 and B = Wat So the zero-state 


response is 
1 1 1 
i=e’”( — —cos 8 + —sin sa ees 
3 2 V3 2 3 


(d) Inspection of the previous working shows that the particular solution obtained in 
(a) is the sum of the zero-state response and zero-input response. 


Inhomogeneous term appears in the complementary function 


In some examples, terms which form part of the complementary function also appear 
in the inhomogeneous term. This gives rise to an additional complication. Consider 
Example 19.29. 


Consider the equation y” — y’ — 6y = e*”. It is straightforward to show that the comple- 
mentary function is 


y=Ae*+Be* 
Find a particular integral and deduce the general solution. 


Suppose we try to find a particular integral by using a trial solution of the form 
yp = a e**. Substitution into the I.h.s. of the inhomogeneous equation yields 


9a e** — 3a e** — 6a e** which simplifies to 0 


so that a e** is clearly not a solution of the inhomogeneous equation. The reason is that 
e** is part of the complementary function and so causes the 1.h.s. to vanish. To obtain a 
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particular integral in such a case, we carry out the procedure required when the auxiliary 
equation has equal roots. That is, we try yp = axe”. We find 


y =ae*B3x+1)  y’ =ae*(9x4+6) 
Substitution into the inhomogeneous equation yields 
ae (9x + 6) — we" (3x + 1) — 6axe* = e* 


Most terms cancel, leaving 


3x 
so that a = 5" Finally, the required particular integral is yp = _ The general solution 
3x 


is then y = Ae* + Be + = 


Engineering application 19.9 


Transmission lines 


A transmission line is an arrangement of electrical conductors for transporting 
electromagnetic waves. Although this definition could be applied to most electrical 
cables, it is usually restricted to cables used to transport high-frequency electromag- 
netic waves. There are several different types of transmission lines. The most familiar 
one is the coaxial cable which is used to carry the signal from a television aerial to 
a television set (see Figure 19.8). When a high-frequency wave is being carried by a 
cable several effects become important which can usually be neglected when dealing 
with a low-frequency wave. These are: 


(1) the capacitance, C, between the two conductors, 
(2) the series inductance, L, of the two conductors, 
(3) the leakage current through the insulation layer that separates the two conductors. 


The electrical parameters of a coaxial cable are evenly distributed along its length. 
This is true for transmission lines in general and so it is usual to specify per unit length 
values for the parameters. When constructing a mathematical model of a transmis- 
sion line it is easier to think in terms of lumped components spanning a distance 6z 
and then allow 6z to tend to zero (see Figure 19.9). The leakage between the two 
conductors is conventionally modelled by a conductance, G, as this simplifies the 
mathematics and avoids confusion with the line resistance, R. Note that C, G, L and 
R are per unit length values for the transmission line. 


Outer conductor 


Inner conductor 


Protective coating Figure 19.8 


Insulator A coaxial cable. 


19.6 Constant coefficient equations 577 


For many transmission lines of interest the signal that is being carried varies sinu- 
soidally with time. Therefore the voltage and current depend on both position along 
the line, z, and time, t. However, it is common to separate the time dependence from 
the voltage and current expressions in order to simplify the analysis. It must be re- 
membered that any voltage and current variation with position has superimposed 
upon it a sinusoidal variation with time. Therefore, ignoring the time-dependent ele- 
ment we write the voltage as v and the current as i, knowing that they are functions 
of z. 


Figure 19.9 
A section of a transmission line. 


Consider the circuit of Figure 19.9 which represents a section of the transmission 
line of length 5z. Applying Kirchhoff’s voltage law to the circuit yields 


v+dvu—vu+y, +p =0 
OU = =U, = Ue 


where v, is the voltage across the inductor and vp is the voltage across the resistor. 
Using the individual component laws for the inductor and resistor gives 


Sv = —ij@Ldz — iRbz 
= —i(R + joL)dz 


Note that di has been ignored because it is small compared to i. Now consider the par- 
allel combination of the capacitor and resistor (with units of conductance). Applying 
Kirchhoff’s current law to this combination yields 


8i=igtig 


where i, is the current through the capacitor and i, is the current through the resistor. 
Using the individual component laws for the capacitor and resistor gives 


di = —vjwCdz — vGdz 


= —u(G+joC)6z 
Dividing these two circuit equations by 5z yields 
dv ; : 
SS SS SHR 45 oll) 
bz 
bi ; 
= —v(G+joC) 


3z 
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In order to model a continuous transmission line with evenly distributed parameters, 
6z 1s allowed to tend to zero. In the limit the two circuit equations become 


d 
Rok) (19.19) 
dz 
di : 
Te (Gee) (19.20) 
dz 
Differentiating Equation (19.19) yields 
dv mors di 
patel yb 
dz? Porade 


di 
Substituting for = from Equation (19.20) yields 
Z 


dv 


= = (R+joL)(G + joCyu 
dz 
This is usually written as 
fog 
= =y?v where y? = (R+joL)(G + joC) (19.21) 
Zz 


This is the differential equation that describes the variation of the voltage, v, with 
position, z, along the transmission line. The general solution of this equation is easily 
shown to be 


vSae M4u,e” (19.22) 


where v, and v, are constants that depend on the initial conditions for the transmission 
line. It is useful to write y = a + jf thus separating the real and imaginary parts of 
y. Equation (19.22) can then be written as 


v=v,e “et + y, e% elf (19.23) 


The quantity v, ee /** represents the forward wave on the transmission line. It 
consists of a decaying exponential multiplied by a sinusoidal term. The decaying 
exponential represents a gradual attenuation of the wave caused by losses as it travels 
along the transmission line. The quantity v, e% e!# represents the backward wave 
produced by reflection. Reflection occurs if the transmission line is not matched 
with its load. As the wave is travelling in the opposite direction to the forward wave, 
e™ still represents an attenuation but in this case an attenuation as z decreases. 

A lossless line is one in which the attenuation is negligible. This case corresponds 
toa = 0, andso y = jf. If y = jf then y? = —f? so that, from Equation (19.21), 
(R + jwL)(G + jwC) must be real and negative. We see that this is the case when 
R = O and G = 0. This agrees with what would be expected in practice as it is the 
resistive and conductive terms that lead to energy dissipation. 


Engineering application 19.10 


Voltage reflection coefficient 


Consider a lossless transmission line of the type already described in Engineering 
application 19.9. We know in general that the forward wave at position z is given by 
v,e * e 4", For the lossless line this simplifies to v,e/**. The reverse term, by the 
same reasoning, is v,e**. 
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A definition used regularly when analysing transmission lines is the voltage 
reflection coefficient. Usually this is denoted by and can be defined either at a 
specific position on a line or as a function of distance 

vn, y, 


p= =— 


z= e2)Bz 
ia < 
ve Vv, 


It is a dimensionless quantity and is the ratio of the reverse to forward wave compo- 
nents. 

In many systems encountered in radio-frequency (RF) engineering it is desirable 
to minimize the relative amplitude of the backward wave component and hence the 
reflection coefficient. An example of this can be found in the transmit circuit of a 
mobile handset where a transmission line carrying the signal is attached to an antenna. 
During transmission, forward waves propagate towards the antenna terminals. At this 
point they are either radiated from or dissipated within the antenna, or they reflect 
back. If reflected back they may return to the amplifier circuit which generated them, 
and are wasted in the form of heat energy. As a consequence, battery life can be 
reduced due to wasted power. Hence the minimization of the reflection coefficient, 
usually by carefully designing the antenna, at the working frequency of the handset 
is an important activity. 

For the antenna it is desirable for o(z) to be as small as possible and for v, > v3. 
Note that in a system such as this |o(z)| < 1 and that o(z) is in general a complex 
number. 

We now consider a transmission line of length ¢ with characteristic impedance 
Z, terminated at z = ¢ with a load having impedance Z,. The setup is shown in 
Figure 19.10. 


Figure 19.10 
~ Lossless transmission line with 
termination. 


N 
Il -- 
Oo 
N 
Il -- 
fo) 


The total voltage at the load end of the line, v,, is the sum of the forward and 
backward wave components at z = £, that is 


UL =U; eRe Uy elt 


Using the definition of the reflection coefficient at z = @, 

p(t) = es e2iBe 

Di 
we can rewrite the voltage at the load as 
a ele ws ig o 

UL — V, e ‘jBe a. p(t) "1 Spe = ve jBe ab p(lyv, e jBe = v\e iBT] a p(t)] 
The current at the load, 7, , can be found from first principles by a similar analysis to 
the voltage, but will be stated here for simplicity 


1 ; 
i ma e JFf _ y, elf) 
0 
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This equation can also be written in terms of the reflection coefficient 
vp, ee 
= 1— pe 
i Z, [1 — p(é)] 
The voltage and current at the load are related by the simple equation 


Zeal 
Oe 
and hence 
ve P11 + p()] [1 + p(é)] 
ne v, e 8 - “or — p(é)] 
a) 
0 
It can be shown by further manipulation that 
= LE, 
p(t) = 7——> 
Zr, + Zo 


The reflection coefficient at the load is therefore dependent only on the characteristic 
impedance of the line, which is usually known, and the load impedance. Some RF 
measuring instruments such as network analysers can measure the amount of forward 
and backward waves. From this they are able to determine the load impedance. 


Engineering application 19.11 


Standing waves on transmission lines 


The differential equations that model voltage and current waves on transmission lines 
also describe the presence of standing waves. A standing wave is so called because 
it appears to remain stationary in space. Standing waves are an effect caused by the 
interference pattern created when two waves propagate in opposite directions in the 
same transmission medium. 

The previous example suggested that minimizing backward waves is often desired. 
In this case the standing wave component on the line is also minimized. In order to 
study this in more detail we wish to plot the voltage standing wave pattern for a given 
load impedance, Z, . 

Recall from Equation (19.23) (with a = 0 for a lossless line) that the voltage at 
any point z on the line is given by 


v(z) =, eRe U5 lB: 
This voltage is the sum of the forward and reflected waves. The modulus of this 
function can be plotted with z as the independent variable. Once the amplitude v, is 


known, then the amplitude v, can be determined using the reflection coefficient at 
the load 


p(t) = 2 eh 
1 


from which v, = v,p(£) e 7° so that 


v(z) =v, e + v, p(e) e 7! elf: 
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which simplifies to 
v(z) = v,e #1 + peje? | 


The modulus of v(z) is now plotted against z for several different values of the 
reflection coefficient at the load p(£). Consider a transmission line of total length 
£ = 10 m. Another quantity we need to know in order to plot the voltage on the line 
is 6, which is the phase constant. For a transmission line of given construction and 
at a particular frequency, # is constant and represents the phase change per metre of 
transmission line. Here we take 6 = 7/2 rad m7!. 


Case 1: p(£) = -1 


Consider the case when the transmission line is terminated in a short circuit. The 
reflection coefficient at the load is 
Z, —Z, 0-Z 
pf) = 22 = 2 = 
J =p ify Wap Ze 
Taking the amplitude v, = 1, which represents a 1 V peak sine wave, and plotting 
the modulus of u(z), gives the result shown in Figure 19.11. 


= 
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Figure 19.11 


Voltage standing wave pattern for a short-circuit load with a 1 V input wave. 


Notice that the voltage peak on the line is 2 V whereas the voltage input was only 
1 V. This is due to the forward-going | V input wave reaching the end of the line and 
reflecting back upon itself. At some values of z it constructively interferes with itself 
giving double the input; at others it destructively interferes giving zero volts. 


Case 2: p(f)=1 


A similar effect is seen for an open-circuit load. Here by considering Z, — oo, p(¢) 
can be shown to equal +1, giving rise to the standing wave pattern shown in 
Figure 19.12. Note here that the voltage maximum is at the load, unlike the case 
of the short circuit where the minimum was at the load. 


lu(z)| A 
a 


Figure 19.12 
Voltage standing wave pattern for an open-circuit load with a 1 V input wave. 
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Case 3: p(f) = 0 
Consider now the case in which the load impedance is equal to the characteristic 
impedance, that is 

Gy, = Xe 
This time, p(€) = 0 and hence v(z) = v, e !**. The magnitude of this complex 
exponential is shown plotted in Figure 19.13 and is a horizontal straight line with 


|v(z)| = 1 for all values of z. 
lv(z)| A 
2) [_ 
1 
| ! | ! —— 
0 2 4 6 8 IO 2 
Figure 19.13 


Voltage on a transmission line with a reflection coefficient of p(€) = 0. 


No standing wave component is present and the line is said to be matched. This 
is the ideal case for power transfer because it represents the case in which all of the 
input wave is delivered to the load and nothing is reflected back. 


Case 4: a general case 


Often it is difficult to arrange for a perfectly matched transmission line. More gener- 
ally the peak voltage on the line appears like the one shown in Figure 19.14. Here an 
assumed reflection coefficient of o(£) = 0.187 — j 0.015 is used. 


lv(z)| 4 
aL 
ee ON Ee 
0 2 4 6 8 10 = 
Figure 19.14 


Voltage on a transmission line with a reflection coefficient of p(¢) = 0.187 — j 0.015. 


The standing wave component is still present although it is of smaller magnitude 
than seen for the open- or short-circuit loads. Most of the input wave is transferred 
to the load but a proportion is reflected back and interferes. 

The standing wave pattern of a transmission line can be measured by a special 
piece of apparatus known as a slotted line. The device consists of a length of trans- 
mission line with a slot into which a small probe is inserted. The probe is used to find 
the voltage as a function of distance as it is moved along the line towards an unknown 
load. The equations presented above can be used with the results of the slotted line 
experiment to determine the reflection coefficient of the load. 


EXERCISES 19.6.3 


1. Find the general solution of the following equations: 


(a) d2x 5k Geel 
a) — -2— -3x= 
dt? dt 
@y dy 
2 45 fdas 
(b) G2 ag 
@y dy 
Ee dS a iy Oe 
(c) dt? dt #ey 
(d) dfx 11% 4.30% = 8 
dr2 dt = 
a d 
(e) a +24, +3y = 2sindx 
dy dy 
(f) qe a te et 
da 
(@) 55 ty =4e 
dx 


= 6t 
(h) ae —16x=9e 
2. Find a particular integral for the equation 


La ae ee 
i - a 
3 Find a particular integral for the equation 
d2 
gs x=4e 7 
dt? 
4 Obtain the general solution of 
y’ —y —2y =6. 
5 Obtain the general solution of the equation 


CF 5g) apy aang 
— = — COS 2X 
Ge i = 


Solutions 


1 (a) x=Ae*+Be* —2 
(b) y=Ae*+Be%42 
t BS 
—A —2t41 B ae ee 
(c) y e e 3 18 
(d) x =Ae~™ + Be~™ + 0.2671 — 0.0978 


8 
(e) y=e [Asin 2x + Bcos 2x] — 77 %°8 Di 


2. 
— sin 2x 
17 
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Find the particular solution satisfying 
(0) =1,2@ =0 
yO) = 1, 5 (0) =0. 


Find a particular integral for the equation 


Find the general solution of 


(a) = sp a oa ='3 
dt2 dt 

(b) dix OF geese 
dr2 dt 


For the circuit shown in Figure 19.15 show that 


IfL = 1mH,R = 10 Q,C = 1 pFand 
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E(t) = 2:sin 1007tt, find the complementary function. 


Find the general solution of 


Be aps ag 3t — 16 sin 3¢ 
a 7 i = 48 cos sin 


E(t) 
Figure 19.15 


(f) y=e-°(A cos 0.866t + B sin 0.8661) — 
0.438 cos 3t + 0.164 sin 3t 


(g) y=Acos3x+ Bsin 3x + 0.0548 e8* 


9 
h -A At Be 6t 
(h) x ev + Be +55 © 


x=25e% 
4 

gee e 2! 
3 
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4 


5 


6 


Ae* + Be* —3 


6 1 
Ae +Be*4+ 5 sin 2x — . cos 2x, 


3 
et 4 
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3 
T @) Aci+Be% += 


1 
(b) Ae’ + Bre + st ef 


+ — sin 2x 
2 2 


x 
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Example 19.30 


Solution 


1 
5 cos 2x BY Ac —!1270t 4B e—88730r 


14sin 3t + 18 cos 3¢ 
13 


9 e “(Asin 3t + Bcos3rt) 4 


SERIES SOLUTION OF DIFFERENTIAL EQUATIONS 


We now introduce a technique for finding solutions of differential equations that can be 
lo. @) 
expressed in the form of a power series y = > A,X" = a) +a,x+ a,x +.... Here, 


m=0 
An, m =0,1,2,3... are constants whose values need to be found. 


With the given power series form of y, we can differentiate term by term to obtain 
2 


power series for ;. and a By substituting these series into certain classes of differ- 


ential equation we can determine the constants a,,, form = 0, 1, 2,3... and thus obtain 
the power series solution for y. This technique of representing a solution in the form of 
a power series paves the way for the introduction of an important family of differential 
equations, known as Bessel’s equations, which can be solved by this method. The power 
series solutions so formed are known as Bessel functions which will be introduced in 
Section 19.8. 


Consider the following example: 


2 
The general solution of the differential equation 2 + y = 0 was shown in Example 


19.16 to be 
y =Acosx+ Bsinx 
where A and B are arbitrary constants. Obtain this result by looking for a solution of the 


equation in the form of a power series y = > ax". 


m=0 
We let 


lo,e) 
_ m __ 2 3 4 | 
y= AyX = Ay FAX + 4,X + 4,X + 44x + 45x +... 


m=0 


from which, by differentiating with respect to x, 


d lo. e) 
- = a ma,,x"| =a, +2a,x+ cri + 4a,x° + san" acy 
m=0 
and 
dy ES 9: 2 3 
a= Sm = 1)ma,,x"-? = 2a, + (2)(3)agx + (3) (A)agx? + (4) (S)a5x° +... 


m=0 
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2 
These expressions for y and a are substituted into the given differential equation 
d’y 
a2 +y = 0. Thus 
(2a, + (2)(3)a,x + (3) (4)ayx” + (4) (S)asx? +...) + 
(dy + a,x + a,x +4,0 + a4x4 +a +...) =0. 


We now use the technique of equating coefficients. This was first introduced in 
Section 1.7 Partial fractions. Equating the constant terms: 


2a,+d)=0 sothat a,= 540 


Equating coefficients of x: 


1 
(2)(3)a, +a,=0  sothat a, = ————_a 
3 1 3 (2)(3) 1 
Equating coefficients of x: 
1 1 


(3) (4)a, +f a, = 0 so that a,= 


= AQ 
(3)(4) (2)(3) (4) 
Equating coefficients of x*: 
1 1 
a= A 
(4) (5) (2)(3)(4)(S) 
Note that by using factorial notation we can express these coefficients concisely as: 
1 1 1 1 


— a, = —-—a a,= —a a. = —a,. 
a Cn | iw. 


By equating higher order terms in the same way further coefficients a,, can be determined 
CO 


(4) (5)as +4, = 0 so that a, = 


a, = — 


and the power series y = .S a,x" can then be written out explicitly. Observe that all 


m=0 
coefficients a,, are multiples of ag when m is even, and multiples of a; when m is odd. 


m 


We can therefore collect terms together as follows: 


oe) 
y= oe AiyX 


m=0 
= 2 3 A 5 
= Uy FAX + 4,X +.43,X +44xX" + 45x +... 
1 


1 2 go db 1 5 
= da) + a,x 5) “0% aici + 740%" Tea + 
1 1 1 1 , 
=a — Tl real —...) +a, (x- aoe sea —...) (19.24) 


Observe that the series in the brackets in eae 19.24 are the power series expansions 
of cos x and sin x as discussed in Section 18.6. Hence we can write the solution as 


y = ay COS x + a, SInx. 


We see that this is simply the general solution of the differential equation as given 
in the question but where the coefficients a) and a, are the arbitrary constants. Had 
we not recognised the series expansions for the sine and cosine functions we would 
still have obtained the general solution as expressed in the form of the power series in 
Equation 19.24. 
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This method of solution is widely applicable and involves the following steps: 


e given a differential equation, we seek a solution in the form of the power series 


CO 
y= aya 


m=0 
e this power series is substituted into the differential equation 


e the coefficients a,, are then found by the technique of equating coefficients. 


Once the values of a,, are found, the solution follows in the form of a power series. 
Some differential equations and their solutions have such importance in engineering 
and scientific applications that the series obtained are used to introduce new functions 
in much the same way as the sine and cosine functions appeared in Equation 19.24. 
We shall see in the following section how an important class of functions known as 
Bessel functions arises in this way. There is of course the question of whether any power 
series expansion converges but such discussion is beyond the scope of the introductory 
treatment given here. 


2 
1 The general solution of the equation cat —y=O0is is an arbitrary constant. Obtain the same result by 
; dx seeking a power series solution of the differential 
y = Ae* + Be™ where A and B are arbitrary constants. oo 
Obtain this result by looking for a solution of the equation in the form y = > A,X". 
Sa m=0 
equation in the form of a power series y = a Ay) X". ; ; 
=o 4 Obtain the first four non-zero terms in the power 
series solution of the initial value problem 

2. The general solution of the equation dy 

— = an + 2y = Ois y = Ae* + Be where A and 

dx dx 5 Obtain the power series solution of the equation 

B are arbitrary constants. Obtain this result by looking dy 

for a solution of the equation in the form of a power a2 + xy = 0, up to terms involving x’. This 

Co 
scncea— y gait differential equation is known as an Airy equation. 
Y m 
m=0 
3 Verify using the method of separation of variables that 
d 2 
the general solution of 7 = 2xy is y = Ke~ where K 
Solutions 
2 A 6 
ae 5 3 6 
Doin ef er (eee ac 
2" 8 48 : co( 6 180°" 
ar = ae = 
ce ee ee” anak 
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Example 19.31 


Solution 
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BESSEL’S EQUATION AND BESSEL FUNCTIONS 


A linear differential equation that has important applications in the theory of wave prop- 
agation, particularly when using cylindrical or spherical polar coordinates, is known as 
Bessel’s equation. In fact the term Bessel’s equation represents a whole family of equa- 
tions having the form 
2 
oS $22 + (7 —v’)y =0. 

Here x and y are the independent variable and dependent variable respectively. The third 
quantity v, termed a parameter, is known as the order of the equation.' As we vary the 
parameter v we obtain the family of differential equations. Note that v can be any real 
number. In Sections 19.8.1 and 19.8.2 we will solve Bessel’s equation when v = 0 and 
v = |. In the engineering applications which follow later in the chapter you will see that 
v may be other integers including those that are negative. Observe that the equation is 


linear and homogeneous, but it does not have constant coefficients: the coefficient of — 


is x’, a function of the independent variable. In what follows we show that solutions of 
Bessel’s equation can be found in the form of infinite power series which we call Bessel 
functions. 

Bessel functions have many applications in physics and engineering. In electrical 
engineering they are commonly used in the field of communications. They are important 
in finding the bandwidth and frequency content of some radio signals. They are also 
essential in the analysis of propagating modes in cylindrical waveguides and cylindrical 
cavity resonators, both of which are important devices used in high frequency microwave 
engineering (see Engineering application 19.12). Bessel functions are also utilised in the 
design of filters. 


Bessel’s equation of order zero 


To introduce the solution of Bessel’s equation and thereby introduce a Bessel function 
the following example deals with the special case when v = 0. 


foe) 
By seeking a power series solution in the form Ss a,,X" obtain the first four non-zero 


m=0 
terms in a power series solution of Bessel’s equation in the special case when v = 0. 


When v = 0 Bessel’s equation is: 


dy dy» 
ae + rr +xy=0. 
which for non-zero x becomes 
dy dy 


'Note that in this context the word ‘order’ means something different from the order of a differ- 
ential equation, which was defined in 19.2.1. 
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As was done in Section 19.7 we seek a power series solution in the form 
ioe) 
y= as =d),+a,x+ a,x” + Cae + ax + ie = eee 
m=0 
The strategy is to substitute y, and its first and second derivatives into Equation 19.25 


and choose values a,, so that the equation is satisfied. y and its first two derivatives can 
be written: 


y= Ay + a,x + a,x +a,x +a," + asx ee 
dy 


ay = a, + 2a,x+ a,x" + 4a,x° + Sax" +... 
dy 2 3 
qe = 2a, + 6a,x + 12a,x° + 20a5x° +... 
ad? d 
Substitution into s + +. + xy = 0 gives 


(2a,x + 6a3x" + 12a,x° + 20a,x* 2+ (a, + 2a,x + 5a, + 4a,x? i : 


+ (dox + ei + a,x + a,x" + ar aw) =O 


We now equate coefficients on both sides of this equation starting with the constant 
term: 


a, =0. 
Equating coefficients of x: 
4a, + dy = 0 sothat a,= ~7A 
Equating coefficients of x: 
9a, +a, =0 sothat a,;= 641 =0 since a, = 0. 


Equating coefficients of x*: 


1 1 1 1 
16a, +a, =0 sothat a,= 6” = ( a) ( i) dy = ate 


It is easy to verify that all subsequent terms a 


1 
ify, in a similar way, that ag = — 7304 00° When m is even, all a,, are multiples of a). We 


can now write down a power series that satisfies Bessel’s differential 
Equation 19.25: 


when mm is odd, are zero. You should ver- 


m? 


ee er ee ee (19.26) 
y= ax + a — a3047% T° . 


We have succeeded in obtaining a power series solution of Bessel’s Equation 19.25 
where a, is an arbitrary constant. 


As we have already noted, a, is an arbitrary constant. However, historically its value 
is chosen to be | in Equation 19.26. The resulting power series is used to define a 


19.8.2 
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Jo (x) 1 


\ \ J 20\J 307 
Figure 19.16 


The Bessel function of the first kind of order 
0.4 zero, Jo (x). 


function called a Bessel function of the first kind of order zero that is convention- 
ally denoted J, (x). It is possible to show that it can be expressed concisely using sigma 
notation as 


a —~ |)" 2m 
h@=>- ae (19.27) 
m=0 


You should verify this by writing out the first few terms of this infinite series. Note that 
this method of solving the differential equation produces a power series solution and is 
used to introduce a new function, J, (x). It is not practical to try to draw a graph of J, (x) 
by hand but this can be obtained readily using a technical computing language such as 
MATLAB®. Figure 19.16 shows a graph obtained in this way. Observe that J, has some 
similar characteristics to a cosine wave: its value when x = 0, J)(0), equals 1, and the 
function oscillates, albeit with decreasing amplitude. The values of x where the function 
crosses the horizonal axis are known as zeros of the Bessel function and these values 
arise in important applications as we shall see in Engineering application 19.12: The 
modes of a cylindrical microwave cavity. 

When a specific value of a Bessel function is required it is usual practice to look this 
up rather than work directly with the power series definition. Both printed tables and 
computer packages are available for this purpose. 

Recall from Section 19.5 that the general solution of a second order linear equa- 
tion requires two independent solutions. The function Jj(x) is one such solution. An- 
other independent solution is called a Bessel function of the second kind of order 
zero and is denoted by Y)(x). Derivation of this function is beyond the scope of this 
book but, like J,(x), values of Y)(x) can be obtained from tables and by using com- 
puter software. The general solution of Bessel’s equation of order zero can then be 
written 


y = AJo(x) + BY) (x) 


where A and B are arbitrary constants. 


Bessel’s equation of order one 


Here we adopt the previous approach to obtain a solution of Bessel’s equation of order 
one, and thereby introduce the Bessel function of the first kind of order one. 
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[oe 
Example 19.32 By seeking a power series solution in the form > a,x" obtain the first three non-zero 


Solution 


m 
m=0 
terms in the solution of Bessel’s equation in the case when v = 1: 


@y d 
oe + x= +02 —1)y=0. (19.28) 


As before we seek a power series solution in the form 


lo,<) 
y= 2 A,X" = Ay ta,X+ 0X +440 +a, +a 5x... 
m=0 
The strategy is to substitute y, and its first and second derivatives into Equation 19.28 
and choose values a,, so that the equation is satisfied. y and its first two derivatives can 
be written: 


2 3 A 5 
YHA +t aX+ A,X + A,X + 44x + 45x... 


d 

; =a, + 2a,x + Sa5x° + 4a,x° + Sau a 
dy 2 3 
a2 = 2a, + 6a,x + 12a,x° + 20a5x°... 


2 


ea rene 7 dy dy 7 es 
ubstitution into x de + rms + @ — l)y = 0 gives 


2a" + 6a3x° + 12a,x* + 20a,x° otae (a,x + 2a3a" + 3a,x° + 4a,x" + Saxe 3 .) 
+ (x —1)(@) t+ayx+ aise + a,x + a,x" + ix ...) =0 
Removing the brackets 
2,0" + 6a3x° + 12a,x* + 20a;x° ee (a,x + Jak + 3a,x° + 4a,x* + Saxe pe .) 
+a x" + ae = a,x" “Ir a,x lr a,x° =r tik si 


(dy tayx+ yx” + a,x + a,x" + as ...) =0 


We now equate coefficients on both sides of this equation starting with the constant 
term: 


—d) = 0 sothat a) =0. 
Equating coefficients of x: 
a,—a,=0 so that a, is arbitrary. 
Equating coefficients of x7: 
2d, +2da,+d)—d,=0 sothat 3a, = —a,, from which a, = 0. 


Equating coefficients of x*: 


1 
6a; + 3a, +a, —a,=0 sothat 8a, = —a,, from which a, = — ga: 


It is easy to verify that all subsequent terms a,, when m is even, are zero. Equating 
coefficients of x°: 


20a; + 5a; + a; — a; = 0 sothat 24a; = —a, 
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from which 


1 1 1 


~ 74% = joo"! using a, >= ogo 


Note that all non-zero coefficients are multiples of a,. We can now write down the first 
three non-zero terms in the power series solution: 


a; = 


= appa (19.29) 
a ea we : 
Here a, is an arbitrary constant. We have succeeded in obtaining a power series solution 
of Bessel’s Equation 19.28. 


Although a, is an arbitrary constant, historically its value is chosen to equal . in 
Equation 19.29 and the resulting power series is used to define a function called a Bessel 
function of the first kind of order one that is conventionally denoted J, (x). It is possi- 
ble to show that it can be expressed concisely using sigma notation as 


i 3 Spe (19.30) 
re £4 22+ Nm + VY 
m=0 


You should verify this by writing out the first few terms of this infinite series. Fig- 
ure 19.17 shows a graph of J, (x) obtained using computer software. Observe that the 
graph is similar to that of the sine function but has decreasing amplitude. Here, 
J,(0) =0. 


JX) 


0.5 


Figure 19.17 
The Bessel function of the first kind of 
order 1, J; (x). 


A second independent solution is called a Bessel function of the second kind of 
order one and is denoted by Y, (x). Derivation of this function is beyond the scope of this 
book but, like J, (x), values of Y; (x) can be obtained from tables and by using computer 
software. The general solution of Bessel’s equation of order one can then be written 


y =AJ, (x) + BY, (x) 


where A and B are arbitrary constants. 
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EXERCISES 19.8.2 


1 Write out explicitly the first five terms in the series 
expansion of J, (x) and the first four terms in the 
expansion of J, (x). By differentiating Jo (x) verify 


(i) By equating coefficients of x” show that r 
satisfies the indicial equation ;7 — 1 = 0 and 
hence deduce r = +1. 


that 1, (x)) = —J, (x). (ii) Continue the solution for the case r = | to 


2 (Method of Frobenius) Substitute a power series in 


[o.¢) 
the more general form y = > AX 


m=0 
be determined, into Bessel’s equation of order 1: 


x ay + Pl 
dx? dx 


19.8.3 


obtain a series solution of the differential 
equation. Find the first 3 non-zero terms. 


m+T where r is to (iii) Confirm that the solution is equivalent to that 
obtained previously in Equation 19.29. 


+07? —Dy=0. 


Bessel’s equation and Bessel functions of higher order 


To cope with more general cases, including those of a non-integer order, the usual prac- 
lo.e) 


m 


tice is to seek solutions in the form ) a,x"*" where r is an unknown real number 


m=0 
that is chosen during the course of the calculation in order to generate a solution (see 


Exercise 19.8.2, question 2). For details on how to proceed in such cases you should 
consult an advanced text on differential equations. 

When the order, v, is a positive integer, it is conventional to label it as n instead. A 
solution of Bessel’s equation of positive integer order n is a Bessel function of the first 
kind of order n, denoted J, (x), and can be shown to be 


eS (- 1 yin 2nrtn 


It) =) area forn =0,1,2,... 


m=0 


Graphs of these functions for n = 0, 1, 2, 3, 4 are shown in Figure 19.18. 

Whilst to this point we have only discussed cases where the order is non-negative, 
in fact Bessel functions are defined for any order v, —oo < v < oo. Later, in Engi- 
neering application 19.13 we shall make use of Bessel functions with negative integer 
order. In such cases, careful inspection of the above definition shows that it will fail. 
When 7 is negative, then m + n will be negative for some values of m and in these 
case (m + n)! is not defined. For a complete treatment, and one which is beyond the 
scope of this book, it is necessary to define a new function - the gamma function I(x) 
- which can be thought of as a generalisation of factorials to deal with negative argu- 
ments. Nevertheless, this is possible and it can be shown that Bessel functions with 
negative integer order are related to their counterparts with positive integer order by the 
result 


J_,(%) = (-1)"J,@). 


(See Exercise 19.8.3) 
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Figure 19.18 
Bessel functions J, (x), forn =0...4. 


Engineering application 19.12 


Modes of a cylindrical microwave cavity 


Circular cross-section waveguides and cylindrical microwave cavities are both com- 
ponents that are found in high power microwave systems such as radar transmitters. 
Bessel functions are used in the mathematical description of electromagnetic wave 
behaviour in such devices. By way of example, consider the microwave cavity de- 
picted in Figure 19.19. 


Figure 19.19 
A cylindrical microwave cavity. 
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The walls of microwave cavities are usually made of metals with good conductiv- 
ity such as copper, steel or aluminium but in some applications the internal surfaces 
are plated with precious metals such as gold to prevent oxidation which would reduce 
the surface conductivity. When used at high power levels the conductivity is impor- 
tant. Low conductivity can lead to loss of power and heating of the cavity due to the 
currents flowing on the internal walls. The space inside the cavity can be empty or 
it can contain a dielectric insulating material such as PTFE. The analysis here will 
consider an empty cavity. 

The purpose of the device is to act as a narrow band filter which selects a spe- 
cific frequency. In fact, there is a set of frequencies at which the microwave cavity 
will work, known as the resonant frequencies. The electromagnetic waves inside 
the cavity are governed by the propagation modes within the cylinder. These modes 
are described mathematically by Bessel functions. It can be shown that the resonant 
frequencies, f,,,,,,, associated with the transverse magnetic modes of the cavity are 
given by 


oom) HE 


where m, n and p are mode index numbers, 0, 1, 2, ... and X,,,, denotes the n-th zero 
of the Bessel function of the first kind of order m, J, (x), where c is the speed of wave 
propagation in the medium. 

The zeros can be observed in Figure 19.18 as the points where the graph crosses 
the x axis. Usually software such as MATLAB® has the capability to calculate numer- 
ically the zeros of a function. If we observe that the first zero of Jy (x) is approximately 
x = 2.5 (see Figure 19.18) we can type the following at the MATLAB® command 


prompt: 


X = fzero(@(x)besselj(0,x) ,2.5) 


which gives the result X = 2.4048. 
The second zero of Jj (x) is close to 5.5, thus we type, 


X = fzero(@(x)besselj(0,x) ,5.5) 


which gives the result X = 5.5201. 
The first zero of J, (x) is approximately 4, so, 


X = fzero(@(x)besselj(1,x) ,4) 


which gives the result X = 3.8317. 

Table 19.2 shows a summary of the results of applying this method for m = 0, 1, 
and 2. 

Consider a microwave cavity resonator made from an aluminium fizzy drinks can 
with dimensions r = 33 mm and & = 120 mm. The transverse magnetic modes are 
labelled TM,,,,,, and the first commonly used mode is TM, ,. The resonant frequency 
would be 


1 


tory treatment. 


Table 19.2 
Bessel function zeros for a microwave 
cavity. 
m Xml Xnd m3 
0 2.4048 5.5201 8.6537 
1 3.8317 7.0156 10.173 
2 5.1356 8.4172 11.620 
3108 17240487 (xa) 
— + = 2 10G 
ou Qn /( 0.033 ) (Ss) 
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Hz 


A different set of modes known as TE modes may also be calculated using Bessel 
functions. Rather than using zeros of the Bessel functions directly it is necessary to 
use the derivatives of the Bessel functions, a topic beyond the scope of this introduc- 
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EXERCISES 19.8.3 


We have seen that for positive integer order the Bessel 
functions of the first kind are defined by 


J,@) = 


[oe] 


» 


m=0 


for 


(-1)"24" 
22m+nm\(m +n)! ? 
Nj] OV Qy ic 


(a) Now suppose that we are interested in Bessel 
functions with negative integer order. Replace n by 
—n in this definition to show that 


1) Denia 


oo (—1)™x2—n 


m=0 


withn = 1,2,... 


(b) Now make the assumption that (m — n)!, which 
appears in the denominator of each term, becomes 


infinite when m = 0, 


to prove this using properties of the Gamma 


1,2,...,2— 1. It is possible 


function but is beyond the scope of this book. With 
this assumption show that 


[oe] 


J_,() = by 22m—nm\(m —n)! i 


m=n 


(-1 )iiy2m—n 


withn = 1,2,... 


m = s +n, show that 


= 1)°x 2s-+n 
_ n(x) = (-1)" > al 
for n=0,1,2,... 


19.8.4 Bessel functions and the generating function 


and hence deduce J_,, (x) = (—1)"J,, (x). 


(c) By introducing a new dummy variable, s, where 


Several important properties of Bessel functions used in physics and engineering appli- 
cations can be deduced from knowledge of the so-called generating function. In this 
section we introduce this generating function and use it to derive the Jacobi-Anger iden- 
tities which we then employ in Engineering application 19.13. 

Firstly, recall the power series expansion of e* introduced in Section 6.5. 


falte+S “a 


ee. et 


3) 


a eae 


fore) 
Sw 
n!} 
n=0 


(19.31) 
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Recall also that there is nothing special about labelling the dummy variable n. Any other 
letter could have been used and the statement of the expansion would be equivalent. 


t 
Now replace x firstly by e and then by a in Equation 19.31 to produce the fol- 
lowing expansions where we have used r and s as the dummy variables: 


and 


Multiplying these two expansions together we find 


ts x 1 pxtyr7 Sl x\s 
e?e x7 = ei-7) => ( ) ( ) 
2 r! \2 se s! 2t 


r=0 s=0 


or, more explicitly, 


sa-b | 4 22 ee x44 
er = 1+ a9 + aia + 3193 T aga T°] * 


Xx x x x4 
1 | + iss 
1)2t © 212272 312373 © 412474 


(19.32) 


Now imagine multiplying out the brackets on the right and observe that the resulting 
expression can be considered as the sum or difference of terms involving powers of f, 
that is, t, 1?,...2"... and t7!,t-*,...17",.... In fact we can express Equation 19.32 
concisely as 


e-D = Sy ar (19.33) 


n=—OoOoO 


where each y, (x) is a function of x that we can regard as the coefficient of the term fr” 
in the expansion. For example, let us focus first on the those terms in Equation 19.32 
which when multiplied out result in multiples of ¢°, as will happen by forming products 
such as 


Aye xt x x27 x ee er 4 
tre) wie 2122} \ 210272 J? \ 3103 joey 


Then together, these terms will form y,(x)r° in Equation 19.33, that is 


x 5 a x6 


0) =1— Goa + Gna? Grae t 


or more concisely, 


oo —])" 2n 
yo) = >> ane (19.34) 
n=0 


Comparing Equation 19.34 with Equation 19.27 we see that y,(x) is simply the Bessel 
function of order zero, Jy (x). 
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In a similar manner, suppose we focus attention on the terms in Equation 19.32 that 
when multiplied out result in t! as will happen when we form products such as 


( xt ) (1) rr ( Xx ) xP x? P 
—— and so on. 
1!2 : 2122 1!2¢/’ 3123 212772 


Then, together these will form y, (x)t! in Equation 19.33, so that 


xX xe x” 


1!2 = -2!23 + 312125 °° 


y, (x) = 


or more concisely, 


oo (—1) "241 


y@)= >, (m+ Dimi (19.35) 


m=0 


Comparing Equation 19.35 with 19.30 we see that y, (x) is simply the Bessel function 
of order one, J, (x). 

In the same way, it is possible to show that all the other coefficients y, (x) in Equation 
19.33 are the Bessel functions J, (x). The function e}"-1) in Equation 19.33 is called 
a generating function for the Bessel functions because the coefficient of t” in the ex- 
pansion is the Bessel function J, (x). Thus the expansion is used to generate the Bessel 
functions. We have the following important result: 


Generating function for the Bessel functions: 
e2@-7) = OE (19.36) 


If the function on the left is expanded in powers of ¢ then the coefficient of t” is the 
Bessel function of order n. 


EXERCISES 19.8.4 


function, that is J, (—x) = J, (x). Ifn is an odd 
integer, show that J, (x) is an odd function, that is 
J,(—x) = —J,, (x). 


1. Use the generating function to show that if n is an 
even integer, then the Bessel function J, (x) is an even 


19.8.5 The Jacobi-Anger identities 


Two results which relate to Bessel functions and which are important in communica- 
tions engineering are known as the Jacobi-Anger identities. We first state them before 
deriving them from the generating function. 
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Jacobi-Anger identities 


eixcosé me Se J, (xj? 


n=—OO 


eivsin® = y J, (en? 


n=—OO 


To obtain the first identity we let t = je’, so that r” = j”e”’, in Equation 19.36. 
Noting that 


1 1 : : 
t— : = je’ ~ je = jel’ +je" = 2jcosé 
el 4 ei? 
(where we have used the Euler relation cos @ = = 7 Section 9.7, p 339) then the 


left hand side of Equation 19.36 is 


ei l-7) = @2 (Zicos 4) = elt cosd 
and so from Equation 19.36 we have 


eixcosé = pS J oje”’. 


In a similar fashion, we let t = e!’, so that ¢” = e!”®, in Equation 19.36. Noting that 
1 1 


t——=e? —— =? —¢? = 2jsing 
t el? 
i@ _ e-i0 
(where we have used the Euler relation sind = a Section 9.7 p 339) then the 
J 
left hand side of Equation 19.36 is 
e271) — @22isind) _. gjxsina 


and so from Equation 19.36 we have 


elvsine = 3 J, (ae. 


n=—O0O 


Engineering application 19.13 


Use of Bessel functions in frequency modulation 


A common type of modulation scheme used in electrical engineering is frequency 
modulation (FM). The purpose of modulation is to prepare a signal for transmission, 
such as when a signal is to be sent from a transmitter to a remote receiver during a 
radio broadcast. At the receiver the signal is recovered from its modulated form by a 
device called a demodulator. 

Even though the process of electronically producing an FM signal is fairly 
straightforward, the mathematics underpinning frequency modulation can become 
complicated and requires the use of Bessel functions and a Jacobi-Anger identity. In 
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order to explain how FM works it is helpful to start with the electronic process by 
which the FM signal is generated and to then derive the properties of the modulated 
signal mathematically. 

The FM signal is usually produced by a device called a voltage controlled os- 
cillator (VCO). The VCO generates a sinusoidal signal on its output, the frequency 
of which is controlled by a voltage on its signal input. If we were to connect a d.c. 
voltage to the input then we would observe a simple sine wave on the output. The 
frequency of the sine wave could be changed by changing the d.c. voltage. In an ana- 
logue radio broadcast system however we wish to transmit a time-varying signal and 
this, rather than a d.c. voltage, is used as input to the VCO. 

Consider the case of a sinusoidal modulating signal, sin(27tf,,), that we wish 
to transmit such as that shown in Figure 19.20. Here f,, is the frequency of this mod- 
ulating signal. Another sinusoidal signal, termed the carrier, is used to carry the 
modulated signal. The frequency of the carrier is denoted f,, and is the frequency you 
would tune to on a radio dial to receive the radio station. Let us suppose, for example, 
that the carrier is A cos(27tf.t) as illustrated in Figure 19.21. 


VCO input voltage 
1 
Time, t 
-l 
Figure 19.20 
A sinusoidal modulating signal: the signal we wish to transmit. 
Carrier signal 
A 
a 
Time, t 
-A 
Figure 19.21 


A sinusoidal carrier signal. 


It is useful to define a term, 6, which is known as the modulation index: 


eeu 
ae 


where Af is the maximum change in the frequency of the output from the VCO when 
the modulating signal is applied. In other words the largest instantaneous frequency 
of the carrier is f, + Af and the smallest is f. — Af. It may seem intuitive that the 
frequencies within the output signal would be between f. + Af and f. — Af, but the 
situation is actually more complex. The changing signal produces a different set of 
output frequencies. Consider the following analysis. 

We can describe this modulation process mathematically as 


u(t) = Acos [2nf,t + B sin(27f,,t)] 


B 
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and this output waveform is shown in Figure 19.22. 


VCO output v(t) 
A 
Time,t Figure 19.22 
=a The output signal. 


We may rewrite this equation using phasor notation as: 


v= Re [Ae aie ce] 
—Re [ Ae/27f.te Jb sin(2rtf,) | 
where Re means the real part of what follows. Notice that the second exponential 
term is in the same form as the Jacobi-Anger identity, e**"° = SS J, (x)e!”’, and 
00 


can be expanded out in terms of Bessel functions of the first kind. Thus, 


v(t) = Re [scam Ss sapien 


n=—0Co 


This can be rearranged by combining the exponentials: 


v(t) = Re c > ve inerrseri| 


n=—0o 


v(t) = Re c ~ ip iHernton| 


n=—OO 


Converting back from the phasor notation, and for simplicity taking A = 1, we obtain: 


v(t) = Jo(B) cos(27f.t) 
+ J_,(B) cos[21(f, — fin)t] + J, (B) cos[2a(f. + fin )t] 
+ J_,(B) cos[27(f, — 2f,,,)t] + J,(B) cos[27(f, + 2f,,)#] 
+ J_,(B) cos[2n(f, — 3f,,)t] + J,(B) cos[2n(f, + 3f,,)t] 
=F woe 


We saw in Section 19.8.3 that for integer values of n, J_, (x) = (—1)"J,,(x), hence, 


v(t) = Jo(B) cos(27f.t) 
— J,(B) cos[2n(f, — fit] + J, (B) cos[2n(f, + fin dt 
+ J,(B) cos[2n(f, — 2f,,)t] + J,(B) cos[2n(f, + 2f,,)t] 
— J;(B) cos[27(f.. — 3f,, )t] + J3(B) cos[27(f, + 3f,,)t] 
+... 


We can see that the output from the VCO is actually made up from a set of sinusoidal 
waveforms with frequencies f,—nf,,,..-, f-—3 fin» F-—2Sin Fo Snr Fos Sot Fons fot 
2h» fo + 3hn.--->f. + nf, This is known as the frequency content of the signal 
and will be discussed in more detail in Chapters 23 & 24. The equation suggests that 
a broadcast station with a carrier at f, would have an infinite number of other fre- 
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quencies, known as side bands associated with it. The amplitudes of the side bands 
are given by the values of the Bessel functions. When these are evaluated, the val- 
ues of J, (8) generally diminish with increasing n hence only the sidebands near to 
the carrier frequency have significant amplitude. Therefore frequencies that are a 
long way away from f, can be safely ignored when designing the communications 
system. 

Tabulated values of Bessel functions are available similar to those shown in Table 
19.3. It can be seen how the signal amplitudes generally decrease with increasing 
order of the Bessel function. A larger value of 6 also implies a larger number of 
side bands are significant and need to be considered. The use of Bessel functions 
when designing radio communications systems is very important because each radio 
channel has an allocated bandwidth outside of which it should not stray. To do so 
would create the possibility of different radio channels interfering with each other. 
This mathematical method allows the amplitudes of the side bands to be calculated to 
ensure they are small enough so as to not interfere significantly with other channels. 


Table 19.3 
Table of Bessel functions J, (6) of integer order, n, for different values of 
modulation index, B. 


B is) i= il oe i= 3 n=4 es) n=6 n= 


0.00 1.00 - - - - = - - 
0.25 0.98 0.12 - 
0.50 0.94 0.24 0.03 - - = - - 
OLS 0.86 0.35 0.07 - - - - = 
1.00 0.77 0.44 0.11 0.02 - - = - 
125) 0.65 0.51 0.17 0.04 - - - - 
1.50 0.51 0.56 0.23 0.06 0.01 - - - 
ows 0.37 0.58 0.29 0.09 0.02 - - - 
2.00 0.22 0.58 0.35 0.13 0.03 - - - 
Bes) 0.08 0.55 0.40 0.17 0.05 0.01 - - 
2.50 -0.05 0.50 0.45 0.22 0.07 0.02 - = 
Pris -0.16 0.43 0.47 0.26 0.10 0.03 - - 
3.00 -0.26 0.34 0.49 0.31 0.13 0.04 0.01 = 
325) -0.33 0.24 0.48 0.35 0.17 0.06 0.02 - 
3.50 -0.38 0.14 0.46 0.39 0.20 0.08 0.03 - 
oils -0.40 0.03 0.42 0.41 0.24 0.10 0.04 0.01 
4.00 -0.40 -0.07 0.36 0.43 0.28 0.13 0.05 0.02 
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1. Find the general solution of the following equations: 3 Find the general solution of tx + x = 21. 
dx dx dx 5 
(a) aon (b) Ura 8 4 Solve — + 2x =e" cost 
(c) dy = y cosx (a) by using an integrating factor 


dy (b) by finding its complementary function and a 
2 Solve a. 2, subject to y(0) = 3. particular integral. 
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MI Find the general solution of y’ + 16y = 2. (a) Find the general solution of this equation. 


™ [a 
6 Find the particular solution of (b) (7) = —12, and (3) = 20, find the 
y” + 3y’ — 4y = e*,", y(0) = 2, Y(0) = 0. 


7 
: : F : z displacement of the particle when t = —. 
7 A particle moves in a straight line such that its 2 


displacement from the origin O is x, where x satisfies 8 


° : . Use an integrating factor to solve the differential 
the differential equation 


equation 
2 
dx 
Gz t lor=0 ay Fe eott = cos st 
Solutions 
2 
1 @) x=Ac™ 5 y=Acos4x+ Bsin4x 4 is : 
(b) x =3ln|[1+¢|/ +c 16 128 
1 39e%4+1le** xe 
= —_ = | 
oe aa ee ed 25 as 
2) y=2x+3 7 (a) Asin4t + Bcos4t (b) 12 
c 1 
3 me ae cos 2t — ~ cos4tf +c 
24 : 8 sint; x(t) = lac 
t t sin 
Z ee (4cos OM) ent 


17 


t = | 
“7 x PAF 4 

a AY i 

iO » ay a 
j =. | 
ate: 
“hol eae SPN ss 
: eh a 
; | 
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INTRODUCTION 


Engineers sometimes find it convenient to represent a differential equation using an elec- 
tronic circuit. This is known as analogue simulation. Analogue controllers are also fre- 
quently used in control systems and these are based on similar circuits. Complicated en- 
gineering systems need to be modelled using a set of differential equations. This leads 
to the topic of state-space modelling. Such models find widespread use in the aerospace 
industries as well as in other areas that have complex systems. Numerical techniques 
are very important as they allow differential equations to be solved using computers. 
Often this is the only way to obtain a solution of a differential equation, owing to its 
complexity. State-space models are almost always solved using computers. 


ANALOGUE SIMULATION 


It is possible to solve differential equations using electronic circuits based on operational 
amplifiers. The advantage of this approach is the ease with which the coefficients of 
the differential equation can be adjusted and the effect on the solution observed. The 
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10 1 
"| Al 
7) ! Vo Vg : Uo 
v; v; 10 
Figure 20.1 Figure 20.2 
An integrator. A summer. 


technique is known as analogue simulation. Prior to the mass availability of powerful 
digital computers this was a common approach in engineering design because it pro- 
vided the engineer with an automated method to solve a range of mathematical models. 
Special-purpose computers, known as analogue computers, were historically used which 
had the electronic circuits already incorporated, thus making it easier to simulate a par- 
ticular differential equation. 

Three basic types of circuit are required to enable ordinary differential equations 
with constant coefficients to be simulated. The first type is an integrator which has al- 
ready been discussed in Section 13.2. The usual symbol for such a circuit is shown in 
Figure 20.1. 

If an analogue computer is used it is common for there to be a gain of only | or 10 
on the input voltage. A gain above 10 is better achieved by linking together two circuits 
in series. In addition there is usually a facility to allow initial conditions to be set. The 
equation for the circuit of Figure 20.1 is 


t 
vy = -| (10v, + v, + v3) dt — v,, (20.1) 
0 


where v;,, indicates the initial output voltage. Note that the gain is usually written on the 
input line. 

The second type of circuit is the summer. It has the symbol shown in Figure 20.2. 
The equation for the circuit of Figure 20.2 is 


v, = —(v, + v, + 1003) (20.2) 


Finally, a circuit is required to allow gains to be varied. This is simply a potentiometer 
and is illustrated together with its symbol in Figure 20.3. The equation for the circuit of 
Figure 20.3 is simply 


v=k, O<k<1 (20.3) 


The potentiometer only allows the gain to be varied between 0 and 1. If a variable gain 
greater than | is required, the potentiometer must be placed in series with a circuit that 
increases gain, for example a one-input summer with a gain of 10. 

These circuits can be combined to obtain the solutions of differential equations. This 
is best illustrated by means of an example. Consider the general form of a second-order 
differential equation with constant coefficients: 

Py dy 
aaa + ba t+cy= f(t) (20.4) 
The general approach to obtaining the circuit for a particular differential equation is to 
assume a certain point in the circuit corresponds to a particular term and then arrange 


Figure 20.3 
A potentiometer. 


20.2 Analogue simulation 605 


Figure 20.4 
First stage in synthesizing Equation (20.4). 


to connect that point to the correctly synthesized value. It is more straightforward if the 
assumed point corresponds to the highest derivative. Rearranging Equation (20.4) we 
find 

dy 


a—- =f()—-b 


= dy 
dr2 


_ 20.5 
ay (20.5) 


2 2 
Assuming a is already available, a potentiometer can be used to obtain a Integra- 


d 
tors can then be used to obtain the variables ; and y. This is shown in Figure 20.4. The 


next stage is to obtain the expression corresponding to the r.h.s. of Equation (20.5). This 
requires a summer to add together the individual terms and potentiometers to allow indi- 
vidual coefficients to be obtained. An invertor is also required to obtain the correct sign 


d 
for the variable a This simply consists of a summer with one input and a gain of 1. The 


final circuit is shown in Figure 20.5. The three potentiometers allow different values of 
a, band c to be obtained. If gains greater than 1 were required then extra summers would 
be needed, or alternatively the input gains of the existing summers could be adjusted. 
The input to the circuit, f(t), can be simulated by applying a signal at the appropriate 
point in the circuit. The output of the circuit, y(t), corresponds to the solution of the 
differential equation. It may be necessary to include initial conditions corresponding to 


0) and ® 0 
y(0) an a! ). 


Although analogue computers remain an interesting historical development in the 
simulation of control systems it is now much more common to use digital computers. 
The system to be simulated can be represented either directly in the form of mathematical 
equations or by block diagrams similar to that shown in Figure 20.5. The response to 
various inputs can be readily obtained. 


Figure 20.5 
The complete circuit to synthesize Equation (20.4). 
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Example 20.1 


Solution 


HIGHER ORDER EQUATIONS 


In this section we shall consider second- and higher order equations and show how they 
can be represented as a set of simultaneous first-order equations. The main reason for 
doing this is that when a computer solution is required it is useful to express an equation 
in this form. Details of the analytical solution of such systems are not considered here 
although one technique is discussed in Section 21.10. 

It is possible to express a second-order differential equation as two first-order equa- 
tions. Thus if we have 


dy dy 
= f(—,y, 20.6 

de f ( Fae, *) (20.6) 
we can introduce the new dependent variables y, and y, such that y, = y and y, = all 

; dx 
Equation (20.6) then becomes 

dy dy, 

BM Gp LO») (20.7) 


These first-order simultaneous differential equations are often referred to as coupled 
equations. 


Express the equation 
d 
— —7— +3y=0 


as a set of first-order equations. 


, dy dy, _ dy ; 
Letting y, = y, and y, = —, we find —= = —. Therefore the differential equation 
dx dx dx? 
becomes 
dy dy 


We note that these equations can also be written as 


dy, 
dx 
dy, 
dx 


= 0 


—3y, + Ty, 


or, in matrix form, 
C=C 1)G) 
V5 —3 7) \yo 


d 
where ’ denotes —. 
dx 
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Higher order differential equations can be reduced to a set of first-order equations in a 
similar way. 
Example 20.2 Express the equation 


dx - dx 
dr3 dt? 


dx 
+3— +2x=0 
dt 


as a set of first-order equations. 


Fi dx d? 
Solution Letting x, = x, x, = n and x, = <* we find 


dt? 
whey 
dt? 
dx, 
a3 
dx, 
‘dG ete =0 


is the set of first-order equations representing the given differential equation. 


Example 20.3 (a) Express the coupled first-order equations 


dy 

he ae, 
dy 

ae = 4y, — 2y, 


as a second-order ordinary differential equation, and obtain its general solution. 
(b) Express the given equations in the form 


(1) =4() 


where A is a2 x 2 matrix. 


Solution (a) Differentiating the second of the given equations we have 


d’y, = 41 7 iy2 
dx? dx dx 


and then, using the first, we find 


d’y, dy, 
ge = 4(y, + y2) - ae 


d 
But from the second given equation 4y, = =n + 2y,, and therefore 


Py, _ dy, dy, 
ae de Gy 
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that is, 
dy, dy, —6y,= 
de de 


To solve this we let y = e* to obtain 


P+k—-6=0 
(k —2)(k+3) =0 


Therefore, 


k=2,-3 


The general solution is then y = y, = Ae + Be. It is straightforward to show 
that y, satisfies the same second-order differential equation and hence has the same 
general solution, but with different arbitrary constants. 


(b) The first-order equations can be written as 


(= @ -)( 


Therefore A = ({ 3) 


EXERCISES 20.3 


Express the following equations as a set of first-order 
equations: 


d2y dy 
peal i yee = 
(a) 42 ate 0 
dy ody 
i) 28 pat 
( a2 Pri 
@*? PSs ee 
ek gee 
(d) BY 6 pay 2% 
dr2 eo 
dy dy dy 
pay “al te a ee 
(3 +6 55 +27 +9 
Bx dx dx 
(f) +4— +2x=0 


ee, Lae 
dr3 dr2 dt 


Express the following coupled first-order equations as 
a single second-order differential equation: 


dy, 
(a) ——-=y, + yo, = 2y — 2yy 
dy dy 
OG) 32 -e 
dx, dx, 
(c) a = 3x, +%, 7 = 2x, — 3x, 
dy dy. 
(d) SP = 2y) + 4y9, G2 = Oy — TY 
Express 
dy, 
—_=2y,+6 and 
en yy ay) 
dy, 
—- ——2y, —5 
di yy By) 


as a single second-order equation. Solve this equation 
and hence find y, and y,. Express the equations in the 
form y’ = Ay. 
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Solutions 
dy, dy, Wy dy 
Qa. 2 aye B® wty7 yo 
dy; dy, 2 
eal a ee = d d 
(b) de y2 de 9y, — 8yo (b) 2 i ee +6y=0 
dx dx 
(c) — = Xp = —6x, — 4x5 dx 
dr dr (©) ay —llx=0 
dy, dy, 
d —= —+ =-7y, —6 
(d) qe ee re V1 — Yo o d2y 5 es 
dy, dy, dt? dt — 
OG ao 
3 y’ + 3y' + 2y=0 
d 
. = —y, — 2y, — 6y3 y,@) = Ae~*! + Bet 
2 
dy dey Yo (t) = —<Ae*! — —Be 
Oa 2 a : 
yi 2 6 y 
z = La -3)G) 
ra 2x, — 4x, — 2x3 a) —2 -5} V2 
yAiW }=STATE-SPACE MODELS 


There are several ways to model linear time-invariant systems mathematically. One way, 
which we have already examined, is to use linear differential equations with constant 
coefficients. A second method is to use transfer functions which will be discussed in 
Chapter 21. A third type of model is the state-space model. The state-space technique 
is particularly useful for modelling complex engineering systems in which there are 
several inputs and outputs. It also has a convenient form for solution by means of a 
digital computer. 

The basis of the state-space technique is the representation of a system by means of a 
set of first-order coupled differential equations, known as state equations. The number 
of first-order differential equations required to model a system defines the order of the 
system. For example, if three differential equations are required then the system is a 
third-order system. Associated with the first-order differential equations are a set of state 
variables, the same number as there are differential equations. 

The concept of a state variable lies at the heart of the state-space technique. A sys- 
tem is defined by means of its state variables. Provided the initial values of these state 
variables are known, it is possible to predict the behaviour of the system with time by 
means of the first-order differential equations. One complication is that the choice of 
state variables to characterize a system is not unique. Many different choices of a set of 
state variables for a particular system are often possible. However, a system of order n 
only requires n state variables to specify it. Introducing more state variables than this 
only introduces redundancy. 

The choice of state variables for a system is, to some extent, dependent on experience 
but there are certain guidelines that can be followed to obtain a valid choice of variables. 
One thing that is particularly important is that the state variables are independent of 
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UG 
<— 
G R 
u L 
Figure 20.6 
The circuit could be modelled using v¢ and i as the state 
variables. 


each other. For example, for the electrical system of Figure 20.6, the choice of uc and 
3vc would lead to the two state variables being dependent on each other. A valid choice 
would be vu, and i which are not directly dependent on each other. This is a second-order 
system and so only requires two state variables to model its behaviour. 

Many engineering systems may have a high order and so require several differential 
equations to model their behaviour. For this reason, a standard way of laying out these 
equations has evolved to reduce the chance of making errors. There is also an added 
advantage in that the standard layout makes it easier to present the equations to a dig- 
ital computer for solution. Before introducing the standard form, an example will be 
presented to illustrate the state variable method. 


Engineering application 20.1 


State-space model for a spring-mass-damper system 


Consider the mechanical system illustrated in Figure 20.7. A mass rests on a friction- 
less surface and is connected to a fixed wall by means of an ideal spring and an ideal 
damper. A force, f(t), is applied to the mass and the position of the mass is w. 


ui) 


Figure 20.7 
A second-order mechanical system. 


By considering the forces acting on the mass, M, it is possible to devise a differ- 
ential equation that models the behaviour of the system. The force produced by the 
spring is Kw (K is the spring stiffness) and opposes the forward motion. The force 


dw 
produced by the damper is Bae where B is the damping coefficient, and this also 


opposes the forward motion. Since the mass is constant, Newton’s second law of mo- 
tion states that the net force on the mass is equal to the product of the mass and its 
acceleration. Thus we find 
ew 
dr? 
Note that this is a second-order differential equation and so the system is a second- 
order system. 

In order to obtain a state-space model the state variables have to be chosen. There 
are several possible choices. An obvious one is the position, w, of the mass. A second 


f@)— BS —~Kw=M (20.8) 
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state variable is required as the system is second order. In this case the velocity of the 
mass will be chosen. The velocity of the mass is not directly dependent on its position 
and so the two variables are independent. Another possible choice would have been 
dw ; : : : 
— — w. This may seem a clumsy choice but for certain problems such choices may 
lead to simplifications in the state variable equations. 

It is customary to use the symbols x,, x,, x3, .. . to represent variables for reasons 
that will become clear shortly. So, 


a) (20.9a) 
dw 

ee 20.9b 

x dt ( ) 


Because of the particular choice of state variables, it is easy to obtain the first of 
the first-order differential equations — thus illustrating the need for experience when 
choosing state variables. 

Differentiating Equation (20.9a) gives 


dx, dw 

SS] SS SS X5 

dt dt 
This is the first state equation although it is usually written as 

X= % 

dx, eo 

where x, denotes ie The second of the first-order equations is obtained by rear- 
ranging Equation (20.8): 


dw K B dw 


1 
=— — t 20.10 
dt? ivan M dt oD ve ) ( ) 


However, differentiating Equation (20.9b) we get 
dw 
dt? 
Then, using Equation (20.10) we obtain 


= Xy 


, K B 1 
xX = maa! = M2 ar Vo 


Finally, it is usual to arrange the state equations in a particular way: 


x= ep 

: B 1 

xy = 77 = um? ae Te 
Wis, 


Note that it is conventional to relate the output of the system to the state variables. 
Assume that for this system the required output variable is the position of the mass, 
that is w. 

It is straightforward to rewrite these equations in matrix form: 


(:) . (<i seen) (*:) * Ge) f(t) 
v=(19(2) 
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More generally, the standard form of the state equations for a linear system is given by 


x(t) = Ax(t) + Bu(t) 
y(t) = Cx(t) + Du(f) 


For a system with n state variables, r inputs and p outputs: 


x(t) is an n-component column vector representing the states of the nth-order sys- 
tem. It is usually called the state vector. 


u(t) is an r-component column vector composed of the input functions to the 
system. It is usually called the input vector. 


y(t) is a p-component column vector composed of the defined outputs of the 
system. It is referred to as the output vector. 


A = (n X n) matrix, known as the state matrix. 
B= (n x r) matrix, known as the input matrix. 
C = (p Xx n) matrix, known as the output matrix. 


D = (p X r) matrix, known as the direct transmission matrix. 


A, B, C and D have constant elements if the system is time invariant. When presented 
in this form the equations appear to be extremely complicated. In fact, the problem is 
only one of notation. The general nature of the notation allows any linear system to be 
specified but in many cases the matrices are zero or have simple coefficient values. For 
example, the matrix D is often zero for a system as it is unusual to have direct coupling 
between the input and the output of a system. In this format it would be straightforward 
to present the equations to a digital computer for solution. 


Engineering application 20.2 


An armature-controlled d.c. motor 


Derive a state-space model for an armature-controlled d.c. motor connected to a me- 
chanical load with combined moment of inertia J, and viscous friction coefficient B. 
The arrangement is shown in Figure 20.8. 


v, = applied armature voltage é, = back e.m-f. of the motor 

i, = armature current @, = angular speed of the motor 

R, = armature resistance T = torque generated by the motor. 
L, = armature inductance 


II 


Solution 


Let us assume that the system input is the armature voltage, v,, and the system output 
is the angular speed of the motor, w,,. This is a second-order system and so two state 
variables are required. We will choose the armature current and the angular speed of 
the motor. So, 


ob] = Un X7 = On 
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Fixed field 


a L, tn GY 


Va 


Figure 20.8 
An armature- 
controlled d.c. motor. 


The next stage is to obtain a mathematical model for the system. Using Kirchhoff’s 
voltage law and the component laws for the resistor and inductor we obtain, for the 
armature circuit, 


: di, 
Ui i,R, er a & 
Now for a d.c. motor the back e.m.f. is proportional to the speed of the motor and is 
given by e, = K,,,, K, constant. So, 


eal 


; di, 
UL, = Uk, ae Ib = Sr IO 


a = bi 
eae eee ae (20.11) 
— = 0) Fs 
fo ne pin ee 


Let us now turn to the mechanical part of the system. If G is the net torque about 
the axis of rotation then the rotational form of Newton’s second law of motion states 
dw R ; . dw . ; 

G = J—, where J is the moment of inertia, and — is the angular acceleration. In 


: t ee 
this example, the torques are that generated by the motor, 7, and a frictional torque 
Bw,, which opposes the motion, so that 


For a d.c. motor, the torque developed by the motor is proportional to the armature 
current and is given by T = K;i,, where K; is a constant. So, 


renee yee 
Tha On a. dt 
ies WB 
Seige Sa eee (20.12) 


Equations (20.11) and (20.12) are the state equations for the system. They can be 
arranged in matrix form to give 


io \ {RL RIE NGG, We 
ete Selle) al 0 )e 


: ; EX i 

Alternatively the notation x = @ = i ) and u = v, can be used. However, 
a m 

when there is no confusion it is better to retain the original symbols because it makes 

it easier to see at a glance what the state variables are. 
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Finally, an output equation is needed. In this case it is trivial as the output variable 
is the same as one of the state variables. So, 


Engineering application 20.3 


State-space model for a coupled-tanks system 


Derive a state-space model for the coupled tank system shown in Figure 20.9. The 
tanks have cross-sectional areas A, and A,, valve resistances R, and R,, fluid heights 
h, and hy, input flows g, and q,. Additionally, there is a flow, g,, out of tank 2. 
Assume that the valves can be modelled as linear elements and let the density of the 
fluid in the tanks be p. Let g; be the intermediate flow between the two tanks. 


92 


Gh ae 


Tank 1, R; Tank 2, ao Figure 20.9 
area A, area Ay Coupled tanks. 
Solution 


A convenient choice of state variables is the height of the fluid in each of the tanks, 
although a perfectly acceptable choice would be the volume of fluid in each of the 
tanks. For tank 1, conservation of mass gives 
dh, 

Ch = Ch = ae 
For the resistance element, R,, the pressure difference across the valve is equal to the 
product of the flow through the valve and the valve resistance. This can be thought of 
as a fluid equivalent of Ohm’s law. Note that atmospheric pressure has been ignored 
as it is the same on both sides of the valve. So, 


pgh, — pgh, = GR, 


p 
ah) (20.13) 
R, 
Combining these two equations gives 
dh, Ps pg i 
=— h h 20.14 
T= a ra ae ae 
For tank 2, 
dh 
—qg, =A,— 
9; ar UD o 2 dt 


pgh, = Roq, (20.15) 
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Combining these equations and using Equation (20.13) to eliminate g; and qg, gives 
dh 1 
Du enettne: Pe ps ue ps hk @, (20.16) 
dt RGA Ivy Ukkpytlo, A, 
Equations (20.14) and (20.16) are the state-space equations for the system and can 
be written in matrix form as 


—pg pg cae, 
wy lal® le 
hy oe | Pe oe 

R,A, R\A,  R,A, Aly 


Note that in this case the input vector is two-dimensional as there are two inputs to 
the system. The output equation is given by 


qo = (0 ps/R2) ) 


and is obtained directly from Equation (20.15). 


NUMERICAL METHODS 


All the techniques we have so far met for solving differential equations are known as 
analytical methods, and these methods give rise to a solution in terms of elementary 
functions such as sin x, e*, x°, etc. In practice, most engineering problems involving dif- 
ferential equations are too complicated to be solved easily using analytical techniques. 
It is therefore frequently necessary to make use of computers. One such approach is to 
use analogue simulation, which we discussed in Section 20.2. Another, more common, 
approach is to use digital computers. 

There are a variety of computer software packages available for solving differential 
equations, including MATLAB® which has already been introduced. Often the detail of 
how the computer solves the equation is hidden from the user and it uses one of a range 
of numerical methods. We will examine some of the methods that might be applied by 
the computer in this section. 

When using numerical methods it is important to note that they result in approximate 
solutions to differential equations and solutions are only calculated at discrete intervals 
of the independent variable, typically x or t. We shall begin by examining the first-order 
differential equation 

dy 
7 f(x, y) 
subject to the initial condition y(x)) = yp. Usually the solution is obtained at equally 
spaced values of x, and we call this spacing the step size, denoted by h. By choosing 
a suitable value of h we can control the accuracy of the approximate solution obtained. 
We shall write y, for this approximate solution at x = x,, whereas we write y(x,,) for the 
true solution at x = x,. Generally these values will not be the same although we try to 
ensure the difference is small for obvious reasons. 

The simplest numerical method for the solution of the differential equation 

dy 
4p 1 
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with initial condition 


YX) = Yo 


is Euler’s method which we shall study in the next section. 


EULER’S METHOD 


d 
You will recall, from Chapter 12, that given a function y(x), the quantity ;. represents 


d 
the gradient of that function. So if ;, = f(x, y) and we seek y(x), we see that the differ- 


ential equation tells us the gradient of the required function. Given the initial condition 
y = Yo when x = x, we can picture this single point as shown in Figure 20.10. Moreover, 


we know the gradient of the solution here. Because Sa f(x, y) we see that 


which equals the gradient of the solution at x = x). Thus the exact solution passes 
through (x9, yp) and has gradient f(x, yo) there. We can draw a straight line through this 
point with the required gradient to approximate the solution as shown in Figure 20.10. 
This straight line approximates the true solution, but only near (xp, yo) because, in gen- 
eral, the gradient is not constant but changes. So, in practice we only extend it a short 
distance, h, along the x axis to where x = x,. The y coordinate at this point is then taken 
as y,. We now develop an expression for y,. The straight line has gradient f (x9, yo) and 
passes through (xp, yo). It can be shown that its equation is therefore 


Y= + AX — XH) FO; Yo) 

When x = x, the y coordinate is then given by 
Yi = Yo + Q — Xo) FM: Yo) 

and since x, — X) = h we find 
Y, = Yo thf (%, Yo) 


This equation can be used to find y,. We then regard (x,, y,) as known. From this known 
point the whole process is then repeated using the formula 


Yin =I; FASO Yi) 


True solution 


Yj) | --------------- Tangent line 
r approximation 
il (Soe ee 
210i) PAE Sess 
Figure 20.10 
0 Xo xy x 


Approximation used in Euler’s method. 


Example 20.4 


Solution 


20.6 Euler's method 617 


and we can therefore generate a whole sequence of approximate values of y. Naturally, 
the accuracy of the solution will depend upon the step size, h. In fact, for Euler’s method, 
the error incurred is roughly proportional to h, so that by halving the step size we roughly 
halve the error. 

An alternative way of deriving Euler’s method is to use a Taylor series expansion. 
Recall from Chapter 18 that 


/ h ” 
yQ% +hy= y(%) + hy (Xp) =F oye (Xp) +s. 


If we truncate after the second term we find 
YX +h) © YX) + hy’) 

that is, 
Yt = Yo + hy (%q) = Yo + AF(%, Yo) 


so that Euler’s method is equivalent to the Taylor series truncated after the second term. 


Use Euler’s method with h = 0.25 to obtain a numerical solution of 


dy ys 

dx 
subject to y(0) = 2, giving approximate values of y forO < x < 1. Work throughout 
to three decimal places and determine the exact solution for comparison. 


We need to calculate y,, y,, y, and y,. The corresponding x values are x, = 0.25, x, = 
0.5, x, = 0.75 and x, = 1.0. Euler’s method becomes 

Var =), +0.25(-xy7) with x=0 yy=2 
We find 

y, = 2 — 0.25(0)(2”) = 2.000 

Yy = 2 — 0.25(0.25)(2”) = 1.750 

y, = 1.750 — 0.25(0.5) (1.7507) = 1.367 

Y4 = 1.367 — 0.25(0.75) (1.367°) = 1.017 


The exact solution can be found by separating the variables: 


d 
San f xds 
y 


so that 
2 


ee 
y 2 


1 
Imposing y(0) = 2 gives C = —3 so that 
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Example 20.5 


Solution 


Table 20.1 Table 20.2 
Comparison of numerical solution by Euler’s The solution to Example 20.5 
method with exact solution. by Euler’s method. 
i x; Ji yx) i xj Ji 
numerical exact 

0 1.00 1.00 
0) 0.000 2.000 2.000 1 1.20 1.20 
1 0.250 2.000 1.882 2 1.40 1.40 
2 0.500 1.750 1.600 3 1.60 1.60 
3 0.750 1.367 1.280 4 1.80 1.80 
4 1.000 1.017 1.000 5 2.00 2.00 


Table 20.1 summarizes the numerical and exact solutions. In this example only one 
correct significant figure is obtained. In practice, for most equations a very small step 
size is necessary which means that computation is extremely time consuming. An im- 
provement to Euler’s method, which usually yields more accurate solutions, is given in 
Section 20.7. 


Obtain a solution for values of x between 1 and 2 of 


dy _y 


dx x 


subject to y = | when x = 1 using Euler’s method. Use a step size of h = 0.2, working 
throughout to two decimal places of accuracy. Compare your answer with the analytical 
solution. Comment upon the approximate and exact solutions. 


Here we have f(x,y) = S Yo = Lx = 1 andh = 0.20. With a step size of 0.2, 
xX, = 1.2, x, = 1.4,...,x; = 2.0. We need to calculate y,, y,,..., y;. Euler’s method, 
Vion = Y, + hf (%;, y;), reduces to 


Therefore, 
Yo i 
Y, =¥ + 9.2( = |) = 14+ 0.2{ — ) = 1.20 
Xo 1 
Similarly, 
y 1.20 
— 0.2{ — }) = 1.20+ 0.2{ —— } = 1.40 
a 2) " ; 


Continuing in a similar fashion we obtain the results shown in Table 20.2. To obtain the 
analytical solution we separate the variables to give 


ae 
y x 
so that 
Iny = Inx+InD = InDx 
Therefore, 


y = Dx 
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When x = 1, y = 1 so that D = 1, and the analytical solution is therefore y = x. We 
see that in this example the numerical solution by Euler’s method produces the exact 
solution. This will always be the case when the exact solution is a linear function and 
exact arithmetic is employed. 


EXERCISES 20.6 


1. Using Euler’s method estimate y(3) given 2. Find y(0.5) ify’ =x+y, yO) = 0. Use h = 0.25 and 


h = 0.1. Find the true solution for comparison. 


x+y 
y=——  y@)=1 3 
x 


Use h = 0.5 and h = 0.25. Solve this equation 
analytically and compare your numerical solutions 
with the true solution. 


Use Euler’s method to find v(0.01) given 


10-2 2 +v=sinl00mt  v(0)=0 
dt ~ > 


Take h = 0.005 and h = 0.002. Find the analytical 
solution for comparison. 


Solutions 
1 Exact: y = xIn |x| — 0.1931x 3 Exact: 
sin L0Omt — mt cos 1007 + mte~ 19 
xj Ji Ji y = 5) 
(h = 0.5) (h = 0.25) (exact) m +1 
t. v; v 
2.00 1.0000 1.0000 1.0000 : 
2.25 : 1.3750 1.3901 =) (ge) 
2.50 1.7500 1.7778 1.8080 0 0.0000 0.0000 
2.75 = 2.2056 2.2509 0.005 0.0000 0.2673 
3.00 2.6000 2.6561 2.7165 0.010 0.5000 0.3954 
2 Exact: y= —x—1+e* t; v; v 
(h = 0.002) exact) 
xj Ji y ( 
(h = 0.25) (exact) 0 0.0000 0.0000 
0 0.0000 0.0000 0.002 0.0000 0.0569 
0.004 0.1176 0.1919 
0.25 0.0000 0.0340 
0.50 0.0625 0.1487 0.006 0.2843 0.3354 
‘ : : 0.008 0.4176 0.4178 
0.010 0.4517 0.3954 
xj Ji y 
(h = 0.1) (exact) 
0 0.0000 0.0000 
0.1 0.0000 0.0052 
0.2 0.0100 0.0214 
0.3 0.0310 0.0499 
0.4 0.0641 0.0918 
0.5 0.1105 0.1487 
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20.7 


Example 20.6 


Solution 


IMPROVED EULER METHOD 


You will recall that Euler’s method is obtained by first finding the slope of the solution at 
(Xo, Yo) and imposing a straight line approximation (often called a tangent line approx- 
imation) there as indicated in Figure 20.10. If we knew the gradient of the solution at 
xX = x, in addition to the gradient at x = xy, a better approximation to the gradient over 
the whole interval might be the mean of the two. Unfortunately, the gradient at x = x, 
which is 

dy 


a = f(y) 


xX=X, 


cannot be obtained until we know y,. What we can do, however, is use the value of y, 
obtained by Euler’s method in estimating the gradient at x = x,. This gives rise to the 
improved Euler method: 


Y, =o +h x (average of gradients at x, and x,) 


fp: Yo) pieuiul 


— h 
Yo + «| 2 


h 
=Yot 5 {Fo Yo) + F(X, Yq thf OX, ¥o))} 


Then, knowing y, the whole process is started again to find y,, etc. Generally, 
h 
Yin = Yi + 5 fi. Yi) + Fin. 9; + AF; ¥))} 


It can be shown that, like Euler’s method, the improved Euler method is equivalent to 
truncating the Taylor series expansion, in this case after the third term. 


Use the improved Euler method to solve the differential equation y = —xy’, y(0) = 2, 
in Example 20.4. As before take h = 0.25, but work throughout to four decimal places. 


Here f(x, y) = —xy’, Yo = 2, X%) = 0. We find f(x), yy) = f (0, 2) = —0(2?) = 0. The 
improved Euler method 


V1 Yo t 5 lf 0.90) + Fle Yo + Af lH, ¥0))) 
yields 

y, =2+ “0 + f (0.25, 2)} 
Now f (0.25, 2) = —0.25(22) = —1, so that 

y, =2+0.125(—1) = 1.875 


We shall set out the calculations required in Table 20.3. You should work through the next 
few stages yourself. Comparing the values obtained in this way with the exact solution, 
we see that, over the interval of interest, our solution is usually correct to two decimal 
places (Table 20.4). 
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Table 20.4 


Comparison of the improved Euler 
method with the exact solution. 


Table 20.3 
Applying the improved Euler method to Example 20.6. 


bX; Yi FO yp) YAMA) — FOi41 Vi Vist i Xj Yj yQ@j) 
thf (x;, y;)) 
0 0.000 2.0000 2.0000 
0 O 2 0 2 —l1 1.8750 1 0.2500 1.8750 1.8824 
1 0.25 1.8750 —0.8789 1.6553 —1.3700 1.5939 2 0.5000 1.5939 1.6000 
2 05 1.5939 —1.2703 1.2763 —1.2217 1.2824 3 0.7500 1.2824 1.2800 
3. 0:75 1.2824 —1.2334 0.9741 —0.9489 1.0096 4 1.0000 1.0096 1.0000 


Example 20.7 Apply both Euler’s method and the improved Euler method to the solution of 
dy 
as 


for0 < x < 0.5 using h = 0.1. Compare your answers with the analytical solution. 
Work throughout to three decimal places. 


2x y= 1 whenx=0 


Solution We have - = 2x, X) = 0, yo = 1. Therefore, using Euler’s method we find 
Yaw = Ve Gey) 
= y, +0.1(2x,) 
= y, + 0.2x,; 
Therefore, 
Y, =¥p + 0.2% = 1+ (0.2)(0) =1 
Yo = y, +0.2x, = 1+ (0.2)(0.1) = 1.02 


The complete solution appears in Table 20.5. Check the values given in the table for 
yourself. Using the improved Euler method we have 


h 
Ya pr 5 FG Yi) + Fig Vi FAL YD) 
= y, + 0.05(2x; + 2x;,1) 
=y, + 0.10; +%4.,) 


Table 20.5 
The solution of Example 20.7 using Euler’s method, 
the improved Euler method and the exact solution. 


Xx; y; (Euler) y; (improved y(x;) 
Euler) (exact) 
0 1.000 1.000 1.000 
0.1 1.000 1.010 1.010 
0.2 1.020 1.040 1.040 
0.3 1.060 1.090 1.090 
0.4 1.120 1.160 1.160 


0.5 1.200 1.250 1.250 


622 Chapter 20 Ordinary differential equations Il 


Therefore, 
Y, =Yo + O.1 (xp + x,) = 14+0.1(0 + 0.1) = 1.01 
yp = y, +0.1(%, + x,) = 1.014 0.1(0.1 + 0.2) = 1.04 


and so on. The complete solution appears in Table 20.5. Check this for yourself. The 
analytical solution is 


y= ; dx =x +c 
Applying the condition y(0) = 1 gives c = 1, and so y = x* + 1. Values of this function 


are also shown in the table and we note the marked improvement given by the improved 
Euler method. 


EXERCISES 20.7 


1. Apply the improved Euler method to Question 1 in 3 Apply the improved Euler method to Question 3 in 
Exercises 20.6. Exercises 20.6. 

2. Apply the improved Euler method to Question 2 in 
Exercises 20.6. 

Solutions 

1 EE=—=—==—=EE__EE_— 
Xx; y; (h=0.5) y; (A=0.25) _ y (exact) t; v; (h = 0.005) __v (exact) 
2.00 1.0000 1.0000 1.0000 0 0.0000 0.0000 
2.25 - 1.3889 1.3901 0.005 0.2500 0.2673 
2.50 1.8000 1.8057 1.8080 0.010 0.2813 0.3954 
2.75 - 2.2476 2.2509 ———— es 
3.00 2.7017 2.7123 2.7165 EE 

t; v; (h = 0.002) _ v (exact) 

2 0 0.0000 0.0000 

x; y; (h=0.25) yy (exact) 0.002 0.0588 0.0569 
0.004 0.1903 0.1919 

0 0.0000 0.0000 0.006 0.3273 0.3354 
0.25 0.0313 0.0340 0.008 0.4032 0.4178 
0.50 0.1416 0.1487 0.010 0.3777 0.3954 
x; y;(h=0.1)  y (exact) 
0 0.0000 0.0000 
0.1 0.0050 0.0052 
0.2 0.0210 0.0214 
0.3 0.0492 0.0499 
0.4 0.0909 0.0918 
0.5 0.1474 0.1487 
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| 20.8 RUNGE-KUTTA METHOD OF ORDER 4 


The ‘Runge-Kutta’ methods are a large family of methods used for solving differential 
equations. The Euler and improved Euler methods are special cases of this family. The 
order of the method refers to the highest power of h included in the Taylor series expan- 
sion. We shall now present the fourth-order Runge-Kutta method. The derivation of this 
is beyond the scope of this book. 


d 
To solve the equation ;. = f(x, y) subject to y = yy when x = x, we generate the 


sequence of values, y,, from the formula 


h 
Yin = Vi ras + 2k, + 2k, + k4) (20.17) 
where 
k, = f(x;, y;) 
k, =f ie ay 
2=I\% 3 7" 
h h 
k,=f B+ 519; + oho 


k, = f(x +h, y; + hk) 


d 
Example 20.8 Use the Runge-Kutta method to solve ; = —xy’, for0 < x < 1, subject to y(O) = 2. 


Use h = 0.25 and work to four decimal places. 


Solution We shall show the calculations required to find y,. You should follow this through and 
then verify the results in Table 20.6. The exact solution is shown for comparison. 


f@ey=—y h=025 x4 =0 y=2 
Taking 7 = 0 in Equation (20.17), we have 

k, = fQ%:¥) = —0@2) =0 

k, = f(0.125, 2) = —0.125(2)* = —0.5 

k, = (0.125, 2+ “> (-05) = f (0.125, 1.9375) 


= —0.125(1.9375)* = —0.4692 
k, = f(0.25, 2 + 0.25(—0.4692)) = (0.25, 1.8827) 
= —0.25(1.8827)* = —0.8861 


Therefore, 


0.25 
yy, =2+ a + 2(—0.5) + 2(—0.4692) + (—0.8861)) = 1.8823 
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Example 20.9 


Solution 


Table 20.6 Table 20.7 
Comparison of the Runge-Kutta method The solution to 
with exact solutions. Example 20.9. 
i ? yj yj) x, y; 

f l 
0 0.0 2.0 2.0 “a te 
1 0.25 1.8823 1.8824 ae ce 
2 0.5 1.5999 1.6000 04 0.0897 
3 0.75 1.2799 1.2800 06 0.2112 
; i : 


1.0 1.0000 1.0000 


Use the fourth-order Runge-Kutta method to obtain a solution of 
dy 
dx 


subject to y = 0 when x = 0, forO < x < 0.6 with h = 0.2. Work throughout to four 
decimal places. 


=x+x-y 


We have 
4a = Yer i + 2k + 2ks + ky) 
where 
ky = fy) 
ky = A(x na Ay, ae sh) 
2 2 
b= A(x + an + sh) 
2: 2 
ky = £0 +h, y; + hks) 
In this example, f(x, y) =x? +x — y and x) = 0, yy = 0. The first stage in the solution 
is given by 
k, = f(%,%) = 9 
k, = f(0.1,0) = 0.11 
k, = f(0.1,0+ (0.1)(0.11)) = f(0.1, 0.011) = 0.099 
k, = f(0.2, 0 + (0.2)(0.099)) = f(0.2, 0.0198) = 0.2202 
Therefore, 


0.2 
y, =O+ ae + 2(0.11) + 2(0.099) + 0.2202) = 0.0213 


which, in fact, is correct to four decimal places. Check the next stage for yourself. The 
complete solution is shown in Table 20.7. 
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20.8.1 Higher order equations 


The techniques discussed for the solution of single first-order equations generalize read- 
ily to higher order equations such as those described in Section 20.3 and Section 20.4. It 
is obvious that a computer solution is essential and there are a wide variety of computer 
packages available to solve such equations. 


EXERCISES 20.8 
1 Find y(0.4) if y’ = (x+y)? and y(0) = 1 using the 3 Repeat Question 2 in Exercises 20.6 using the 
Runge-Kutta method of order 4. Take (a) 4 = 0.2 fourth-order Runge-Kutta method. 
and (b) h=0.1. 


4 Repeat Question 3 in Exercises 20.6 using the 
2 Repeat Question | in Exercises 20.6 using the fourth-order Runge-Kutta method. 
fourth-order Runge-Kutta method. 


Solutions 
1 (a) (b) 
Xe, xX “i yi y 
i yi i yi (h = 0.1) (exact) 
‘ : mt ‘ ; mend 0.0 0.0000 0.0000 
vi 5 ei io (a0e5 0.1 0.0052 0.0052 
: ; ae pie 0.2 0.0214 0.0214 
a ae 0.3 0.0499 0.0499 
0.4 0.0918 0.0918 
0.5 0.1487 0.1487 
2 
Xi di yi y % 
(h = 0.5) (h = 0.25) (exact) 
t. Uv: Vv 
2.00 1.0000 1.0000 1.0000 (h = 0.005) (exact) 
2.25 - 1.3900 1.3901 
2.50 1.8078 1.8079 1.8080 0 0.0000 0.0000 
2.75 - 2.2507 2.2509 0.005 0.2675 0.2673 
3.00 2.7163 2.7164 2.7165 0.010 0.3959 0.3954 
3 t U; Vv 
h = 0.002 exact 
- x : ( ) (exact) 
(h = 0.25) (exact) 0 0.0000 0.0000 
a ee ee 0.002 0.0569 0.0569 
i 0.0000 .000 
Se Doo 0.1919 01919 
6a eye Ga 0.006 0.3352 0.3354 
; 0.008 0.4178 0.4178 


0.010 0.3954 0.3954 


626 Chapter 20 Ordinary differential equations Il 


REVIEW EXERCISES 20 


1. Use Euler’s method with h = 0.1 to estimate x(0.4) 


dx 
given a = - 2xt, x(O) = 2. 


2. The generalization of Euler’s method to the two 
coupled equations 


dy, 
= =JIUG,Y1; 
de F(X, Y15 V2) 
dy, 
a = g(x, Y1>Y2) 
is given by 
Viger My + FCM,» V2,,) 
y2 = 99.0 i Agri V1,» 2,4) 


(+1) @ 
Given the coupled equations 
dy, dy 
Gt a 
estimate y, (0.3) and y, (0.3), if y, (0) = 1 and 
yy(0) = —1. Take h = 0.1. 


Solutions 


1 4.7937 


2 0.7535, —0.7535 


3 (a) O1 = y, 2 = By, — 6, 
(b) Oi Be i — 5x 
de? dt | Z 
(c) on — a) =x 
a °°? de 
43 = 5x1 — 6x, — 8x, 


3 Express the following equations as a set of first-order 


equations: 
Py dy 
— +6— + 8y=0 
(a) Po + a + By 
“se eet 
pole — Ay = 
dr? dt 
dx. dx de 
4— +8— +6— +5x=0 
(c) °c + a2 + de + 5x 


Express the following coupled first-order equations as 
a single second-order differential equation: 


(a) ef = 3y, + 4), 2 = 6y, + 8y, 
(b) = = 6x, Sn, 2 = 4x, — 3x, 
(c) Oo = 8y, + 4), 2 = 4y, — 6y, 
(a) ee ae 

2 
(b) 5p tra0 
(c) #52 _ sy =0 


t = | 
“7 x PAF 4 

a AY i 

on } 
iO » ay a 
j =. | 
ate: 
“hol eae SPN ss 
: eh a 
; | 
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INTRODUCTION 


The Laplace transform is used to solve linear constant coefficient differential equations. 
This is achieved by transforming them to algebraic equations. The algebraic equations 
are solved, then the inverse Laplace transform is used to obtain a solution in terms of 
the original variables. This technique can be applied to both single and simultaneous 
differential equations and so is extremely useful given that differential equation models 


are common as we saw in Chapters 19 and 20. 


The Laplace transform is also used to produce transfer functions for the elements of an 
engineering system. These are represented in diagrammatic form as blocks. The various 
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Example 21.1 


Solution 


blocks of the system, corresponding to the system elements, are connected together and 
the result is a block diagram for the whole system. By breaking down a system in this 
way it is much easier to visualize how the various parts of the system interact and so 
a transfer function model is complementary to a time domain model and is a valuable 
way of viewing an engineering system. Transfer functions are useful in many areas of 
engineering, but are particularly important in the design of control systems. 


DEFINITION OF THE LAPLACE TRANSFORM 


Let f(t) be a function of time f. In many real problems only values of tf > 0 are of 
interest. Hence f(t) is given for t > 0, and for all t < 0, f(t) is taken to be 0. 


The Laplace transform of f(t) is F(s), defined by 


F(s) = i} * eo f(t) dt 
0 


Note that to find the Laplace transform of a function f(t), we multiply it by e~” and 
integrate between the limits 0 and oo. 

The Laplace transform changes, or transforms, the function f(t) into a different func- 
tion F'(s). Note also that whereas f(t) is a function of t, F'(s) is a function of s. To denote 
the Laplace transform of f(t) we write C{f(t)}. We use a lower case letter to represent 
the time domain function and an upper case letter to represent the s domain function. 
The variable s may be real or complex. As the integral is improper restrictions may need 
to be placed on s to ensure that the integral does not diverge. 


Find the Laplace transforms of 


(a) 1 (b) e# 


(a) L£{l} = fen = =| = t = F(s) 
0 s 


=s 5 
This transform exists provided the real part of s, Re(s), is positive. 


(b) L{e“} = / e “edt 
0 


00 
-| e “Star dt 
0 
| en sta)t ii 
~ L-@ +4) Jo 


(=r) 
SS eS 
—(s+a) 


1 
— =f 
Sia (s) 


This transform exists provided s + a > 0. 


Throughout the chapter it is assumed that s has a value such that all integrals exist. 


Example 21.2 
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LAPLACE TRANSFORMS OF SOME COMMON 


FUNCTIONS 


Determining the Laplace transform of a given function, f(t), is essentially an exercise 
in integration. In order to save effort a look-up table is often used. Table 21.1 lists some 


common functions and their corresponding Laplace transforms. 


Table 21.1 


The Laplace transforms of some common functions. 


Function, f(t) 


Laplace transform, F (s) 


Function, f (t) 


Laplace transform, F (s) 


1 es 


g2 
? . 
9 
! 
a” n! 
git 
ett 1 
s—a 
ena 1 
S+a 
the-@ n! 
(s + antl 
ude b 
sin — 
sr +b 
Ss 
cos bt =>— 
s+ bh 
: b 
e “ sin bt 


(sta)?+b 


e “ cos bt 


sinh bt 


cosh bt 


e “ sinh bt 


e “ cosh bt 


t sin bt 


t cos bt 


u(t) unit step 


u(t —d) 
5(t) 
b(t —d) 


Sta 
G+ayr +h 
b 
g2 — p2 
Ss 
g2 — p 
b 
(sta)? —b? 
sta 
(sta)? —b? 
2bs 
(s2 + b*)2 
eb 
(oh 


Use Table 21.1 to determine the Laplace transform of each of the following functions: 


(a) 3 
(c) sin 4t 


(e) cos ( s) 
(g) cosh 5t 


(i) e~ sin 2t 
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Solution From Table 21.1 we find the results listed in Table 21.2. 
Table 21.2 
f(t) F(s) f(t) F(s) f(t) F(s) 
(a) 23 £ (e) cos(t/2) 7 (h) tsin4t aS 
st s2 + 0.25 (s2 + 16)? 
7! 3 2 
7 . ey 
(b) t 3 (f) sinh 3t 2-5 (i) e' sin 2t Gal a4 
(c) sin4t z (g) cosh 5t 7 (Gj) e' cost = 
2416 e 225 (s—3)2 41 
1 
d —2t 
(d) e 22 
EXERCISES 21.3 
1. Determine the Laplace transforms of the following (k) tsint (1) e~' sin 3 
functions: cos 7t ; 
(a) sin 6f (b) cos 4t eat top setahse 
(0) cosh 5t (p) e7*! sinh 4r 
. f 2t 4t 
(c) wn( 2) (d) co(#) (q) e~2! cosh 7t (r) e* sinh 3r 
(e) 14 (f) 1203 (s) e” cosh 9t 
3, 3 
(g) eo (h) e* 2 Show from the definition of the Laplace transform that 
1 
i) — j) tcos3t —sd 
O @ 0) Llu(t —d)} = ——, d>0 
s 
Solutions 
6 Ss 2s 3 
5 —— — j= —_ 
(a) s? + 36 () s*+16 &) (s? +1)? @ (s+ 1)2+9 
6 9s s+5 
d) ——— eee Se 
(©) 9s2 +4 @ 9s2 + 16 (m) (s+5)2 +49 (n) 52 — 36 
24 120 4 
(ec) = (f) — 5 —— 
s sf ©) 235 ©) Gea? 16 
1 1 
(2) —> (h) ere 
+3 s—3 @ Grn? © G-Hr~9 
i = oo s—7 
i piace 
s+4 (+992 © Gop 


21.4.1 


Example 21.3 


Solution 


Example 21.4 


Solution 
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PROPERTIES OF THE LAPLACE TRANSFORM 


There are some useful properties of the Laplace transform that can be exploited. They 
allow us to find the Laplace transforms of more difficult functions. The properties we 
shall examine are: 

(1) linearity; 

(2) shift theorems; 

(3) final value theorem. 


Linearity 


Let f and g be two functions of t and let k be a constant which may be negative. Then 


L{f + st = Lif} + Lfg} 
Ltkf} = kL{f} 


The first property states that to find the Laplace transform of a sum of functions, we 
simply sum the Laplace transforms of the individual functions. The second property 
says that if we multiply a function by a constant k, then the corresponding transform 
is also multiplied by k. Both of these properties follow directly from the definition of 
the Laplace transform and linearity properties of integrals, and mean that the Laplace 
transform is a linear operator. Using the linearity properties and Table 21.1, we can find 
the Laplace transforms of more complicated functions. 


Find the Laplace transforms of the following functions: 


(a) 3+2¢ (b) 522 -2e! 


(a) £L{3 + 2r} = L{3} + L{2z} 
= 3£{1} + 2L{t} 
3 2 
ss 
(b) L{5t2 —2e'} = L517} + L{-2e} 
= 5L{t7} — 2L{e'} 
10 2 


With a little practice, some of the intermediate steps may be excluded. 


Find the Laplace transforms of the following: 
(a) 5cos 3t +2sin5¢ — 6f° (b) —e' + }(sint + cost) 


5s 4 10 36 
se+9  s2425  s4 


(a) L{5cos 3t +2 sin 5t — 67} = 
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sint + cost —1 1 Ss 
b) £i-e* = 
(®) | v9 ce1 284 22S) 
—1 l-+s 


S41 Gee) 


21.4.2 First shift theorem 


If L{f(t)} = F(s) then 
L{e“ f(t)} = F(s +a) a constant 


We obtain F'(s + a) by replacing every ‘s’ in F(s) by “s + a’. The variable s has been 
shifted by an amount a. 


Example 21.5 (a) Use Table 21.1 to find the Laplace transform of 
f@) =tsinSt 
(b) Use the first shift theorem to write down 
Lie 't sin 5t} 


Solution (a) From Table 21.1 we have 
: 10s 
L{f(t)} = L{t sin 5t} = @a 25) = F(s) 
(b) From the first shift theorem with a = 3 we have 
Lee" fO} = F(s +3) 
10(s + 3) 
(3)? 29)" 
— — :10(s +3) 
- Bae $30) 


Example 21.6 The Laplace transform of a function, f(t), is given by 
2. 1 
ree 
s(s +1) 
State the Laplace transform of 


(a)e f(t) (b) e* fF) 


Solution (a) Use the first shift theorem with a = 2. 
Li{e” f(t)} = F(s +2) 
— -2(s+2)4+1 
~ (s+2)6+4+241) 
_ 2s+5 
~ (gs +2)(s +3) 
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(b) Use the first shift theorem with a = —3. 


Lie" f(t)} = F(s — 3) 
2(s—3)+4+1 
~ (s—3)s—-3+D) 
2s —5 
~ (s—3)(s—2) 


21.4.3 Second shift theorem 


If L{f()} = F(s) then 
Lfu(t — d) f(t — d)} =e “F(s) d>0 


The function, u(t — d) f(t — d), is obtained by moving u(t) f(t) to the right by an 
amount d. This is illustrated in Figure 21.1. Note that because f(t) is defined to be 
0 fort < 0, then f(t —d) = 0 fort < d. It may appear that u(t — d) is redundant. How- 
ever, it is necessary for inversion of the Laplace transform, which will be covered in 
Section 21.6. 


u(t fo u(t — d) f(t—d) 
(a) i (b) d t 
Figure 21.1 


Shifting the function u(t) f(t) to the right by an amount d yields the function 
u(t —d)f(t—d). 


2. 
Example 21.7. Given L{f(1)} = a find L{u(t — 2) f(t —2)}. 
Ss 
Solution Use the second shift theorem with d = 2. 


2se775 


s+9 


L{u(t — 2) f(t —2)} = 


—3s 


Example 21.8 The Laplace transform of a function is © Find the function. 


a 
Solution The exponential term in the transform suggests that the second shift theorem is used. Let 


es 


—sd = 
e “F(s) = 2 
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21.4.4 


Example 21.9 


Solution 


1 
so that d = 3 and F(s) = aa If we let 
Ss 


1 
LUFOL=F@)= 5 


S 


then f(t) =t and so f(t — 3) =t — 3. Now, using the second shift theorem 


—3s 


e3 
Liu(t — 3) f(t — 3)} = L{u(t — 3)(t — 3)} = a 
Hence the required function is u(t — 3)(¢ — 3) as shown in Figure 21.2. 


AO 


Figure 21.2 
3 t — The function: f(t) = u(t — 3)(t — 3). 


Final value theorem 


This theorem applies only to real values of s and for functions, f(t), which possess a 
limit as tf > oo. 


The final value theorem states: 
lim sF(s) = lim f(t) 
se [f= A686) 
Some care is needed when applying the theorem. The Laplace transform of some func- 


tions exists only for Re(s) > O and for these functions taking the limit as s — 0 is not 
sensible. 


Verify the final value theorem for f(t) =e”. 


ji, ee dt exists provided Re(s) > —2 and so 


F(s) = Re(s) > —2 


s+2 


Since we only require Re(s) > —2, it is permissible to let s — 0. 


lim sF(s) = lim = 
s 0 so0 s+ 2 


Furthermore 


lim e-7 =0 


t>oo 


So lim,_,) sF(s) = lim f(t) and the theorem is verified. 


too 


1 


1 


EXERCISES 21.4 
Find the Laplace transforms of the following 
functions: 
(a) 312 — 4 (b) 2sin4¢+11—t 
(c)2— 12 +214 (d) 3e% +4sint 


@ = sind 4 : 
€) 3 sin —4c08\ 5 


(f) 3r47e% +1 
(g) sinh 2t + 3 cosh 2t 
(h)e~ sin 3t + 4e~ cos 3t 


The Laplace transform of f(t) is given as 
F(s) 357 — 1 
s) = = 
sts+1 


Find the Laplace transform of 


(aje*fe) (b) eft) © ef) 


Solutions 

6 4 8 ll 

anya by) 
(a) 3 RY (b) ig s 
~ 22 48 @ a 
a See 

s gs 3g9 gs—-2 gs24+1 

1 16s 72 1 
256° tai 3 Gu Se 

3s+2 4s +7 

h Pane eee eames 

@) a4 ®) G+De+9 


AND INTEGRALS 
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3. Given 


4s 
L{f(t)} = Pare 
find 
(a) L{u(t— 1) f(t —1)} 
(b) L{3u(t — 2) f(t — 2)} 


ft —4) 
2 


(c) c{w —4) 


4 Find the final value of the following functions using 
the final value theorem: 


(a) f(t) =e‘ sint 
(b) fi) =e7%4+1 
(c) f(t) =e * cost +5 


BG) aa & ies 18s + 26 
se+3s+3 sé —5s4+7 
12s* + 125-1 
©) Gary es47 
BM a) 4se~5 (b) 12s e725 (c) 2ye-™ 
s+] s+] s+ 


4 (a) 0 (b) 1 (c) 5 
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In later sections we shall use the Laplace transform to solve differential equations. In 
order to do this we need to be able to find the Laplace transform of derivatives of func- 
tions. Let f(t) be a function of t, and f’ and f” the first and second derivatives of f. The 
Laplace transform of f(t) is F(s). Then 


L{f'} = sF(s) — f(O) 


Ap} — sb (5) — sf (0) — FO) 


where f(0) and f’(O) are the initial values of f and f’. The general case for the 
Laplace transform of an nth derivative is 


L{f™) = s"F(s) — s"-f(0) — s*?f' (0) —--- — f° 0) 
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Another useful result is 


¢{ [70 | ee 
0 Ss 


Example 21.10 The Laplace transform of x(t) is X(s). Given x(0) = 2 and x’/(0) = —1, write expres- 


Solution 


sions for the Laplace transforms of 


(a) 2x” — 3x’ +x (b) —x" + 2x°+x 


L{x'} = sX(s) — x(0) = sX(s) —2 

Li{x"} = s?X(s) — sx(0) — x’(0) = s?X(s) — 25+ 1 

(a) L{2x” — 3x +x} = 2(s?X(s) — 2s +1) — 3(sX (8) — 2) + X(s) 
= (2s? —35+1)X(s) —454+ 8 

(b) L{—x” + 2x’ + x} = —(s?X(s) — 25 + 1) + 2(sX (8) — 2) + X(s) 
= (—s* + 25+ 1)X(s)+25—5 


Engineering application 21.1 


Voltage across a capacitor 


The voltage, u(t), across a capacitor of capacitance C is given by 


1 ibs 
uF) == i(t) dt 
= / i(t) 
Taking Laplace transforms yields 


Ves) — at) 
where V(s) = L{v(t)} and I(s) = L{i(t)}. 


Engineering application 21.2 


Frequency response of a system 


The Laplace variable s is sometimes referred to as the generalized or complex fre- 
quency variable. It consists of a real and an imaginary part, where s = o + jo. If 
only the sinusoidal steady-state response to a sine wave input for a system is required 
then we can obtain this by putting 0 = 0 into the expression for s and so s = jw. 
Thus it is possible to make this substitution in any transfer function given in terms of 
the Laplace variable s to obtain the sinusoidal steady-state frequency response. We 
will not prove this assertion here for reasons of brevity. Instead we will demonstrate 
its usefulness. 


1 


EXERCISES 21.5 


The Laplace transform of y(t) is Y(s), y(O) = 3, 
y’(0) = 1. Find the Laplace transforms of the 
following expressions: 


(a) y’ 

(c) 3y”—y’ 

(d) y’ + 2y’ 4+ 3y 
(e) 
(f) —4y” + Sy’ — 3y 


3y" a, y + 2y 


(g) 


(h) 
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Consider the Laplace transform of the equation for the voltage across a capacitor 
of capacitance C. Recall from Engineering application 21.1 that 


1 
Wo) = a” 


Rearranging this expression gives a formula for the impedance of the capacitor, Z(s), 
in the s domain 


He Oe 

I(s) Cs 
This form of the equation can be used to determine the behaviour of the capacitor in 
a variety of situations where the voltages and currents can take a number of forms, 
for example step inputs, sine waves, triangular waves, etc. However, if we are only 
interested in the sinusoidal steady-state response of the system when all transients 
have decayed, then it is possible to substitute s = jw into the expression for Z(s) 

V(jo) 1 
— I(jo) joc 
This form of the impedance is the one regularly used in a.c. circuit theory. It gives 
the impedance of the capacitor in terms of the capacitance and the angular frequency 
w, where w = 27f. (This result has already been discussed in the context of phasors 
on page 342). 

The same substitution can be made to obtain the frequency response of a system, 
given its transfer function. The system frequency response is something that can usu- 
ally be obtained easily by experiment. All that is required is a signal source of a known 
amplitude and a means of measuring the output amplitude and phase relative to the 
original signal. Measurements of this sort can reveal some of the properties of the 
system in question reflecting the close relationship between the transfer function and 
the frequency response. 


Z(jo) 


(b) y” 


2 Given the Laplace transform of f(t) is F(s), 
f() = 2, f’(0) = 3 and f”(0) = —1, find the 
Laplace transforms of 
(a) 3f'=2/ Ob af =f br 
ex” (@) 2p" — f +4f" —2f 
3 (a) IfF(s)=L{f(}= ca e— f(t) dt, show using 
integration by parts that 
(i) L{f')} = sF(s) — Ff) 
Gi) Lf" ()} = s°F(s) — sf) — f’(0) 
(b) If F(s) = L{f(t)} prove that 
Le" f@))} = F(s +a) 


1 
Deduce L{re~'}, given L{t} = >. 
s 
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Solutions 
1 (a) s¥-3 (h) (4s? — 8s + 6)¥ — 12s + 20 
(b) s?¥ —3s—1 2 (a) 3s—2)F -—6 
(c) 3s°¥ —s¥Y —9s ii) Ge =e Fr 63-7 
(d) (s?+25+3)¥ -3s—7 (c) s8F —2s? 3941 
(e) Gs —s+2)¥ — 9s (@) (283 —s? +45 —2)F — 4s? — 4s —3 
(f) (—4s? +5s—3)¥ +12s—11 1 
(g) (s* + 6s+8)¥ — 9s — 21 3 (b) Gap? 
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Example 21.11 


Solution 


As mentioned in Section 21.5, the Laplace transform can be used to solve differential 
equations. However, before such an application can be put into practice, we must study 
the inverse Laplace transform. So far in this chapter we have been given functions of t and 
found their Laplace transforms. We now consider the problem of finding a function f(t), 
having been given the Laplace transform, F'(s). Clearly Table 21.1 and the properties of 
Laplace transforms will help us to do this. If £{f(t)} = F(s) we write 


fOH=L7F OQ) 


and call £~! the inverse Laplace transform. Like £, £~' can be shown to be a linear 
operator. 


Find the inverse Laplace transforms of the following: 
stl 


(d) Pa 


2 16 
al ofS © (e) 
Ss S 


s 
s+] set] 


2 
(a) We need to find a function of t which has a Laplace transform of —. Using 
83 


2 
Table 21.1 we see es] =P. 
2 


(b) ett =sc{ 5 = 8° 


{stl 7 s a 1 : 
(e) £ ‘fest ie Horo} +c {| =eose + sing 


In parts (a), (c) and (d) we obtained the inverse Laplace transform by referring directly 
to the table. In (b) and (e) we used the linearity properties of the inverse transform, and 
then referred to Table 21.1. 


Example 21.12 


Solution 


Example 21.13 


Solution 


21.6 Inverse Laplace transforms 639 


Find the inverse Laplace transforms of the following functions: 


10 (s+ 1) 15 
@ Gap =O) Gage) © GD 
af 10 | - 10 +f 6 | - 583 et 
ls [Ss = 6" loan f7 3 


(b) a etre] i | as 
(s+1)?+4 (s+ 1)? + 22 


(c) ci} =se{__- | = 5e’ sinh 3t 


The function is written to match exactly the standard forms given in Table 21.1, with 
possibly a constant factor being present. Often the denominator needs to be written in 
standard form as illustrated in the next example. 


=e‘ cos2t 


Find the inverse Laplace transforms of the following functions: 


s+3 2s+3 
ee bj) 
Ome oO Sresap | 


s—l 
2s? + 854+ 11 


(a) By completing the square we can write 
s+ 6s +13 = (s+3)?+4= (9+ 3)? +2’ 


Hence we may write 


-1 s+3 -1 s+3 31 
= =e cos2t 
s? + 6s + 13 (s + 3)? + 22 
2s+3 2s+3 25 +6 3 
(b) = = 2 2 — 2 2 2 2 
so+6s+13  (s+3)?+2 (s+3)*+2 (s+3)?+2 


=2( s+3 ) ( 2 ) 
Ast 3% 427 2X (643)? +2? 


The expressions in brackets are standard forms so their inverse Laplace transforms 
can be found from Table 21.1. 


pa 25+3 i eae 3e-* sin 2t 
s? + 6s + 13 2 
(c) We write the expression using standard forms: 
s—l _ 1 s—1 _ 1 s—1 
2824+ 85411 25274+484+55 2(9+2)2415 
=H S+2 3 
~ 2\(s+2)24+1.5 (9 +2)? +15 
i s+2 3 V1.5 
~ 2NG4+22 415 VTS (6422 +15 
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Having written the expression in terms of standard forms, the inverse Laplace trans- 


form can now be found: 


s—l 


| 


2s? + 85+ 11 


I=: 


Ga cos V7 1.5f — 


3 
V1.5 


e 7 sinv 3) 


In every case the given function of s is written as a linear combination of standard 
forms contained in Table 21.1. 


By using the standard forms in Table 21.1 find the 


inverse Laplace transforms of the following functions: 


(a) 
(c) 


AloOn|s 


(kK). 5 


(q) 


Solutions 


1 

8 

6 eo" 
Jest 
cos 3t 
5 sin 4t 


8te~2" 


2 
(b) 3 


(d) 


(bh) =F 


(p) 


(b) 
(d) 
(f) 
(h) 
@ 
ce) 
(n) 


—2t 


5 ett 
sin 2t 

6 cos 2t 
2te 


9 
2 31 
2 


(w) 


(y) 


2 
Gil2—4 
2 
G43 o4 


(x 


@ 5 


s+3 
(s+ 3)? -—4 
6s 
=5 


) 


2 Find the inverse Laplace transforms of the following 
functions: 


(a) 


(c) 


©) a7 


(g) 


(i) 


(k) 


(m) 


(0) 
(q) 
(s) 
(u) 
(w) 
(y) 


s+4 
(s+ 4)? +1 
6s + 17 
(s+4)?4+1 
0.5 
(s + 0.5)? 
6s+9 
s2?+25+10 


u(t — 3) 
é(t—9) 
sinh 4¢ 

4 sinh 3t 
e~‘ sinh 2t 


e7 > sinh 2t 


O53 


(d) 


(h) 


(n) 


(p) 
(r) 
(t) 
(v) 
(x) 
(2) 


4 1 


1 
3(s + 2) 
s—6 
s—4 

5 
(s+4)?4+1 

Ss 
st+25+7 

s+5 
s? + 85 + 20 

TIs+3 
se+4s+8 


(f) 


) 


() 


4u(t — 6) 
46(t — 8) 
cosh 3t 

6 cosh J/8t 
e— > cosh 2t 
6cosh J/5t 


21.7 Using partial fractions to find the inverse Laplace transform 641 


3 ? (i) 6e7 cost — Je sint 
2 (a) 5 (bl) 4-— 
2 2 1 
ent (Gj) et cos /6t — —=e~ sin V6r 
(c) 30¢ (d) V6 
3 te—t/2 
(k) 


7 
(e) 3cos 3t — 3 sin 3t 


4 


1 
(I) e7*’ cos 2t + a e sin 2t 


(f) 8(4) -—2e™ 
(g) e~ cost (m) 6e~ cos 3¢ + ee sin 3t 

ins (n) 7e~2! cos 2t — — e~”" sin 2r 
(h) 5e-* sint 2 


Yai ~=USING PARTIAL FRACTIONS TO FIND THE INVERSE 
LAPLACE TRANSFORM 


The inverse Laplace transform of a fraction is often best found by expressing it as its par- 
tial fractions, and finding the inverse transform of these. (See Section 1.7 for a treatment 
of partial fractions.) 


Example 21.14 Find the inverse Laplace transform of 


4s—1 6s +8 
pip 
@) es 6) s?+3s4+2 


4s — 


1 
Solution (a) Weexpress as its partial fractions: 
Ss 


se 
4s—1  4s—1 A B 
s@—s  s(s—1) s s—l 


Using the technique of Section 1.6 we find that A = 1, B = 3 and hence 
4s —1 _ 1 3 


s—s gs s—l 


The inverse Laplace transform of each partial fraction is found: 


1 
ci(*)=1, oa =; ) =3¢ 
s s- 


Hence 


Aig ef 
rae : )=143e 
= $ 


32 
(b) The expression is expressed as its partial fractions. This was done in Section 1.7. 


6s+8 2 4 4 
e+3st+2 st+1 s42 


The inverse Laplace transform of each partial fraction is noted: 


2 4 
c( =) =2e" ras =) =4e7 
S S 
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Example 21.15 


Solution 


Example 21.16 


Solution 


EXERCISES 21.7 


Express the following as partial fractions and hence 
find their inverse Laplace transforms: 


(a) 5s +2 


() 3s+4 


(c) 4s +1 


(s+ 1)(s +2) 
(s + 2)(s +3) 


(s+ 3)(s +4) 


Chapter 21 The Laplace transform 


and so 
6s+8 


aero =o eer 


3s? + 65 +2 
Find the inverse Laplace transform of ean 
s? + 352 + 25 
Following from the work in Section 1.7.1, we have 


357 +65 +2 1 1 7c 1 
P4+3s24+25° 5s stl s+2 


The inverse Laplace transform of the partial fractions is easily found: 


357 +65 +2 


=| -t =2t 
eee Sy 
eres, ee 


3s? + lls +14 
gs? +25? — lls — 52° 


Find the inverse Laplace transform of 


From Example 1.33 we have 
3s* + lls + 14 _ 2 s+3 
+25? -lls—52) s—4) 98 +65413 


We find the inverse Laplace transforms of the partial fractions: 


al pwd =2e" 
a 


= aie =e *' cos 2t 
s?+ 65+ 13 


So 
4 3s* + lls +14 
gs? + 252 — 1ls — 52 


= 2e" +e-* cos 2r 


6s —5 
(s+ 5)(s + 3) 
4s+1 
s(s + 2)(s + 3) 
Is +3 
s(s + 3)(s +4) 


6s+7 


© e4+D6F4 


21.8 Finding the inverse Laplace transform using complex numbers 


8s —5 

(s+ 1)(s + 2)(s +3) 
3s+5 

(s + 1)(s? +3842) 
2s —8 

(s + 2)(s? + 7s + 6) 


2 Express the following fractions as partial fractions 
and hence find their inverse Laplace transforms: 


(a) 


(b) 


3s+3 


(s — 1)(s +2) 


5s 


(s + 1I)@s — 3) 


Solutions 
1 (a) 8e-% —3e7 
(b) 5e7*! —2e7% 
(c) 15e7“ — lle 
(d) 35 et 23 —3t 
2 5, 
i ee 
(e) 6 +5 e _ 3 e 
1 25 
(f) ri 4 6e7>! ral 
ee: ee 
@® gtae 8° 
(h) 21e~7 _ 29 —3t 13 e? 
2 2 
(i) e+ 2tet — e-2! 


) 


3e72t = et = Jet 


2s+5 
s+2 

(a) set 4s+4 
so + 2s? + 5s 

l-s 

(s + 1)(s? + 25 +2) 

s+4 

a agee 

2(s? — 3s? +5 —1) 

(s2 +45 + 1)(s? +1) 
3s7 —s +8 

(s? — 2s + 3)(s +2) 


(c) 


(a) 2e! +e7?! 


31/2 
(b) ef + —— 


(c) 26(t) + e727 

1 a 1 leer eae 
(d) gate cos 2t + = e sin2r) 
(e) e “(2 — 2cost — sint) 


(f) e~7*(1 + 28) 


V3 
(h) ef (cos /2t + V/2 sin /2t) + 2e72" 


8 
(g) e 7! ( cosh /3t — —= sinh vi) —cost 


F218 FINDING THE INVERSE LAPLACE TRANSFORM USING 


Example 21.17 


COMPLEX NUMBERS 


643 


In Sections 21.6 and 21.7 we found the inverse Laplace transform using standard forms 
and partial fractions. We now look at a method of finding inverse Laplace transforms 
using complex numbers. Essentially the method is one using partial fractions, but where 
all the factors in the denominator are linear — that is, there are no quadratic factors. We 
illustrate the method using Example 21.13. 


s+3 
— b 
(a) s2+6s4+13 (6) 


s?+ 65+ 13 


Find the inverse Laplace transforms of the following functions: 


s—l 


©) 5248411 
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Solution (a) We first factorize the denominator. To do this we solve s? + 6s + 13 = 0 using the 


formula 
_ 6+ V36— 403) 
_ 2 
—6+./-16 
~ 5 
S624) 
~ 2 
= —3+2)j 


It then follows that the denominator can be factorized as (s — a)(s — b) where 
a = —3 + 2j and b = —3 — 2j. Then, using the partial fractions method 
s+3 s+3 A B 


2 +6s+13 (s —a)(s — b) eg aad 


The unknown constants A and B can now be found: 
s+3=A(s—b)+B(s—a) 
Puts =a=-—3+42j 


2) = A(—3 + 2j — b) = A(4j) 


1 
A=- 
2 


Equate the coefficients of s 
1=A+B 

1 
B= 


So, 


st3 if 
et+6st+13 2\s—-a s—b 


The inverse Laplace transform can now be found: 


s+3 1 1 1 
ie — Pe ae 
laser | 2 {—+—5} 


1 ga 4 el”) = 1 asp ae ef 3-2i)r) 
2 2 


Loe (e7i! oh ei) 
2 
1 
=5 e * (cos 2t + j sin 2t + cos 2t — j sin 2r) 


=e cos 2t 
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2s+3 2s +3 A B 
(b) = = ee 
s* + 65+ 13 (s—a)(s—b) s—-a_ s—b 


where a = —3 + 2j and b = —3 — 2). Hence, 


2s+3=A(s—b)+B(s—a) 
Puts =a=-—3+42j 


—~3 + 4j = A(4j) 
A=1+0.75j 


Equate the coefficients of s 


2=A+B8B 
PS i075) 
Hence, 
2543 140.75) 10.753 
e+6s+13° s—a s—b 


Taking the inverse Laplace transform yields 


“ta | = (1+ 0.75j)e” + (1 — 0.75j) e” 
= (1 + 0.75j) 7" + (1 — 0.75j) 
= (1 + 0.75j) e~* (cos 2t + j sin 2t) 
+ (1 — 0.753) e "(cos 2t — jsin 2r) 


=e" (2cos 2t — 1.5 sin 2r) 


wm 


( s—1 1 s—1 1 s-—1 
Cc — — 
2s? + 85+ 11 2\s2+4s4+5.5 2\(s —a)(s — b) 
where a = —2+4+ V1.5j, b = —2 — V'1.5j. Applying the method of partial fractions 


produces 


s—1 _ A B 
(s—a)(s—b)  s—a s—b 


Hence, 
s—1=A(s—b)+ B(s —a) 

By letting s = a, then s = b in turn, gives 
A=054+V1.5j) B=05- V1.5; 


Hence we may write 


s—1 ean, OS wl 
(s-al(s=—5)— S-—a s—b 
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Taking the inverse Laplace transform yields 


c{ I = (0.54 Vi 5j)e" + 0.5 — VI.5j) 


(s — a)(s — b) 

= (054 VT.5j) 21 Sit 
+ (0.5 —9/1.5j)e5° 4" 

=e 7{(0.5 + V1.5j) ev it 
+ (0.5 —V1.5j) eH} 

=e {(0.5 + /1.5j) (cos V1.5t + j sin V1.5) 
+ (0.5 — V1.5j) (cos V1.5t — j sin /1.5t)} 

=e (cos V/1.5t — 2V1.5 sin V'1.5t) 


Hence 


= | —2t 
els A z | = “5 (cos V1.5t — 2V1.5 sin V1.5t) 
S S 


As seen from Example 21.17, when complex numbers are allowed, all the factors in the 
denominator are linear. The unknown constants are evaluated using particular values of 
S or equating coefficients. 


EXERCISES 21.8 
1 Express the following expressions as partial fractions, (c) = (d) Sts+1 
using complex numbers if necessary. Hence find their (s2/2) —s+5 s? —25s+3 
inverse Laplace transforms. 
@ — 382? oy 284! Ca 
s?+6s+13 s? —2s+2 —s° + 2s —5 
Solutions 


3411j/4 3-11)/4 
a) 
—a s—b 
where a = —3 + 2j,b = —3 — 2), 


im 


(d) 14 


3 —j/2V2) n $4 5/2V2) 
s—a s—b 


11 where a = 1+ /2j,b=1— V2), 
37 ; 
e [3 cos 2t — a sin 21 


1 

[378-14 3)2 3(t) +e! ( 3cos V2t + — sin V2t 
(b) + V2 

sS—a s—b 

wherea = 1+j,b=1-—-j, -~145j/4  —1—5j/4 

e' (2 cost +3 sin) () eA + s—b 
(goes aa 42 where a = 1 +2j,b = 1 —2j, 

S—a = 


where a = 1+ 3j,b= 1 — 3j, 


5 
16 e' (—2 cos 2t — y sin 2t) 
e! (4 cos 3t — = sin xr) + 28(t) 
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P29 THE CONVOLUTION THEOREM 


Let f(t) and g(t) be two piecewise continuous functions. The convolution of f(t) 
and g(t), denoted (f * g)(t), is defined by 


(f * g)(t) =| f(t — v)g(v) dv 


Example 21.18 Find the convolution of 2f and f°. 
Solution f(t) = 2t, g(t) =f, f(t —v) = 2(¢ — v), g(v) =v’. Then 


t t 
arti = [2¢-vysdv=2 f tv? — v* du 
0 0 


It can be shown that 


fre=ee7 
but the proof is omitted. Instead, this property is illustrated by an example. 
Example 21.19 Show that f * g = g* f where f(t) = 2r and g(t) = 9°. 


Solution /f *« g=2r*t? = 1/10 by Example 21.18. From the definition of convolution 


(g* JO = [ g(t — v) f(v) du 


We have g(t) = f°, so g(t — v) = (t — v)*, and f(v) = 2v. Therefore 
id 
ge f=P «t= / (t — v)?2udu 
0 


t 
= / ( — 307v + 3tv? — v?)2u du 
0 


£ 
2 | tv — 3t7v? + 3tv® — vt du 
0 


fo apie 3tut =| _# 
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For any functions f(t) and g(t) 
fegagxf 


21.9.1 The convolution theorem 


Let f(t) and g(t) be piecewise continuous functions, with L{f(t)} = F(s) and 
L{g(t)} = G(s). The convolution theorem allows us to find the inverse Laplace trans- 
form of a product of transforms, F (s)G(s): 


L'{F(s)G(s)} = (f * 8)(t) 


Example 21.20 Use the convolution theorem to find the inverse Laplace transforms of the following 
functions: 


1 3 
@ Canes - eae 


Soluti Let F(s) = i G(s) = 
olution (a) Let F(s) = Oo (s) = ae 
Then f(t) = L-'{F(s)} =e”, o(t) = L7'N{G(s)} =e*. 
-1 1 a, po = 
L [=| = LF (s)G(s)} = (f * 9) (0) 
=} ee 20-% e—3u dv =, eet e73¥ dv 
0 0 


t 
=% = 
=, ee 'du 
0 


= e[-e "Jf = e 7 (—e7 4 1) = e 2 _ et 


(b) Let F(s) = = G(s) = : Zz Then f(t) = 3, g(t) = + sin 2t. So, 
S 


s2 + 


3 
cl | = LF (s)G(s)} 


= Ff *3)() 


£ is) 3 t 
-|/ qa “aw =5 f sin 2u dv 
0 2 2 Jo 


_ 3[—cos2u I 21) 
A ge ee 


1 


2 


3 
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EXERCISES 21.9 


Find 

(a) e727 xe? 

(b) 12 xe7> 

Find f * g when 

(a) f=le=t 

(b+) f=r.g=t 

(7 Se 251 

(d) f=sint,g=t 

In each case verify that 

Lif} x Ligh = Lif * 3}. 
1 1 1 


If F(s) = sa? G(s) = ~ and H(s) = 43 


Solutions 


(a) et e72t 


? 2% 2 2e% 


3 9 7 2] 27 

oe  * 

ae a 
12 


(d) t —sint 


(b) 


use the convolution theorem to find the inverse 
Laplace transforms of 


(a) F(s)G(s) 
(b) F(s)H(s) 
(c) G(s)H(s) 
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Use the convolution theorem to determine the inverse 


Laplace transforms of 


1 
@ Joa 
1 
©) G436—2) 
1 

© Grip 

of — en 3/2 ime ae2 
(a)e’ —1 (b) ——— (c) 3 

e2t _ eo 3t 

@)t-14e% — ) SS 


sint — ft cost 


(c) 5 


SOLVING LINEAR CONSTANT COEFFICIENT 


DIFFERENTIAL EQUATIONS USING THE LAPLACE 


TRANSFORM 


So far we have seen how to find the Laplace transform of a function of time and how to 
find the inverse Laplace transform. We now apply this to finding the particular solution 
of differential equations. The initial conditions are automatically satisfied when solving 
an equation using the Laplace transform. They are contained in the transform of the 


derivative terms. 


The Laplace transform of the equation is found. This transforms the differential equa- 
tion into an algebraic equation. The transform of the dependent variable is found and then 
the inverse transform is calculated to yield the required particular solution. 


Example 21.21 Solve the differential equation 


ox 0 
e 


using Laplace transforms. 


x(0) =3 
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Solution The Laplace transform of each term is found: 
dx 
L(x) = X(s), c($) = sX(s) — x(0) = sX(s) —3 


Note that we write X(s) to draw attention to the fact that X is a function of s; X(s) 
does not mean X multiplied by s. Taking the Laplace transform of the equation 
yields 


sX(s) —3 4+ X(s) =0 
The equation is rearranged for X(s): 


sX(s)+X(s) =3 
(s+1)X(s) =3 
X(s) = 


stl] 


Taking the inverse Laplace transform of both sides of the equation gives 


x(t) =3e7 


Example 21.22 Solve 
OF ia Get x(0) = 3 
dt 


using the Laplace transform. 


Solution The Laplace transform of both sides of the equation is found: 


sX(s) —x(0) + X(s) = — 
sX(s)-—34+X(s) = s 
s—2 
(s+ 1)X(s) = er 
s—2 
1 
(6+ DX(s) = “SFY 
3 
X(s) = 3 


Taking the inverse Laplace transform yields 


x(t) = 3e7 
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Engineering application 21.3 


RL circuit with ramp input 


Recall from Engineering application 19.3 the differential equation for an RL circuit 
with a unit ramp input is given by 


di 
R+L—=t t20 i(0)=0 
dt 
Use the Laplace transform to solve this equation. 


Solution 
Let 


L(i(t)) = I(s) 


Then 
ee 
£($) = 16) ~ 10) = 16) —0 = 51 
From Table 21.1 we see that 
1 


We note that R and L are constants. We can now take the Laplace transform of the 
given equation. This gives 


1 
I(s)R + LsI(s) = = 
s 
This equation is solved for I(s): 


I(s)(R+ Ls) = 5 
1 
s?(R + Ls) 
In order to take the inverse Laplace transform and hence find i(t) we express /(s) as 
the sum of its partial fractions. The expression 


1 
s2(R + Ls) 


Ko) = 


has a repeated linear factor, s? in the denominator, giving rise to partial fractions 
A B 
ss 
The linear factor, R + Ls, gives rise to a partial fraction of the form 
€ 
R+Ls 


Hence 


Tee eyes 
S(R+Ls) ss? R+Ls 


KS) = 
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The unknown constants, A, B and C, need to be expressed in terms of the known 
constants R and L. Multiplying the above equation by s*(R + Ls) yields 


1 =As(R+ Ls) + B(R+ Ls) + Cs? (1) 
Letting s = 0 in Equation (1) gives 
1] = 1BIR 
1 
and so B = —. 
R 


R R 
We note that when s = — 7 then R+Ls is 0. Hence letting s = — i in Equation (1) 
gives 
R\? ee 
1 =C|——}] from which C = — 
L R 
Comparing the coefficient of s on both sides of Equation (1) gives 
0=AR+ BL 
BL E 
A = - eS 
R R? 
Substituting the expressions for A, B and C into the expression for /(s) gives 


I(s) =—- Z + : ar i 
R?s Rs? -R2(R+Ls) 
In readiness for taking inverse Laplace transforms we write the final term as follows: 
Le iP az L 
R2(R+Ls) R2L(R/L+s)  R2(R/L+s) 
Hence 


(eo. 
= ide Re POayosee) 


Taking the inverse Laplace transform yields 


: ll, t Ih, 
1 anes pal ey t20 


This may be rearranged as 


W 
=) 


ig UL 
lee —(R/L)t 
i(t) = te (e Die at 


Example 21.23 Solve 


y’-y=-? — y(0) =2, y'(0) =0 


using the Laplace transform. 


Solution Let L(y) = Y(s). Then using the result stated in Section 21.5 we have 


L(y") = s°¥(s) — sy(0) — yO) = s°¥(s) — 2s 
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We also note that 


L£(-1”) = a 


33 
The Laplace transform of the differential equation is taken. This yields 


5 2 
sY(s) —2s—Y(s) =—- = 
Ss 


The equation is rearranged for Y (s): 


(s* —1)Y(s) = = +2s 
=-5 


2(s* — 1) 
= —— 

2(s? — 1)(s? +1) 
= 3 


By dividing the equation by (s* — 1) we obtain 
2(s? — 1)(s? + 1) _ 2(s? + 1) 


Y = = 
(s) s3(s? — 1) s 

_2 22,2 

~ 8 ' 8 gs g3 


Taking the inverse Laplace transform gives 


y(t) =24+2 


Example 21.24 Solve 
y+y—%¢=-2  y0)=2, YO)=1 
Solution Let L(y) = Y(s). Then 
L(y’) = s¥(s) — yO) = s¥(s) —2 
L(y") = #¥(s) — sy) — yO) = °¥(s) — 25-1 
We also note that 
2 
L£(-2) =-= 
s 
We can now take the Laplace transform of the differential equation to get 
2 
s°Y(s) — 2s —1+sY(s) —2—2Y(s) = —— 
S 
and so 
2 
(s? +5 —2)¥(s) = -—— +2543 
S 


2s* +35 —2 _ (2s — 1)(s+ 2) 
Ss 


s=)DG+2)%G)= 


Dividing the equation by (s — 1)(s + 2) gives 
(2s — 1)(s + 2) _ 2s—1 
s(s—1)(s+2)  s(s—1) 


¥(s) = 
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pee ee yg ; ’ 
The expression, ( y’ is written as its partial fractions: 
s(s— 


1 
Y(s) =—+ 
s sl 
Taking the inverse Laplace transform yields 
yf) =1+e 


Example 21.25 Solve 
x" 42x +2x=e7 x(0) = x(0) =0 


using Laplace transforms. 


Solution Taking the Laplace transform of both sides: 


s°X(s) — sx(O) — x’ (0) + 2(sX(s) — x(0)) + 2X(s) = i 
Therefore, 
(s* + 25 + 2)X(s) = = since x(0) = x’(0) = 0 
stl 
1 _ 1 
(s+1)(s?+25+2)  (s+1)(s—a)(s—b) 


where a = —1+j,b = —1 —j. Using partial fractions gives 


Xs) = 1 1 1 7 1 
ee | 2\s-a s—b 


Taking the inverse Laplace transform 


X(s) = 


1 
x(t) =e7— ae +e) 
met bei 4 ey 
2) 


—t et jt —jt 
=e = aoe +e“) 
-t 


Be = (cost +jsint + cost —jsint) 


=e. 


=e'—e ‘cost 


Example 21.26 Solve 
x" — 5x + 6x = 6t — 4 x(0) = 1, x(0) =2 


Solution The Laplace transform of both sides of the equation is found. Let L{x} = X(s). 


s°X(s) — sx(0) — x’(0) — 5(sX(s) — x(0)) + 6X(s) = _ -- 
332 
Pa=5oeGe 2 eg 
Ss 


s2 s? 
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Cee ed ee 
U= "262 —S5s+6) sXe —2)(8—3) 


A B Cc D 


The constants A, B, C and D are evaluated in the usual way: 
X(s) 1 + 1 * 3 2 

=> 

6s ss? 22(s—2) 3(s—3) 


Taking the inverse Laplace transform yields 


1 3» 2 3 
H=—fi+s =e" 
x(t) ae vee a" 


Engineering application 21.4 


Discharge of a capacitor through a load resistor 


In Engineering application 2.8, we examined the variation in voltage across a capac- 
itor, C, when it was switched in series with a resistor, R, at time t = 0. We stated a 
relationship for the time-varying voltage, v, across the capacitor. Prove this relation- 
ship. Refer to the example for details of the circuit. 
Solution 
First we must derive a differential equation for the circuit. Using Kirchhoff’s voltage 
law and denoting the voltage across the resistor by vg we obtain 

v+ Up = 0 
Using Ohm’s law and denoting the current in the circuit by 7 we obtain 

vp ke— 10 


For the capacitor, 


Combining these equations gives 
du 
mn) 
dt 
We now take the Laplace transform of this equation. Using £{v} = V(s) we obtain 

V(s) + RC(sV(s) — v(0)) = 0 

V(s)(1 + RCs) = RCv(0) 

RCv(O) v(0) 
V — = 
OS Tet oa 


v + RC 


Taking the inverse Laplace transform of the equation yields 
p= vier ey 0 


This is equivalent to the relationship stated in Engineering application 2.8. 


656 Chapter 21 The Laplace transform 


Engineering application 21.5 


Electronic thermometer measuring oven temperature 


Many engineering systems can be modelled by a first-order differential equation. 
The time constant is a measure of the rapidity with which these systems respond to 
a change in input. Suppose an electronic thermometer is used to measure the tem- 
perature of an oven. The sensing element does not respond instantly to changes in 
the oven temperature because it takes time for the element to heat up or cool down. 
Provided the electronic circuitry does not introduce further time delays then the dif- 
ferential equation that models the thermometer is given by 


du 


it 

dt 
where v,, = measured temperature, vu, = oven temperature, tT = time constant of the 
sensor. For convenience the temperature is measured relative to the ambient room 
temperature, which forms a ‘base line’ for temperature measurement. 

Suppose the sensing element of an electronic thermometer has a time constant 
of 2 seconds. If the temperature of the oven increases linearly at the rate of 3°C s~! 
starting from an ambient room temperature of 20°C att = 0, calculate the response of 
the thermometer to the changing oven temperature. State the maximum temperature 
error. 


a =P 0, = 0, 


Solution 
Taking Laplace transforms of the equation gives 


1 (sV,,(s) — Vp,(0)) + V-,(8) = V, (8) 


v,,(0) = O as the oven temperature and sensor temperature are identical at t = 0. 
Therefore, 


ESV AS) ar WS) = VO) 


Old) 


For this example, the input to the thermometer is a temperature ramp with a slope of 
3°Cs7!. Therefore, v, = 3t fort > 0: 


3 
Wo) = Lin. @)} = 2 (212) 
Combining Equations (21.1) and (21.2) yields 
3 3 
WA) = since tT = 2 


P(+ts)  s2(1 +25) 
Then using partial fractions, we have 


eee ee 
ee s 1+2s 


Taking the inverse Laplace transform yields 


Un =3t-6+6e°% +t¢230 
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This response consists of three parts: 


(1) a decaying transient, 6e°" 


, which disappears with time; 

(2) aramp, 3f, with the same slope as the oven temperature; 

(3) a fixed negative temperature error, —6. 

Therefore, after the transient has decayed the measured temperature follows the oven 
temperature with a fixed negative error. It is instructive to obtain the temperature error 
by an alternative method. Given that the temperature error is v,, then 


Us = Um — Vo 
and 
V.(s) = V,,(5) — V,(s) (21.3) 
Combining Equations (21.1) and (21.3) yields 
V.(s) =e) 
V. = -—vV. = —_— 21.4 
2(5) eee (5) cer (21.4) 
Combining Equations (21.2) and (21.4) yields 
— 3 3 
Oe Ts = ie 


1+tss2 s(1+Ts) 
The final value theorem can be used to find the steady-state error: 
; , ; —3ts 
heh OG) = ita so) = lhitn|| ———— | = =3¢ = —6 
too s>0 s30 s(1 ar TS) 


that is, the steady-state temperature error is —6°C. It is important to note that the 
final value theorem can only be used if it is known that the time function tends to a 
limit as t + oo. In many cases engineers know this is the case from experience. 


The Laplace transform technique can also be used to solve simultaneous differential 
equations. 


Example 21.27 Solve the simultaneous differential equations 


/ 


K+x+5=1 x(0) = y(0) =0 
ey ty =0 
a ty t= 


Solution Take the Laplace transforms of both equations: 
sY(s)—y(O) 1 
2  § 


+ sY(s) — yO) + Y(s) =0 


sX(s) — x(O) + X(s) + 


sX(s) — x(0) 
2 
These are rearranged to give 
sY (s) _ 1 


(s + 1)X(s) + 


sX (s) 
2 


+ (s+ 1)Y(s) =0 
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These simultaneous algebraic equations need to be solved for X(s) and Y(s). By 
Cramer’s rule (see Section 8.7.2) 


s+11/s 
2 0 —1/2 
Heel _ / 
s+1 5/2 (s + 1)? — s?/4 
s/2 stl 
i 1 
~  2|3s2/44+2s41 
i 4 
2 (3s? +8544 
= ail 4 
~— 2| Gs+2)(s +2) 
Using partial fractions we find 
1 3 1 
Y(s) = - 
2(3s+2 s+2 
2d 1 1 
~ 2 st2  s+2 
and hence 
1 
WH Gee") 
Similarly, 
4 1 1 1 1 
X(s) = (s+ ) — — 5) 
s3s+2)(s+2) 5s 2(s + 3) 2(s + 2) 
and so 


1 
x(t) = [= ae ae e 7) 


EXERCISES 21.10 
1. Use Laplace transforms to solve (d) Fic +8y=7 
dt , 
0) =6 
(a) x. +x =3, mm) 
x(0) =1 (e) y’+y +y=1, 
y(0) = 1, y'(0) =3 
dx 
(b) 3 oT + 4x = 2, 2 Use Laplace transforms to solve 
x(0) = 2 (a) x’ +x =2t, 


x(0) = 0,x/(0) =5 
(c) 2 +4y=1 
Pas ge =o (b) 2x" +x! —x = 27 cos 2t + 6sin2r, 
Mme: x(0) = —1,x(0) = -2 


(c) x" +2 —2x = 1-21, 
x(0) = 6,x'(0) =—11 
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f) x +x+y'+y=6e, 
x’ 4+2x—yl -y=2e7 
x(0) = 2, y(0) = 1 


t 
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(d) x” — 4x = 4(cos 2t — 1), 
x(0) = 1,x'(0) =0 3 Using Laplace transforms find the particular 
; solution of 
/_ 2x — yl + 2y = —207 +7 a d 
©) is * ded a! a? ~ Sq 7 = Me™ 
gz tata +y=? +6 3 
x(0) = 3, y(0) =6 satisfying y = 3 and 77 = 8 whent = 0. 
Solutions 
1 (a) x(t) =3-—2e7 (c) t+6e-7 
3° a 1 ee 1 
by 6 d 2r+1 
(b) rhe + 7 (d) , + a haa 
@ +a (e:) x= +3,y=6-t 
7 41 _» 6e! 7 6 e73t/2 
= A th= +2 : 
(d) y(t) gt ra (f) x(t) 5 € 5 
= : —31/2 
(e) y@) = 143,464.27 sin 0.8661 sty 28 amet © . 
2 (a) 3sint+ 2t Bet 1366 
(b) —3cos2t+2e7% m ~2te% + — ; 
GEXEER TRANSFER FUNCTIONS 


It is possible to obtain a mathematical model of an engineering system that consists of 
one or more differential equations. This approach was introduced in Section 20.4. We 
have already seen that the solution of differential equations can be found by using the 
Laplace transform. This leads naturally to the concept of a transfer function which will 
be developed in this section. Consider the differential equation 
dx(t) 

qT) 
and assume that it models a simple engineering system. Then f(t) represents the input to 
the system and x(t) represents the output, or response of the system to the input f(t). For 
reasons that will be explained below it is necessary to assume that the initial conditions 
associated with the differential equation are zero. In Equation (21.5) this means we take 
Xp to be zero. Taking the Laplace transform of Equation (21.5) yields 


x(0) = Xp (21.5) 


SX (S) —X) + X(s) = F(s) 


(1+ s)X(s) = F(s) assuming x) = 0 


so that 
X(s)_ 1 
F(s) 1+s 


The function, X(s)/F(s), is called a transfer function. It is the ratio of the Laplace 
transform of the output to the Laplace transform of the input. It is often denoted by 
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Example 21.28 


Solution 


G(s). Therefore, for Equation (21.5), 


G(s) = 


l+s 

The concept of a transfer function is very useful in engineering. It provides a simple 
algebraic relationship between the input and the output. In other words, it allows the 
analysis of dynamic systems based on the differential equation to proceed in a relatively 
straightforward manner. Earlier we noted that it was necessary to assume zero initial 
conditions in order to form the transfer function. Without such an assumption, the re- 
lationship between the input and the output would have been more complicated. What 
is more, the relationship would vary depending on how much energy is stored in the 
system at t¢ = 0. By assuming zero initial conditions, the transfer function depends 
purely on the system characteristics. Such an approach, whilst very convenient, does 
have its limitations. The transfer function approach is useful for analysing the effect of 
an input to the system. However, if one requires the effect of the combination of a system 
input and initial conditions, then it is necessary to carry out a full solution of the differ- 
ential equation as we did in Section 21.10. In practice there are many cases where the 
simplified treatment provided by the transfer function is perfectly acceptable. An alter- 
native approach is provided by state-space models, which we examined in Section 20.4. 
These provide a natural way of handling initial conditions. In order to solve a state-space 
model where there are initial conditions, all that is necessary is to use a non-zero initial 
value of the state vector. Recall that in Section 20.4 we only set up the state-space models 
and did not solve them. However, the method for the standard solution of a state-space 
model can be found in many advanced textbooks on control and systems engineering. 


Find the transfer functions of the following equations assuming that f(t) represents the 
input and x(t) represents the output: 


dx(t) 
(a) aa 4x(t) = 3f(t), x(0) =0 

Px(t) x(t) _ dx(0) - 
(b) ap +3 a x(t) = f(t), 7 0, x(0) =0 


(a) Taking Laplace transforms of the differential equation gives 


sX(s) — x(0) — 4X(s) = 3F(s) 
(s — 4)X(s) = 3F(s) as x(0) = 0 


X(s) _ 
Fa ea 
(b) Taking Laplace transforms of the differential equation gives 
s°X(s) — sx(0) — _ + 3(sX(s) — x(0)) — X(s) = F(s) 
(s* + 3s — 1)X(s) = F(s) as =0 and x(0) =0 
X(s) _ 1 
Fay Passo 1 


When creating a mathematical model of an engineering system it is often convenient to 
think of the variables within the system as signals and elements of the system as means 


R(s) | 2 | Ys) 
G(s) 


Figure 21.3 


The relationship 
Y(s) = G(s)R(s) holds 
for a single block. 


21.11.1 
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by which these signals are modified. The word signal is used in a very general sense 
and is not restricted to, say, voltage. On this basis each of the elements of the system 
can be modelled by a transfer function. A transfer function defines the relationship be- 
tween an input signal and an output signal. The relationship is defined in terms of the 
Laplace transforms of the signals. The advantage of this is that the rules governing the 
manipulation of transfer functions are then of a purely algebraic nature. Consider 
Figure 21.3. If 


R(s) = L{r(t)} = Laplace transform of the input signal 
Y(s) = £L{y(t)} = Laplace transform of the output signal 


G(s) = transfer function 
then 
Y(s) = G(s)R(s) 


Transfer functions are represented schematically by rectangular blocks, while signals 
are represented as arrows. Engineers often speak of the time domain and the s do- 
main in order to distinguish between the two mathematical representations of an en- 
gineering system. However, it is important to emphasize the equivalence between the 
two domains. 

Often when constructing a mathematical model of a system using transfer functions, 
it is convenient first to obtain transfer functions of the elements of the system and then 
combine them. Before the overall transfer function is calculated a block diagram is drawn 
which shows the relationship between the various transfer functions. Block diagrams 
consist of three basic components. These are shown in Figure 21.4. 


R(s) [2 | Ys) R(s) 4 R(s)—X(s) -Y(s) ¥(s) 
G(s) 


X(s) Y(s) 
(a) (b) (c) 


Figure 21.4 

The three components of block diagrams. (a) A basic block; the block contains a 
transfer function which relates the input and output signals. (b) A summing point. 
(c) A take-off point. 


We have already examined the basic block which is governed by the relationship 
Y(s) = G(s)R(s). A summing point adds together the incoming signals to the summing 
point and produces an outgoing signal. The polarity of the incoming signals is denoted 
by means of a positive or negative sign. There can be several incoming signals but only 
one outgoing signal. A take-off point is a point where a signal is tapped. This process 
of tapping the signal has no effect on the signal value; that is, the tap does not load the 
original signal. There are several rules governing the manipulation of block diagrams. 
Only two will be considered here. 


Rule 1. Combining two transfer functions in series 
Consider Figure 21.5. The following relationships hold: 
X(s) =G,(s)R(s) —- Y(s) = G,(s)X(s) 
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R(s) X(s) Y(s) R(s) Y(s) 
G(s) G>(s) G(s) G)(s) 


Figure 21.5 Figure 21.6 
Two blocks in series. Figure 21.5 is simplified to a single block. 


Eliminating X(s) from these equations yields 
Y(s) = G,(s)G,(s)R(s) 


Y(s) 
R(s) 


= G,(s)G,(s) 


Finally the overall transfer function, G(s), is given by G,(s)G,(s) as shown in 
Figure 21.6. 


For two transfer functions in series the overall transfer function is given by 


G(s) = G,(s)G,(s) 


21.11.2 Rule 2. Eliminating a negative feedback loop 


Consider Figure 21.7 which shows a negative feedback loop. It is so called because the 
output signal is “fed back’ and subtracted from the input signal. Such loops are common 
in a variety of engineering systems. The quantities X, (s) and X,(s) represent intermedi- 
ate signals in the system. We wish to obtain an overall transfer function for this system 
relating Y(s) and R(s). For the two transfer functions the following hold: 


Y(s) = G(s)X,(s) (21.6) 

X,(8) = H(s)¥(s) (21.7) 
For the summing point 

X, (8) = R(s) — X,(s) (21.8) 
Combining Equations (21.7) and (21.8) gives 

X, (8) = R(s) — H(s)¥(s) (21.9) 
Combining Equations (21.6) and (21.9) gives 

Y(s) = G(s)(R(s) — H(s)¥(s)) = G(s)R(s) — G(s)H(s)¥ (s) 

Y(s)(1 + G(s)H(s)) = G(s)R(s) 


G(s)R(s) 


YS) = TT E@H® 


The overall transfer function for a negative feedback loop is given by 
Y(s) | G(s) 
R(s) 1+ G(s)H(s) 
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R(s) G(s) Ys) 
1 + G(s)H(s) 


Figure 21.7 Figure 21.8 
Block diagram for a negative feedback loop. Simplified block diagram for a negative 
feedback loop. 


Example 21.29 


Solution 


Example 21.30 


The simplified block diagram for a negative feedback loop is shown in Figure 21.8. 

A complicated engineering system may be represented by many differential equa- 
tions. The output from one part of the system may form the input to another part. Con- 
sider the following example. 


A system is modelled by the differential equations 
x +2x = f(t) (21.10) 
2y —y=x(t) (21.11) 


In Equation (21.10) the input is f(t) and the output is x(t). In Equation (21.11), x(t) 
is the input and y(f) is the final output of the system. Find the overall system transfer 
function assuming zero initial conditions. 


The output from Equation (21.10) is x(t); this forms the input to Equation (21.11). The 
block diagrams for Equations (21.10) and (21.11) are combined into a single block dia- 
gram as shown in Figure 21.9. 

Using Rule 1, the overall system transfer function can then be found: 
Y(s) _ 1 
F(s) (s+2)(2s—1) 
This transfer function relates Y(s) and F'(s) (see Figure 21.10). 


F(s) 1 X(s) 1 Y(s) F(s) 1 Y(s) 
s+2 Qs—1 (s + 2)(2s — 1) 


Figure 21.9 Figure 21.10 
Combined block diagram for The overall system transfer function. 
Equations (21.10) and (21.11). 


G(s) = 


A system is represented by the differential equations 


2x -—x = f(t) 
y +3y = x(t) 
zZ+z= y(t) 


The initial input is f(t) and the final output is z(t). Find the overall system transfer 
function, assuming zero initial conditions. 
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Solution The transfer function for each equation is found and combined into one block diagram 
(see Figure 21.11). The three blocks are simplified to a single block as shown in Fig- 
ure 21.12. The overall system transfer function is 

_ 2s) 1 

— F(s) Qs —1)(s+3)(5+ 1)" 


F(s) 1 X(s) 1 Y(s) 1 Zs) F(s) 1 Zs) 
ds—1 543 s+] (2s — I(s + 3)(s + 1) 


G(s) 


Figure 21.11 Figure 21.12 
Block diagram for the system given in Simplified block diagram for 
Example 21.30. Example 21.30. 


Engineering application 21.6 


Transport lag in a system to supply fuel to a furnace 


Transport lag is a term used to describe the time delay that may be present in certain 
engineering systems. A typical example would be a conveyor belt feeding a furnace 
with coal supplied by a hopper (see Figure 21.13). The amount of fuel supplied to 
the furnace can be varied by varying the opening at the base of the hopper but there 
is a time delay before this changed quantity of fuel reaches the furnace. The time 
delay depends on the speed and length of the conveyor. Mathematically, the function 
describing the variation in the quantity of fuel entering the furnace is a time-shifted 
version of the function describing the variation in the quantity of fuel placed on the 
conveyor (see Figure 21.14). 
Let 


u(t)q(t) = quantity of fuel placed on the conveyor, 
where u(t) is the unit step function, 


Conveyor velocity, v 
— 


=—_m— | —————_— 


CEO SOma© 


Conveyor 
| Furnace | 


Figure 21.13 
Coal is fed into the furnace via the conveyor 
belt. 
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Fuel variation 


(mass of fuel q(t) q(t — d) 
per unit length 
of conveyor, d 
kgm! —/ 
ue Os) Ea Qa(s) 
esd 
Ti 

me) Figure 21.15 
Figure 21.14 Block diagram for the 
The time delay, d, is introduced by the conveyor belt. conveyor belt. 


u(t — d)q(t — d) = quantity of fuel entering the furnace, 
d = time delay introduced by the conveyor, 
1 = length of conveyor (m), 
v = speed of conveyor (m s~'). 


The input to the conveyor is u(t)q(t). The output from the conveyor is 


l 
u(t — d)q(t — d). If the conveyor is moving at a constant speed then v = , and 


l 
so d = —. The transfer function that models the conveyor belt can be obtained by 
v 


using the second shift theorem: 
O,(s) = L{ut — d)q(t — d)} =e“ L{q(t)} =e “O(s) 
The transfer function for the conveyor is shown in Figure 21.15. 

Transport lags can cause difficulty when trying to control a system because of 
the delay between taking a control action and its effect being felt. In this example, 
increasing the quantity of fuel on the conveyor does not lead to an immediate increase 
in fuel entering the furnace. The difficulty of controlling the furnace temperature 
increases as the transport lag introduced by the conveyor increases. 


Engineering application 21.7 


Position control system 


There are many examples of position control systems in engineering, for example 
control of the position of a plotter pen, and control of the position of a radio telescope. 
The common term for these systems is servo-systems. 

Consider the block diagram of Figure 21.16 which represents a simple servo- 
system. The actual position of the motor is denoted by ©,(s) in the s domain and 
0, (t) in the time domain. The desired position is denoted by ©,(s) and 6, (t), respec- 
tively. The system is a closed loop with negative feedback. The difference between 
the desired and the actual position generates an error signal which is fed to a con- 
troller with gain K. The output signal from the controller is fed to a servo-motor and 
its associated drive circuitry. The aim of the control system is to maintain the actual 
position of the motor at a value corresponding to the desired position. In practice, if 
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Servo-motor + 
drive amplifier 


Controller 


Figure 21.16 
Position control system. 


a new desired position is requested then the system will take some time to attain this 
new position. The engineer can choose a value of the controller gain to obtain the 
best type of response from the control system. We will examine the effect of varying 
K on the response of the servo-system. 

We can use Rules | and 2 to obtain an overall transfer function for the system. 
The forward transfer function is 


G(s) Ok by Rule 1 
= e 
mete ame 
a... Gis) . : . 
The overall transfer function ; is obtained by Rule 2 with H(s) = 1. So, 
a(S 
0.5K 
0,(s) = s(s+ 1) _ 0.5K = 0.5K 
O,(s) 1 4 OPK) ~ s(st+1)+05K  s?+5+0.5K 
s(s+ 1) 


Let us now examine the effect of varying K. We will consider three values, 
K = 0.375, K = 0.5, K = 5, and examine the response of the system to a unit step 
input in each case. 
Forks = 0375 
0, (s) 0.1875 
O,(s)  s2 +s + 0.1875 


1 
With ©,(s) = —, then 
Ss 


0.1875 - a 0.5 1S 
(s? +5+0.1875)s 8 s+0.75 s+0.25 


using partial fractions. So, 


@,(s) = 


6,@)=1+05e°"—15e° +30 


This is shown in Figure 21.17. Engineers usually refer to this as an overdamped 
response. The response does not overshoot the final value. 


OTmKe—a Oks 
@,(s) 0.25 
@,(s) s*@-+54+0.25 
0.25 1 1 0.5 
@,(s) = 


s(s?++5+0.25) 5s s+05 (s+05) 


6,(t) =1—-e°* —0.5te°* £30 


1 


EXERCISES 21.11 


Find the transfer function for each of the following (d) a 
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0,(t) 


Figure 21.17 
¢ Time response for various values of K. 


This is shown in Figure 21.17 and is termed a critically damped response. It corre- 
sponds to the fastest rise time of the system without overshooting. 
For K = 5 


Ors) De) 
O,(s) De ees 
DS 
(2) = ——_____ 
als) s(s? +5 +2.5) 
Rearranging to enable standard forms to be inverted gives 
1 §4EOS 0.5 
© === = 
(YS (Se Ie OS) 
6,(t) = 1—e°* cos 1.5¢ — $e sin 1.5¢ 


The trigonometric terms can be one as a single sinusoid using the techniques 
given in Section 3.7. Thus, 


6,(t) = 1 —1.054e° sin(1.5t + 1.249) for t>0 


This is shown in Figure 21.17 and is termed an underdamped response. The system 
overshoots its final value. 

In a practical system it is common to design for some overshoot, provided it is not 
excessive, as this enables the desired value to be reached more quickly. It is interesting 
to compare the system response for the three cases with the nature of their respective 
transfer function poles. For the overdamped case the poles are real and unequal, for 
the critically damped case the poles are real and equal, and for the underdamped 
case the poles are complex. Engineers rely heavily on pole positions when designing 
a system to have a particular response. By varying the value of K it is possible to 
obtain a range of system responses and corresponding pole positions. 


2 4682 40% 44 = 20) 


equations assuming zero initial conditions: 


(a) x" +x= f(t) 


d-x 
(b) 2— 


2y 


dr? 


2 


d*y 


(c) 222 


(ec) 6 482 yay =3T say, 


dx 
a a FO f is the system input 
ay 32 + 6y = pit) (f) 6x” + 2x" —x'+4x=4f" +2f' + 6f, f is the 
dt . system input 
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Solutions 


sz +] 
1 
2s? + 35+6 


(c) 


() 1 () 35+4 

ee, 5 yc 

2s? +5—1 6s2 + 85 +4 

(d) I f 4s? +2546 
353 + 652 + 85 +4 (f) 653 +282 —s+4 


POLES, ZEROS AND THE s PLANE 


Most transfer functions for engineering systems can be written as rational functions: that 
is, as ratios of two polynomials in s, with a constant factor, K: 


P(s) 


G = kK 
= FOS) 


P(s) is of order m, and Q(s) is of order n; for a physically realizable system m < n. 
Hence G(s) may be written as 


G(s) =< KE HWE = 4) = ey) 
(s — p,)(s — po)... (8 — p,) 


The values of s that make G(s) zero are known as the system zeros and correspond to 
the roots of P(s) = 0, that is s = z,, Z,,..., Z,,- The values of s that make G(s) infinite 
are known as the system poles and correspond to the roots of Q(s) = 0, that is s = 
P\> Po +++ Py» AS We have seen, poles may be real or complex. Complex poles always 
occur in complex conjugate pairs whenever the polynomial Q(s) has real coefficients. 

Engineers find it useful to plot these poles and zeros on an s plane diagram. A complex 
plane plot is used with, conventionally, a real axis label of o and an imaginary axis label 
of jw. Poles are marked as crosses and zeros are marked as small circles. Figure 21.18 
shows an s plane plot for the transfer function 


3(s — 3)(s +2) 
(s+ 1)+14+3)6+1-3)) 


G(s) = 


The benefit of this approach is that it allows the character of a linear system to be deter- 
mined by examining the s plane plot. In particular, the transient response of the system 
can easily be visualized by the number and positions of the system poles and zeros. 


Figure 21.18 
Poles and zeros plotted for the transfer function: 


3(s — 3)(s +2) 


G(s) = - — 
(s+ 1)(s+ 1+ 3j)(s +1 —3j) 


Example 21.31 


Solution 
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s—2 
(s? +25+5)(s+1) 


Find the poles of 


The denominator is factorized into linear factors: 


(s? +25 +5)(s + 1) = (8 — p,)( — p2)(5 — Ps) 
where p, = —1 + 2j, p» = —1 — 2j, p; = —1. The poles are —1 + 2j, —1 — 2j, —1. 


If X(s) = 


where the pole p, is given by a + bj, then 


1 
x(t) =e?! = e“e™ = e“ (cos bt + j sin bt) 


Hence the real part of the pole, a, gives rise to an exponential term and the imaginary 
part, b, gives rise to an oscillatory term. If a < 0 the response, x(t), will decrease to zero 
as t > 0d. 

Consider the Laplace transform in Example 21.25. There are three poles: —1, —1 +] 
and —1 — j. The real pole is negative, and the real parts of the complex poles are also 
negative. This ensures the response, x(t), decreases with time. The imaginary part of the 
complex poles gives rise to the oscillatory term, cost. The characteristics of poles and 
the corresponding responses are now discussed. 

Given a system with transfer function G(s), input signal R(s) and output signal C(s), 
then 


C(s) 
R(s) > G(s) 
that is, 
C(s) = G(s)R(s) 
and so 
C(s) = K(s—2z,)(s —Z)... (6 —z,, )R(s) 


(s — p(s -_ P») ieasne (s ~ Pn) 
The poles and zeros of the system are independent of the input that is applied. All that 
R(s) contributes to the expression for C(s) is extra poles and zeros. 


1 
Consider the case where R(s) = —, corresponding to a unit step input: 
s 


K(s —z,)(8 —Z,)... (s — Z,) 


C(s) = 
(8 — pi )(S — Pz)... (8 = Py )s 
A A, A, By 
= — 4+ — +--+ + — 
S—P, S—P, S— Pp, Ss 
where A,,A,,...,A,, and B, are constants. Taking inverse Laplace transforms yields 


c(t) = Ae?! + A, Po +--+ A,ePn’ + By 


If the system is stable then p,, p,,..., p,, Will have negative real parts and their contri- 
bution to c(t) will vanish as t > oo. 

The response caused by the system poles is often called a transient response because 
it decreases with time for a stable system. The component of the transient response due 
to a particular pole is often termed its transient. Notice that the form of the transient 
response is independent of the system input and is determined by the nature of the system 
poles. It is now possible to derive a series of rules relating the transient response of the 
system to the positions of the system poles in the s plane. 
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21.12.1 


21.12.2 


21.12.3 


Transient 
jo response 


Figure 21.19 

A pole with a negative real part leads to a decaying transient while a 
pole with a zero real part leads to a transient that does not decay with 
time. 


Transient 
jo response 


Figure 21.20 
The further a pole is from the imaginary axis, the quicker the decay of 
its transient. 


Rule 1 


The poles may be either real or complex but for a particular pole it is necessary for the real 
part to be negative if the transient caused by that pole is to decay with time. Otherwise the 
transient response will increase with time and the system will be unstable, a condition 
engineers usually design to avoid. In simple terms this means that the poles of a linear 
system must all lie in the left half of the s plane for stability. Poles on the imaginary 
axis lead to marginal stability as the transients introduced by such poles do not grow or 
decay. This is illustrated in Figure 21.19. 


Rule 2 


The further a pole is to the left of the imaginary axis the faster its transient decays (see 
Figure 21.20). This is because its transient contains a larger negative exponential term. 
For example, e~*' decays faster than e~~’. The poles near to the imaginary axis are termed 
the dominant poles as their transients take the longest to decay. It is quite common for 
engineers to ignore the effect of poles that are more than five or six times further away 
from the imaginary axis than the dominant poles. 


Rule 3 


For a real system, poles with imaginary components occur as complex conjugate pairs. 
The transient resulting from this pair of poles has the form of a sinusoidal term 
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jo Transient 
response 
x s plane 
o 
x 
Figure 21.21 
A pair of stable complex poles gives rise to a decaying sinusoid 
transient. 


multiplying an exponential term. For a pair of stable poles, that is negative real part, 
the transient can be sketched by drawing a sinusoid confined within a ‘decaying expo- 
nential envelope’ (see Figure 21.21). The reason for this is that when the sinusoidal term 
has a value of | the transient touches the decaying exponential. When the sinusoidal 
term has a value of —1 the transient touches the reflection in the ¢ axis of the decaying 
exponential. The larger the imaginary component of the pair of poles, the higher the 
frequency of the sinusoidal term. 

It should now be clear how useful a concept the s plane plot is when analysing the 
response of a linear system. A complex system may have many poles and zeros but by 
plotting them on the s plane the engineer begins to get a feel for the character of the 
system. The form of the transients relating to particular poles or pairs of poles can be 
obtained using the above rules. The magnitude of the transients, that is the values of the 
coefficients A,, A,,...,A,,, depends on the system zeros. 

It can be shown that having a zero near to a pole reduces the magnitude of the transient 
relating to that pole. Engineers often deliberately introduce zeros into a system to reduce 
the effect of unwanted poles. If a zero coincides with the pole, it cancels it and the 
transient corresponding to that pole is eliminated. 


Engineering application 21.8 


Asymptotic Bode plot of a transfer function with real poles 
and zeros 


Recall that the Bode magnitude plot of a simple linear circuit was examined in 
Engineering application 2.14. It is possible to construct a plot of a more complicated 
transfer function by following some simple steps. 

Here we consider a transfer function with only real poles and zeros 

_ ¢ Ua a 6-2) (6-2) 
G(s) = K 
S72) (So) hai) 

where z,, z,... are the system zeros and p,, p,... are the system poles. 

The first stage is to obtain an expression for the frequency response of the system 
by substituting s = jw 

(jo — 21) (i =z) + Go = zn) 
jo (jo — p;) (im — py) -- (jo — P,) 


G (jo) = K 
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We note that (jw — z,) can be written as —z, (= 
7A 
to each bracket in the numerator and denominator yields 
(—z,) (—z,) oe (—Zn) 
(—p,) (—p») as (sy) 


x : : 5; 
jo (+1) (2 +1)..(2+1) 
12 ae 2p) Pn 
The Bode plot is a logarithmic plot of 20 log,)|G@Qw)|. So we wish to plot 


(=) 3) a 2) 
enter) 


jo (2 +1) (#2 +1)..(2+1) 
oe 1 —Pn 


Using the laws of logarithms (page 86) this can be rewritten as 


20 log, |G Ga)| 


+ 1) Applying this process 


G(jo) = K 


20 log, |G Gjm)| = 20log,, 


—Z —Z)::- =f. 1 
= 201og,5 ral 1) (=) ++ (=n) + 20 logi, eal 
(—p,) (—p») Pag (—p,) ai 
jo ja 
+20lor l= ] Sein, (ea 1 
mo) <n 
— 20 log) lj] — 20 log,, ls ar 1 
Pi 
= oiiloe | ot ) Be Ulog yee 1 
1) Pn 


In this form the contribution of individual poles and zeros can be studied. 
(=z) (=z) fae (=z) 


(—p,) (—p2) ae (=12)) 
frequency, w. 


The term 20 log,, |j@| has a value of 0 at a = 1. Atm = 10 it has a value of 20, at 
q@ = 100 it has a value of 40, and so on. At # = 0.1 it has a value of —20, at 0.01, a 
value of —40, and so on. If we plot this term by itself using a logarithmic frequency 
scale it would therefore be a straight line passing through the point where m = | with 
a gradient of 20 dB for each decade, or factor of 10, increase in frequency. 


is constant. It has no dependence on 


The term 20 log) Ik 


The term 20 log,, Hee + 1} can be examined by considering the influence at dif- 
—Z, 
; oO jo 
ferent frequencies. If |—| < 1, then 20log,, | — + 1} © 20log,, |1| = 0. So the 
ca Si 


term has very little influence. 
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; a) 
Now consider 


= 10. Then 20log,, 
<] 


eee 1 = 20 log, |j10| = 20. It can 
=e 


be shown by successively considering e = 100, 1000, 10000, ... that for every 


& 
decade multiplication in frequency (for toed z,) the term increases by another 20 dB. 
Graphically, it approximates to a straight line that increases at 20 dB for every decade 
(x10) change in frequency. The point at which this straight line intersects the w axis 
is called the breakpoint. 


a a ll 
1 
discussed for the term involving zeros. The only difference is the straight line falls 
by 20 dB for every decade multiplication in frequency away from the breakpoint. 

By considering the addition of the terms discussed above we can produce an ap- 
proximation to the Bode plot. These approximate lines are asymptotes to which the 
actual plots tend as one moves away from the breakpoints. Hence the term asymp- 
totic Bode diagram is used. 

Consider a practical example of finding the poles and zeros, and sketching the 
transfer function 


The term —20 log, and others like it have similar properties to those 


5000 (s + 3) 


AO eer rE ST CEECTIO) 


Firstly we note the position of the poles and zeros by observing that 

5000 (s — (—3)) 

s (s — (—100)) (s — (—500)) 
There is a single zero at z, = —3. There are three poles at p, = 0, p, = —100, 
P3 = —500. The pole at 0 is usually termed the pole at the origin. 

The substitution s = jw is made to give 

5000 Ga + 3) 

jo Gw + 100) G@ + 500) 


G(s) = 


G(jo) = 


The equation is rearranged as 


jo 
5000 x 3 ($+) 
100x500 . (jw jo 
eee || ea 
jo (4 4 (H+ ) 


The constant term contributes a gain of 


G(jo) = 


(Eales) 5000 x 3 | 
20 log), |K = = 20 log,,. | ————- | = 20log,, |0.3| 
I (2p (Spee) '© 100 x 500 ay 
The zero contributes a gain of 
Sloe |e 1 S0loe ae ) 
Ke 3 


This is approximated by a straight line on the Bode diagram starting at @ = 3 and 
increasing at 20 dB per decade. 


674 Chapter 21 The Laplace transform 


The pole at the origin contributes a gain of 
—20 10g 19 |jo| 


This is plotted as a straight line with no start or end points, with a slope of —20 dB 
per decade passing through w = 1. 
The two other poles contribute a gain of 


o jo 
—20 log ig =e =F 1) = —20 logo 100 SF 1 
and 
jo jo 
—201lo — +1} = —20lo —4+1] 
S10 a + | S10 500 os | 


respectively. The first of these is approximated by a line with slope —20 dB per decade 
starting at w = 100, the second has the same gradient and starts at @ = 500. 

The individual contributions to the overall plot are shown in Figure 21.22. Notice 
the starting points of the asymptotes — the breakpoints — and their correspondence to 
the positions of the poles and zeros. 


oe 0 
NEN a “ll 
= 
A 


Figure 21.22 
Contributions of the individual terms to the Bode plot. Dotted lines are the asymptotic 
approximations. 


The asymptotes can now be added together to form a combined graph which is 
shown in Figure 21.23. Also plotted on the same graph is an exact Bode plot of the 
transfer function obtained using a computer. 

Notice that the largest error in the approximation occurs at the breakpoints, 
whereas the two graphs agree well at other frequencies. The reason for the error 
is most apparent when we consider the case where m = —z,, for example. The gain 
at this break frequency according to the line plotted should be 
We 


1 


20 log io |— 


= 20 logy lj] = 0 
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dB 


10 100 1000 10* @ 


Figure 21.23 
Comparison between the asymptotic Bode plot and the exact frequency response. The dotted 
line is the result of adding the asymptotic approximations together at each frequency point. 


whereas the exact value is 
anys ] = 20 logyg ii + 1] = 20log,q |¥2| = -3.01 
= 

Fortunately this is the worst-case error for a single pole or zero and if a more 
accurate sketch is required it is possible to use this fact to improve on the asymptotic 
approximation of the transfer function. Engineers find it useful to sketch a Bode plot 
in order to get a ‘feel’ for the frequency response of a system. If a more accurate plot 


20 log) 


is needed then it is possible to use a computer package to obtain it. 


LAPLACE TRANSFORMS OF SOME SPECIAL FUNCTIONS 


In this section we apply the Laplace transform to the delta function and periodic func- 
tions. These functions were introduced in Chapter 2. 


The delta function, 5(t—d) 


Recall the integral property of the delta function (see Section 16.4), 
i f(t)d(t — d) dt = f(d) 


The Laplace transform follows from this integral property. Let f(t) = e~” so that 
f(d) =e. Then 


[. e"8(t—d) dt = ik e"5(t—d)dt = fd) =e™ 
= 0 


lo. @) 


that is, 


L{5(t —d)} =e 
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The Laplace transform of 5(t) follows by setting d = 0: 


EGG 


Example 21.32 For a particular circuit it can be shown that the transfer function, G(s), is given by 
MO (s) a 1 
~ Vis) s+2 


G(s) 
where V,(s) and V,(s) are the Laplace transforms of the input and output voltages 
respectively. 

(a) Find v,(t) when v;(t) = (ft). 
(b) Use the convolution theorem to find v,(t) when 
e’ ¢t>0 


if) = 
u@) 0 t<0O 


Solution (a) Weare given the transfer function 


Vos) 1 
OO iy iD 
When v,(t) = 5(¢), V,(s) = 1 and hence 
Vi(s) = a8 
: s+2 


and 


s+2 


This is known as the impulse response of the system, g(t). If the impulse response, 
g(t), is known then the response, vu, (t), to any other input, v,(¢), can be obtained by 
convolution, that is v,(t) = g(t) * u,(¢). 


win=£"| : \ao? 


(b) The response to an input, v,(t) = u(t) e™, is given by 


u(t) = gt) #u,() = / g(A)u,(t — A) dA 
0 


t 
= / ee 4) qa 
0 


t 
e- i. eda 
0 
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21.13.2 Periodic functions 


Recall the definition of a periodic function, f(t). Given T > 0, f(t) is periodic if f(t) = 
f(@ +T) for all t in the domain. If f(t) is periodic and we know the values of f(t) 
over a period, then we know the values of f(t) over its entire domain. Hence, it seems 
reasonable that the Laplace transform of f(t) can be found by studying an appropriate 
integral over an interval whose length is just one period. This is indeed the case and 
forms the basis of the following development. 

Let f(t) be periodic, with period T. The Laplace transform of f(r) is 


LUO} =F@ = iy eo f(t) dt 


db 2T 3T 
= eroas | roa f e "f(t)dt+--- 
0 T 


2r 


Let t = x in the first integral, f = x + T in the second, t = x + 2T in the third and so on. 
T T 
F(s) = i, ef (x) dx + i e OY) F(x +7) dx 
0 0 
T 
+f raidcaieal yf 6 +2T)dx+--- 
0 

Since f is periodic with period T then 

f@) = f@tT)=fat2T)=--- 
So, 


vi 


i fF 
F(s) = i ef (x) dx + / e Te f(x) dx + / ee Fajds +s 
0 0 0 


Tr 
= al af et ae eT shes of e f(x) dx 
0 


We recognize the terms in brackets as a geometric series whose sum to infinity 


is —————. Hence, 
1—e—7 


ies" fe)ide 


18 = 
(s) (20 
Example 21.33 A waveform, f(t), is defined as follows: 


2 O0<t< 1.25 


f=) 1.25 <1t<15 


and f(t) is periodic with period of 1.5. Find the Laplace transform of the waveform. 
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15 st 
Solution L{f(t)} = Jo & "FO at 


1 = ea l5s 
fo? 2e- dt + fi. Oe de 
= 1 eal 
_ 2(1 _ e712) 
= s(1 — e7! 55) 


EXERCISES 21.13 
1 Find the Laplace transforms of and has a period of 2. 
tah. Sie) ee) (b) —6u(t) + 48(¢) (a) Sketch two cycles of f(t). 
(c) 3u(t —2)+ d(t — 2) 
(d) u(t — 3) —d(t — 4) 
(e) 5u(t — 4) + 35(t — 4) 4 A waveform, f(t), is defined by 


where u(t) is the unit step function. 


(b) Find L{f(1)}. 


0<t<1.5 
OD) pute 1s ap i 
2 Find the inverse Laplace transforms of : : 

2. 2 1 3 — 2s and has a period of 2.5. 
(a)--1 (b) —+>5 (c) 

RY 3s 2 
(a) 4s —3 (a) Sketch f(r) on [0, 5]. 

2 (b) Find L{f(t)}. 


3. A periodic waveform is defined by 


5 If the impulse response of a network is g(t) = 10e~’ 


f(t) = O<r<l find the output when the input is f(t) = e~ cos 21, 
2-t 1<t<2 t>0. 
Solutions 
3 6 — 26 d) 46(t) — 
Bo 241 (b) 2 44 (c) 3u(t) (t) (d) 46(t) — 3u(t) 
Ss AY 3 (b) 1—e°* 
© = 4e2 @ Se rdee™) 
Ss re () 2s Ae e255 _ e158) 
(e) e 4s 43 en4s s2(1 = e255) 
? (30cos2¢+20sin2rye-? 30e-* 
2u(t) S(t) 5 
2 (a) 2u(t) — d(t) (b) 3 + —s 13 13 
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(c)2e"(f@) +1) (d) u@—Dfe¢-1) 


s+l1 
1 Given F(s) = aE is the Laplace transform of 
Ss 


+2 
f(t), find the Laplace transforms of the following: (e) 4u(t — 3) f(t — 3) 
f() 


e2t 


(a) (b) 3e7 f(r) (f) e~~u(t — 2) f(t — 2) 


Find the Laplace transforms of the following: 


(a) 2f sin 3t (b) : (—3t cos 2t) 
(c) e u(t) (d) e ‘u(t — 1) 
(e) 3¢2u(t — 1) (f) e“5(t — 2) 
Find the inverse Laplace transforms of 
2s +3 

(s+ Ds +2) 

4s 
s2—9 
2(s3 + 4s? + 4s + 64) 

(s? + 4)(s? + 16) 


(a) 


(b) 


(c) 


s+1 


6 
d 2s * —s 
(dye a (e) e 2 


Solve the following differential equations using the 
Laplace transform method: 


(a) x’ +2x'—3x=2r, x(0)=1 x (0)=2 

(b) x” — 2x’ + 5x =cost, x(0) =0 x(0)=1 
or 3 

(c) tt+ytety=sz(l +2) 


x—2y+x+2y =2t 
x(0) =0 y(0) =0 


@) t-y+x=—-1, x(0)=0 y(0)=2 
2-45 -x=1 


The current, i(f), in a series LC circuit is governed by 


dt C 
where v(t) is the applied voltage. 


a 4 Pf . 
L—+-—]| idt=v(t) 
0 


(a) Assuming zero initial conditions show that 
1 
LsI+ —I=V 
st + Cs (s) 
(b) If v(t) = d(¢) show that 


. 1 t 
i(t) = — cos —— 


LL /LC 


The input and output voltages, uv, (t) and v,(t), of a 
series RC network are related by the differential 
equation 


du, 
ar a Vo => v; 
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(a) IfVo(s) = L{v,(¢)} and V;(s) = Lf{v;(1)}, show 


that the transfer function, 


1 
sCR+1° 


, 1s given by 


1 


(b) If C =0.1 wF, R = 100 kQ find, and sketch a 
graph of the response, v,(t), when 


5 volts ¢t>0 


(7) = 
YO=) 4 t<0 


(c) Using the component values given in (b) find the 
response when the input is a unit impulse, 5(f). 


(d) Using the component values given in (b) use the 
convolution theorem to determine the response 
when 0; (t) = 5 e 1008, 


7 Express the square wave 


1 O<t<a 


fH= 


-l a<t<2a period 2a 


in terms of unit step functions. Hence deduce its 
Laplace transform. 


8 Consider the circuit shown in Figure 21.24. 


1H 40 


Lol 
Ls a 


i(t) 


u(t) 
Figure 21.24 


(a) Show that 
t 


di 
+443 [ idt = v(t) 
dt 0 

(b) Assuming zero initial conditions, show that 


I(s) = (s) 


Ss 
(s+ 1)(s+ 3)" 
(c) Find i(t) if v(t) = 6(2). 
(d) Find i(t) if 


Qe +>0 
vt) = : 
0 otherwise 
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INTRODUCTION 


Difference equations are the discrete equivalent of differential equations. The termi- 
nology is similar and the methods of solution have much in common with each other. 
Difference equations arise whenever an independent variable can have only discrete val- 
ues. They are of growing importance in engineering in view of their association with 
discrete-time systems based on the microprocessor. 

In Chapter 21 the Laplace transform was shown to be a useful tool for the solution of 
ordinary differential equations, and for the construction of transfer functions in circuit 
analysis, control theory, etc. Generally, Laplace transform methods apply when the vari- 
ables being measured are continuous. The z transform plays a similar role for discrete 
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22.2.1 


systems to that played by the Laplace transform for continuous ones. In this chapter you 
will be introduced to the z transform and one of its applications — the solution of linear 
constant coefficient difference equations. The z transform is of increasing importance 
as more and more engineering systems now contain a microprocessor or computer and 
so have one or more discrete-time components. For example, most industrial controllers 
now have an embedded microprocessor, and overall control of a factory is often by means 
of a supervisory process control computer. Many factories also have a production control 
computer to schedule production. 


BASIC DEFINITIONS 


Before classifying difference equations we will first derive an example of one. 

Suppose a microprocessor system is being used to capture and analyse images. The 
number of instruction cycles, i, of the microprocessor needed to process an image de- 
pends on the number of pixels, n, that the image is broken down into. Clearly n is a 
non-negative integer, that is nm € N. Since i depends upon n we write i = i[n]. The 
square brackets notation reflects the fact that n is a discrete variable (see Section 6.2). If 
there are n items of data, the number of instruction cycles used is i[7]. 

Suppose that if there are n + 1 items of data, the number of instruction cycles used 
increases by 10n + 1. Then, 


i[n+ 1] =i[n] + 10n+1 


This is an example of a difference equation. The dependent variable is i; the independent 
variable is n. 
Putting n = 0 in the difference equation gives 


i[1] = i[0] + 10(0) +1 =1 


Similarly, i[2] = 12, i[3] = 33, i[4] = 64 and so on. We see that the difference equation 
gives rise to a sequence of values. 

There are strong similarities between difference and differential equations. The im- 
portant point to note is that with difference equations, the independent variable is dis- 
crete, not continuous. In the above example, n is the number of pixels; it can have only 
integer values. This discrete property of the independent variable is an essential and 
distinguishing feature of difference equations. Much of the terminology of differential 
equations is applied, with identical meaning, to difference equations. 


Dependent and independent variables 


Consider a simple difference equation: 
x[n + 1] — x[n] = 10 

The dependent variable is x; the independent variable is n. In the difference equation 
yl[k + 1] — y[k] = 3k4+5 


the dependent variable is y and the independent variable is k. 


22.2.2 


Example 22.1 


Solution 


22.2.3 
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The solution of a difference equation 


A solution is obtained when the dependent variable is known for each value of interest 
of the independent variable. Thus the solution takes the form of a sequence. There are 
frequently many different sequences which satisfy a difference equation; that is, there 
are many solutions. The general solution embraces all of these and all possible solutions 
can be obtained from it. 


Show x[n] = A2”, where A is a constant, is a solution of 


x[n + 1] — 2x[n] = 0 


x[n] = A2" so x[n + 1] = A2”*! = 242" 
Hence, 
x[n + 1] — 2x[n] = 2A2” — 2A2” =0 


Hence x[n] = A2” is a solution of the given difference equation. In fact x[7] is the general 
solution. 


If additionally we are given a condition, say x[0] = 3, the constant A can be found. If 
x[n] = A2”, then x[0] = A2° = A and hence 


A=3 


The solution is thus x[n] = 3(2”). This is the specific solution and satisfies both the 
difference equation and the given condition. 


Linear and non-linear equations 


An equation is linear if the dependent variable occurs only to the first power. If an equa- 
tion is not linear it is non-linear. For example, 


3x[n + 1] — x[n] = 10 
y[n + 1] —2y[n— 1] =r" 
ke[k + 2] + z[k] = z[k — 1] 


are all linear equations. Note that the presence of the term n? does not make the equation 
non-linear, since 7 is the independent variable. However, 


(x[n + 1])* — x[n] = 10 
y[k + 1] = Jy[k] +1 


are both non-linear. Also, 


z[n + 1]z[n] = n* + 100 


sin x[n] = x[n — 1] 


are non-linear. The product term z[m + 1]z[n] and the term sin x[m] are the causes of the 
non-linearity. 
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22.2.4 


22.2.5 


Order 


The order is the difference between the highest and lowest arguments of the dependent 
variable. The equation 


3x[n + 2] — x[n + 1] — 7x[n] =n 
is second order because the difference between n + 2 and nis 2. 
x[n + 1]x[n — 1] = 7x[n — 2] 


is third order because the difference between n + 1 and n — 2 is 3. In general, the higher 
the order of an equation, the more difficult it is to solve. 

In some difference equations the dependent variable occurs only once. These are clas- 
sified as zero order. Engineers refer to them as non-recursive difference equations be- 
cause calculation of the value of the dependent variable does not require knowledge of 
the previous values. In contrast, difference equations of order | or greater are referred to 
as recursive difference equations because their solution requires knowledge of previous 
values of the dependent variable. The difference equation 


x[n]J =r? +n+1 


has zero order and so is a non-recursive difference equation. 


Homogeneous and inhomogeneous equations 


The meanings of homogeneous and inhomogeneous as applied to linear difference equa- 
tions are analogous to those meanings when applied to differential equations. To decide 
whether a linear equation is homogeneous or inhomogeneous it is written in standard 
form, with all the dependent variable terms on the |.h.s. Any remaining independent 
variable terms are written on the r.h.s. For example, 


3nx[n + 1] — 2n? = x[n — 1] 
is written as 
3nx[n + 1] —x[n — 1] = 2n° (22.1) 


If the r.h.s. is 0, the equation is homogeneous; otherwise it is inhomogeneous. Equa- 
tion (22.1) is inhomogeneous but 


3nx[n + 1] —x[n — 1] =0 


is homogeneous. 


Engineering application 22.1 


Signal processing using a microprocessor 


In engineering, an increasing number of products contain a microprocessor or com- 
puter which is solving one or more difference equations. The input to the micropro- 
cessor is a sequence of signal values, in many cases formed as a result of sampling a 
continuous input signal. The output from the microprocessor is a sequence of signal 
values which may be subsequently converted into a continuous signal. For example, 
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an inhomogeneous difference equation could be of the form 
y[n] — 2y[n — 1] = 0.1s[n] + 0.2s[n — 1] — 0.5s[n — 2] 


where y[n] is the output sequence or dependent variable and s[n] is the input se- 
quence. Note that the input sequence can still be thought of as the independent vari- 
able but instead of being expressed analytically in terms of n it arises as a result of 
sampling. The corresponding homogeneous equation is 


y[n] — 2y[n — 1] =0 


22.2.6 Coefficient 


The term coefficient refers to the coefficient of the dependent variable. In Equation (22.1) 
the coefficients are 3n and —1. 


Example 22.2 (a) State the order of each of the following equations (i)—(vii). 


(b) State whether each equation is linear or non-linear. 
(c) For each linear equation, state whether it is homogeneous or inhomogeneous. 
(i) 2x[n] — 3nx[n — 1] + x[n — 2] +n? =0 
(ii) sue + 1] —x[n — 1]) = x{[n] 
(ii) z[n + 21Qn —z[n—1]) =n+1 
_. Ix{n-1) n+l 
(iv) = 
x[n — 2] n—1 
(v) w[n+3]w[nt+ l]=n-1 


(vi) y[n +2] + 2y[n + 1] = 6s[n + 2] — 2s[n + 1] + s[n] where y is the dependent 
variable 


(vii) x[k+3] —2x[k+2]+.2[k] = e[k+2]—e[k] where x is the dependent variable. 


Solution (a) (i) Second order 
(ii) Second order 
(iii) Third order 
(iv) First order 
(v) Second order 
(vi) First order 
(vii) Third order 
(b) (i) Linear 
(ii) Linear 
(iii) Non-linear 
(iv) Linear 
(v) Non-linear 
(vi) Linear 
(vii) Linear 
(c) Equations (iii) and (v) are non-linear. The linear equations are written in standard 
form: 
(i) 2x[n] — 3nx[n — 1] +.2[n — 2] = —n?* 
(i) x[n + 1] — 3x[n] — x[n — 1] =0 
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(iv) 7(n — 1)x[n — 1] — (n+ 1)x[n — 2] =0 
(vi) and (vii) are already in standard form. 
Hence we find the following: 

(i) Inhomogeneous 

(ii) Homogeneous 

(iv) Homogeneous 


(vi) Inhomogeneous 
(vii) Inhomogeneous 


EXERCISES 22.2 
1. For each of the following equations (a)—(e): (c) pln — 2] + yfn — 1] + yn] = ? 
(i) State the order of the equation. (11) State whether (d) Jn¥alnl = JO} cia whee othe 


each equation is linear or non-linear. (111) For each 
linear equation, state whether it is homogeneous or : 
inhomogeneous. (e) Qw[n— 1] + 1)* = w[n—2]4+s[n— 1] —s[n—2], 
(i iGr+ainl) an — 1] where w is the dependent variable. 


2z[k — 4] 


(b) Wk —3) = z[k — 2] 


dependent variable. 


Solutions 


1 (a) First order; linear; inhomogeneous (d) Second order; non-linear 


(b) Second order; non-linear (e) First order; non-linear 


(c) Second order; linear; inhomogeneous 


es] REWRITING DIFFERENCE EQUATIONS 


Sometimes an equation or expression can be written in different ways. At first sight, it 
may appear there are two independent equations when in fact there is only one. Thus 
we need to be able to rewrite equations so that comparisons can be made. When general 
solutions of equations are to be found, usually the equation is first written in a standard 
form. So once again there is a need to rewrite equations. 


Example 22.3 Rewrite the equation so that the highest argument of the dependent variable is n + 1. 
x[n + 3] — x[n + 2] = 2n x[(2)=7 
Solution The highest argument in the given equation is n + 3; this must be reduced by 2 ton + 1. 
To do this n is replaced by n — 2. The equation becomes 
x[n + 1] — x[n] = 2(n — 2) x 27 


Note, however, that the initial condition, x[2] = 7, is not changed. This is simply stating 
that x has a value of 7 when the independent variable has a value of 2. 


1 


Solution 


Example 22.5 
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Example 22.4 Write the following equations so that the highest argument of the dependent variable is 


n+2. 
(a) 3x[n + 4] — 2nx[n + 2] = (n— 1)? x[3] = 6 x[4] = -—7 
(b) z[n — 2] +2[n—1]+2[n]=1+4+n z[0] = 1 z[{1] =0 
(a) The highest argument, n+ 4, must be replaced by n+ 2, that is n is replaced by n—2 
throughout the equation. 
3x[n + 2] — 2(n — 2)x[n] = (n — 3)” x[3] = 6 x[4] = -7 
(b) The highest argument, n, is increased to n + 2, that is n is replaced by n + 2. 


z[n] + z[n+ 1] + z[n +2] =n+4+3 z[0] = 1 z[1] =0 


Write the following equations so that the highest argument of the dependent variable 


is k: 
s[k + 2] — 2s[k + 1] + s[k] 
15 
where a is the dependent variable. 


i[k + 3] + Uk + 2] + Uk + 1] + 1k] + Uk — 1) 


5 
where a is the dependent variable. 


(a) alk +2] = 


(b) alk +3] = 


(a) The highest argument, k + 2, must be replaced by k, that is k is replaced by k — 2 


s[k] — 2s[k — 1] + s[k — 2] 
15 
(b) The highest argument, & + 3, must be replaced by k, that is k is replaced by k — 3 


Wk) + Uk — 1) + Ik — 21+ Uk — 3) + Uk — 4] 
~ 5 


Solution 
throughout the equation: 
a[k] = 
throughout the equation: 
alk] 
EXERCISES 22.3 


Write each equation so that the highest argument of 
the dependent variable is as specified: 


(a) p[k] —3p[k+ 1] = p[k — 2], highest argument of 
the dependent variable is to be k + 2. 


(b) R[n — 1] — R[n — 2] — R[n — 3] =n, R[O] = 1, 
R{1] = —2, highest argument of the dependent 
variable is to be n. 


(c) q[t] + tg[t — 1] = 3q[t + 1], q{1] = 0, 


(d 


(e 


wm 


) 


q{2] = —2, highest argument of the dependent 
variable is to be t — 1. 

T{m] + (m— 1)T[m — 2] = m2, 

T(O] = T[1] = 1, highest argument of the 
dependent variable is to be m + 2. 

ylk + 1] — y[k+ 3] = (s[k + 2] — s[k + 4])/2, 
where y is the dependent variable and the highest 
argument of the dependent variable is to be k. 
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Solutions 
1 (a) pik +1) —3p[k +2] = pik —- 1] (d) Thm +2] + (n+ 1)T[m] = (m+ 2)" 
(b) R[n] — R[In — 1] — R[n—2] 1 T[0]) =7[1]=1 
n n n =n-4 
R[O] = 1, R[1] = —2 (©) ylk-2) —y[ = sik — 1] —s[k+1] 


2 


(©) g[t— 2] + @ — 2)q[t — 3] = 3qlt — 1 
ql] = 0, q[2] = —2 


BLOCK DIAGRAM REPRESENTATION OF DIFFERENCE 
EQUATIONS 


Many engineering systems can be modelled by means of difference equations. It is pos- 
sible to represent a difference equation pictorially by means of a block diagram. The use 
of a block diagram representation helps an engineer to visualize a system and may often 
be helpful in suggesting the required hardware or software to implement a particular dif- 
ference equation. This is particularly important in areas such as digital signal processing 
and digital control engineering. 

Before discussing block diagrams it is necessary to review the topic of sampling. 
Difference equations operate on discrete-time data and therefore a continuous signal 
needs to be sampled before use. In the most common form of sampling, a sample is taken 
at regular intervals, T. A continuous signal and the sequence produced by sampling it 
are shown in Figure 22.1. Some authors write the sequence as x[nT] to indicate that the 
sequence has been obtained by sampling a continuous waveform at intervals T. We will 
not use this convention but simply refer to the sampled sequence as x[n]. 

Several components are used in a block diagram. The delay block is shown in 
Figure 22.2. The effect of this element is to delay the sequence by one sampling interval, 
T. For example, if 


x[n] = 6, 4, 3, —2, 0, 2,5,0,... n=0,1,2,... 


we can write this as x[0] = 6, x[1] = 4, x[2] = 3, x[3] = —2,..., and then 


x[n — 1] = 0, 6, 4, 3, —2,0,2,5,0,... n= 051,25 5402 


x(t) x{n] 


(a) (b) 


Figure 22.1 
(a) Continuous signal; (b) sequence produced as a result of sampling. 
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x[n] x[n- 1] x[n] x[n — 2] 
=| 2 eg Me cee 


Figure 22.2 Figure 22.3 
A delay block delays a sequence Two delay blocks in series. 
by a time interval, 7. 


x{[n] x{n] pin] 


x[n] +a kx[n] pin] + q[n] + rin] 
a k 
Figure 22.4 Figure 22.5 Figure 22.6 
Adding a constant Scaling a sequence Adding sequences together 
to a sequence. by a constant. using a summer. 


Note that x[n — 1] is undefined when n = 0 and so this is assigned a value of 0. A delay 
of two sampling intervals results in the sequence x[n — 2] as shown in Figure 22.3. 

Another block diagram element represents the addition of a constant to a sequence 
and is shown in Figure 22.4. A sequence can be scaled by a constant. This is shown in 
Figure 22.5. Finally, sequences can be added together using a summer. This is shown 
in Figure 22.6. 


Engineering application 22.2 


Discrete-time filter 


The term filter is used by engineers to describe an electronic device that selectively 
processes signals of different frequencies. So, for example, a low pass filter is a filter 
that allows low frequency signals to pass with little attenuation of their amplitude. 
However, high frequency signals tend to be severely attenuated. In contrast, a high 
pass filter attenuates low frequency signals while allowing high frequency signals to 
pass with relative ease. A discrete-time filter or, as more commonly termed, a dig- 
ital filter is an electronic device that uses digital methods to selectively filter digital 
signals. 

A simple example of a discrete-time filter is one described by the difference 
equation 


y[n] — ay[n — 1] = x{[n] 


where x[7] is the input sequence, y[7] is the output sequence and a is a constant. If a 
is positive then the filter behaves as a low-pass filter which rejects high frequencies 
but allows low frequencies to pass. If a is negative then the filter behaves as a 
high-pass filter. A block diagram for the filter is shown in Figure 22.7. Note that 
this is a recursive filter because calculation of y[n] requires knowledge of previous 
values of the output sequence. Note also that the block diagram contains a feedback 
path. This is a feature of recursive difference equations. 
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x[n] yln] 


Figure 22.7 
Discrete-time filter. 


Engineering application 22.3 


Development of a difference equation to calculate the 
acceleration of an object 


Engineers often need to calculate the acceleration of an object when all that is known 
is the position of that object. This can be achieved by developing a difference equation 
that can be solved at regular time intervals by a computer. It is first necessary to 
use a device that converts the position signal into a form suitable for analysis by a 
computer. Usually the position signal will be continuous and therefore in analogue 
form. It is therefore necessary to use an analogue-to-digital converter to transform 
the analogue signal into a digital signal. Consider the following problem. Develop 
a difference equation to convert a digital position signal into a digital acceleration 
signal given that this position signal varies with time. Derive an associated block 
diagram for this difference equation. 
Let s = position, v = speed and a = acceleration: 
ds dv d’s 
DS Ge sd = de 
Therefore, in order to obtain an acceleration signal the position signal must be dif- 
ferentiated twice. For a small time interval T, the derivative, y(t), of a signal x(t) can 
be approximated by 
x(t) —x(t —T) 
Ee 
This follows directly from the definition of differentiation. If the signal x(t) is sam- 
pled to give x[n] then the process of differentiation is represented by the difference 
equation 


y(t) © 


x[n] — x[n — 1] 
yln] = F 
Figure 22.8 shows a block diagram for the differentiator. It is important to note that 
this difference equation is only an approximation to the process of differentiation. 
This could be implemented using special-purpose hardware or by software on a mi- 
croprocessor. It follows that the speed of the object is given by 


s[n] — s[n — 1] 
a 


The problem of finding the acceleration, a[n], can now be solved by coupling two 
differentiators together as shown in Figure 22.9. An alternative approach to this 


v[n] = 
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Figure 22.8 
Block diagram of a differentiator. 


s[n] v[n] 


Figure 22.9 
Two differentiators in series. 


problem is to obtain a difference equation for the process of finding a second deriva- 
tive. Given that 


v[n] = “oe (229) 
and 
a(n] = aS O23) 


then substituting Equation (22.2) into Equation (22.3) gives 
— Gil sles) — Gh N—sla— 2) 
= a 
s[n] — 2s[n — 1] + s[n — 2] 
T2 


an] 


a[n| = 


The block diagram for this difference equation is shown in Figure 22.10. Note that the 
difference equation is non-recursive and so there are no feedback paths in the block 
diagram. The output sequence is a[n] and the input sequence is s[n]. The independent 
variable is n. 


Figure 22.10 

a[n] is obtained by 
calculating the second 
derivative of s[n]. 


692 Chapter 22 Difference equations and the z transform 


Engineering application 22.4 


Development of a moving average filter to calculate the height 
of acid in a chemical tank 


A common problem in chemical engineering is that the height of liquid in a tank is 
difficult to measure accurately because of the liquid swilling around. One solution to 
this problem is to use a moving average filter to smooth out any variations in height 
obtained from the tank position sensor. A key design decision is how many past data 
points to use when constructing the moving average filter. In practice this depends, 
amongst other things, on how much the liquid moves around. It is important not to 
use too many points or significant variations in height may not be picked up in time. 
However, if too few points are used then the output from the filter may be too spiky to 
be of use. Consider the case of a moving average filter that makes use of the five most 
recently measured values of the level of the liquid in the tank. Formulate a difference 
equation to carry out this averaging and draw an equivalent block diagram. 

Let a[n] represent the average level of the acid in the tank and let /[n] represent 
the sampled values of the level measurements received from the transducer. Then, 

[[n] + [[n — 1] + [[n — 2] + I[n — 3] + I[n — 4] 
a(n] = 5 
The block diagram for this difference equation is shown in Figure 22.11. The action 
of taking a moving average of sampled values is equivalent to passing the sampled 
values through a low-pass filter because it filters out high-frequency variations in the 
sampled values. This process is termed digital filtering or digital signal processing. 


Figure 22.11 
Block diagram of a moving averager. 


EXERCISES 22.4 


computer the signal is sampled using an 
analogue-to-digital converter. Derive a difference 
equation and associated block diagram to obtain the 
2 Acomputer is fed a signal representing the velocity of acceleration of the object as a function of time. 

an object as a function of time. Prior to entering the 


1. Design a digital filter based on taking a moving 
average of the last three values of a sampled signal. 


Solutions 


1. See Figure S.23. 


Figure S.23 
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v[n] — v[n — 1] . 
2 a(nj= > See Figure S.24. 


a{n] v[n] 


Figure S.24 


DESIGN OF A DISCRETE-TIME CONTROLLER 


Figure 22.12 shows a block diagram of a single loop industrial control system. The con- 
trol system employs feedback to compare the desired value of a process variable with 
the actual value. Any difference between the two generates an error signal e(t). This 
signal is processed by the controller to produce a controller signal m(t). An amplifier is 
often present to magnify this signal to make it suitable for driving the plant that is being 
controlled. 

The controller can be implemented by means of an analogue electronic circuit. How- 
ever, digital computers are being used increasingly as controllers. The signals e(t) and 
m(t) are both continuous in time and so it is necessary to sample e(t) before it can be 
used by the computer and to reconstruct the signal generated by the computer to pro- 
duce the controller output, m(t). The arrangement for implementing a digital controller 


is shown in Figure 22.13. 
Figure 22.12 


A single loop industrial control system. 


e(t) Analogue-to- Digital Digital-to- mt) 
digital converter computer analogue converter 


Figure 22.13 
Block diagram of a digital controller. 
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The most common type of controller used in industry is the proportional/integral/ 
derivative (p.i.d.) controller. It can be shown that the analogue form of this controller is 
modelled by the equation 


de(t) 
dt 


t 
m(t) = K,e(t) + K | e(t) dt + K, (22.4) 
0 
where K,, K, and K, are constants. In order to implement a discrete-time (digital) con- 
troller it is necessary to convert this equation into an equivalent difference equation. The 
approximation for the process of differentiation has already been examined in 
Engineering application 22.3, and is given by 
de(t) i e[n] — e[n — 1] 
dt T 
There are several possible ways of approximating the process of integration. One method 
is illustrated in Figure 22.14. Here the area under the curve is approximated by a series 


of rectangles, each of width 7. If the approximate area under the curve from t = 0 to 
t = nT is denoted by x[n], then 


x[n] = x[n — 1] + Te[n] (22.5) 


The discrete form of Equation (22.4) can now be formulated. It is given by 
- -1 
m[n] = K,e[n] + Kx[n] + aa (22.6) 


Equations (22.5) and (22.6) form a set of equations to implement a discrete form of the 
p.i.d. controller on a digital computer or microprocessor. These two equations are termed 
coupled difference equations because both are needed to calculate m[n]. In addition, 
Equation (22.5) is recursive. A flow chart for implementing these equations is shown in 


Figure 22.15. 
Time to 
sample? 
¥ 


ES 
Calculate 
m{n] 


Output m[n] 


Figure 22.14 
Approximating the area under the curve by a Figure 22.15 
series of rectangles. Flow chart for a p.i.d. controller. 
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EXERCISES 22.5 


2 Draw a block diagram for Equations (22.5) 
and (22.6). 


1. A transducer is used to measure the speed of a motor 
car. Design a digital filter to calculate the distance 
travelled by the car. 


Solutions 


1. s[n] = s[n — 1] + Tv[n]. See Figure S.25. 


Figure S.25 


| 22.6 | NUMERICAL SOLUTION OF DIFFERENCE EQUATIONS 


Having seen how difference equations are formulated we now proceed to methods of 
solution. The numerical method illustrated may be applied to all classes of difference 
equation. 


Example 22.6 Given 
x[n + 1] —x[n] =n x[0] = 1 


determine x[1], x[2] and x[3]. 


Solution The terms in the equation are evaluated for various values of n: 


n=0 n=1 n=2 
x[1] — x[0] = 0 x[{2] — x[1] = 1 x[3] — x[2] = 2 
x[1] = 1 x[2] = 2 x[3] = 4 


Example 22.7 Determine x[4] given 


Ax[k +2] —x[k+1])4+2fk)=-h x0) =1 xf1] =3 
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Solution k=0 k=1 
2x[2] — x[1] + x[0] = 0 2x[3] — x[2] + x[1] = -1 
3 
x[2] = 1 x[3] = = 
k=2 

2x[4] — x[3] + x[2] = —4 
4 13 

x[4] = ae 


As the previous examples illustrate, to determine a unique solution to a first-order equa- 
tion requires one initial condition; for a second-order equation, two initial values are 
required. 


Engineering application 22.5 


Numerical solution of the output from a digital low-pass filter 


Recall from Engineering application 22.2 that a low-pass filter is a filter that is de- 
signed to attenuate high frequency signals. A simple difference equation for a digital 
filter is given by: 
yln] — ay[n — 1] = x[n] 
If a is positive then the filter is a low-pass filter. We will choose a = 0.5 and so 
y[n] = 0.5y[n — 1] + x[n] 
Let us examine the response of this filter to a unit step input applied at n = 0, that is 


QO ne 
lel 1 otherwise Bee 


Assume the output of the filter is zero prior to the application of the step input, that 
is y[n] = 0 for n < —1. From the difference equation, we find 


y[0] = O.5y[—1] + x[0] 


= 0.5(0) + 1 
= ll 
Similarly, 
yLL] = 0.5y[0] + x{1] 
=0.5()+1 
= 1.5 


and so on. When numerically solving a difference equation, it is often useful to form 
a table with intermediate results. Table 22.1 shows such a table. 

Figure 22.16 shows the input and output sequences superimposed on the same 
graph. The sequence values have been joined to illustrate their trends. Note that the 
input signal reaches its final value immediately whereas the output signal takes sev- 
eral sample intervals to reach its final value. Engineers often refer to this process as 
‘smoothing’ the input signal. We will see in Chapter 23 that rapidly changing signals 
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Table 22.1 

Numerical solution of a difference 
equation. 

n x[n] yin — 1] y[n] 
—1 0 0 0 

0 1 0 1 

1 1 1 1.5 
D 1 ES 1.75 
3 1 1.75 1.88 
4 1 1.88 1.94 
5 1 1.94 1.97 
6 1 1.97 1.98 
7 1 1.98 1.99 
8 1 1.99 2.00 
9 1 2.00 2.00 
10 1 2.00 2.00 


2 y[n] 


1 x[n] 


Figure 22.16 
Input and output sequences for the 
low-pass filter. 


tend to be richer in high frequencies than those that change more slowly. The effect 
of the low-pass filter is to filter out these high frequencies and so the output from 
the filter changes more slowly than the input. In Chapter 23 we will also examine a 
continuous low-pass filter which does the same job for continuous signals. 


®@ tezS34a 567 8 Qi@ # 


Numerical solution of difference equations is often the only feasible method of ob- 
taining a solution for many practical engineering systems. This is because the input terms 
are usually obtained as a result of sampling an input signal and so cannot be expressed 
analytically. However, in some cases it is possible to express the input analytically and 
then an analytical solution to the difference equation may be feasible. While analytical 
methods analogous to those applied to differential equations are available, z transform 
techniques are more popular with engineers and so these are introduced in the following 


sections. 


EXERCISES 22.6 


1 Given 
x[n + 2] + x[n + 1] —x[n] = 2 
x[0] = 3 x[1] =5 
find x[2], x[3], x[4] and x[5]. 
2 If 
anjin—W=n afl] =7 
find z[2], z[3] and z[4]. 
3 Determine x[2] and x[3] given 


(a) 2x[n + 2] —5x[n + 1] = 4n, x[1] =2 


(b) 6x[n] — x[n — 1] + 2x[n — 2] 
=n? —n,x{0] = 1,xf1] =2 


(c) 3x[n 


— 1] 4+ 2x[n — 2] — 9x[n — 3] 


= (n— 1)", x[0] = 3, x[1] =2 


Calculate the first five terms of the following 
difference equations with the given initial conditions: 


(a) x[n+ 1] —x[n] = 2, x[0] = 3 
(b) x[n +2] +2x[n + 1] —x[n] = 4, 


x[0] 


=5,x[1] =7 


(c) x[p+2]—x[p + 1] 4+ 2x[p] = 2, 


x[0] 


=1,x{1)=1 
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Solutions 
1 0,7, —5, 14 4& (a) 3,5,7,9, 11 
4 63 64 (b) 5,7, 2,9, 3 
7 4°63 (c) 1,1,1,1,1 
29 1 7 29 52 


3 (a) 5, = (b) (c) Sy 


Technical Computing Exercises 22.6 


You should observe a sine wave on the output y(7). 


1. Write a computer program to implement the - : 
Make a note of its amplitude. 


difference equations given in Engineering application 


22.5. Verify that your results agree precisely with 4 Investigate what happens to the amplitude of the 

those given in Table 22.1. output when you increase the frequency of the input 
2 Modify your program so that the input sequence is enreaneat The frequency can be increased by 

x[n] = [01000000000 OJ. The output reducing the value of the denominator in the sine 


is known as the impulse response of the system. function, for example, 


Observe the shape of the plot and compare it to the x[n] = sin(27/10) 


output sequence in the Example. 
5 Does the result agree with the description of the 


operation of the discrete-time filter described in the 
example? 


3 Now change your program so that it will generate a 
total of 100 samples of output and change the input 
sequence to: 


x[n] = sin(27tn/100) 


Ya ~6DEFINITION OF THE z TRANSFORM 


Suppose we have a sequence f[k], k € N. Such a sequence may have arisen by sampling 
a continuous signal. We define its z transform to be 


G2 lil) oe rllcn (22.7) 


k=0 


We see from the definition that the z transform is an infinite series formed from the terms 
of the sequence. Explicitly, we have 


Z(fIkI} = f10] + A Ps i " oO 


In most engineering applications we do not actually need to work with the infinite series 
since it is often possible to express this in a closed form. The closed form is generally 
valid for values of z within a region known as the radius of absolute convergence as 
will become apparent from the following examples. 


Example 22.8 


Solution 


Example 22.9 


Solution 


Example 22.10 


22.7 Definition of the ztransform 699 


Find the z transform of the sequence defined by 


1 k=0 


k|= 
STK] 0 k#0 


This sequence is sometimes called the Kronecker delta sequence, often denoted by 
6[k]. 


Z {fl} = > flee 
k=0 
_ qoy+ SH, £2 


Zz 2 


git x... 


23 


0 oO 0 

SL ae Sr = aS ee 
z z z 

=1 


Hence F(z) = 1. 


Find the z transform of the sequence defined by 
fik)=1 keN 


This is the unit step sequence, often denoted by u[k]. 


Z {fll} = >> flee 
k=0 


1 1 1 
Sa ee ag 
z z z 


This is a geometric progression with first term | and common ratio —. The progression 
: g : F ae z 
converges if |z| > 1 in which case the sum to infinity is 


1 _ & 
1-1/z z-1 


We see that F(z) has the convenient closed form 


& 
S| 


for |z| > 1. 


Note that the process of taking the z transform converts the sequence f[k] into the con- 
tinuous function F(z). 


Find the z transform of the sequence defined by f[k] = k, k € N. This sequence is called 
the unit ramp sequence. 
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Solution F(z) = Z{f[k]}= > kze* 


k=0 


i. = 3 
=-}1l+-+5+-- 
Zz r4 & 


~2 
If we use the binomial theorem (Section 6.4) to express (1 - 7) as an infinite series, 
Zz 


we find that 
ae 2.3 panailltl 
1-- = bp Sb ee provided |—| < 1, that is |z| > 1 
Zz gz £ v4 


Using this result we see that Z{f[k]} can be written as 


1 i 
F@= -(1-3) 
z ra 


and so 
0 2 for || > 1 
z= = or |z| > 
z(d—-1/z)? (-1) 


Example 22.11 Find the z transform of the sequence defined by 
S[k] = Ak A constant 


Solution We find 


F() = 2(f1k}} = Ake 


k=0 


= @=12 using Example 22.10 
—_ 


In the same way as has been done for Laplace transforms, we can build up a library of 
sequences and their z transforms. Some common examples appear in Table 22.2. Note 
that in Table 22.2 a and b are constants. 


Example 22.12 Use Table 22.2 to find the z transforms of 
(a) sintk —(b) e* cos 2k 


Solution Directly from Table 22.2 we find 
(a) Z{sin 1} = a 
2 — 2zcos 5 +1 
2 — ze cos2 
2 — 2ze3 cos2 + e® 


(b) Z{e* cos 2k} = 
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Table 22.2 
The z transforms of some common functions. 
FI] F(z) FIK] F(Z) 
1 k=0 zz+ 1) 
d[k] = 1 ag 
e i k£0 cy 
ui) = 1 k>0 z 2B 2(z2 + 42+ 1) 
0 k<0O z-1 (z- 1)4 
k — sin ak ee 
(z— 1)? 22 —2zcosat 1 
ea : cos ak Sa 
z—-e 4 2 —2zcosat1 
ak Zz en ak gin bk ze“ sinb 
zZ—a 22 — 2ze-4 cosh + e~ 24 
2 = 
kak a e~“ cos bk a 
(z— a)? 22 — 2ze-4 cosb + e724 
ak az(z+ a) 
(z — a)? 
EXERCISES 22.7 
1. Using the definition of the z transform, find (d) e* sin 2k (e) 4* (f) (—3)* 
closed-form expressions for the z transforms of the _ [kn kr 
following sequences f[k] where (g) sin By (h) cos 2 
(a) f[0] =0, f[1] =0, f[k] =1 fork > 2 ; 
0 k=-0.1.....5 3 Find, from Table 22.2, the sequences which have the 
(b) f[k] = ep 5 Te following z transforms: 
2 3 
(c) flk] = 3k,k > 0 (a) — (b) 4 (c) <a 
(@) fli —e—*,k =0,1,2,... oe a, a 
d meee 
(e) flO) = 1, (11 = 2, F121 =3, fk) = 0.k > 3 oa Bad 
(f) flO] = 3, fk] = 0, k #0 4 By considering the Taylor series expansion of el/z, 
(g) fikl= | : : a find the z transform of the sequence 
; < 


1 
2 Use Table 22.2 to find the z transforms of i= kt hee 


(a) cos 3k (b) ek (c) e72* cosk 


Solutions 
1 4 242743 2z 
le 23 Ps @ > 03 @4 
= ge (z—cos3) 
” @—Ip ez— 1 .@G. = oy = 


2 —2zcos3 +1 z—e 
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2 — ze? 


(c) 


cos | Z Zz 


2 


22 —2ze~2 cos 1 + e~4 


(h) 


2+ 


és ze? sin2 3 (a) (-4)* —b) 1/2)" =~ ©) (-1/3)4 
22 — 2ze4+ cos2 + (d) ek (e) sin(7tk/2) 
z & 
(e) a= (f) 3 4 el/z 


SAMPLING A CONTINUOUS SIGNAL 


We have already introduced sampling in Section 22.4. We now return to the topic. Most 
of the signals that are encountered in the physical world are continuous in time. This 
means that they have a signal level for every value of time over a particular time interval 
of interest. An example is the measured value of the temperature of an oven obtained 
using an electronic thermometer. This type of signal can be modelled using a continuous 
mathematical function in which for each value of ¢ there is a continuous signal level, 
f(t). Several engineering systems contain signals whose values are important only at 
particular points in time. These points are usually equally spaced and separated by a time 
interval, 7. Such signals are referred to as discrete time, or more compactly, discrete 
signals. They are modelled by a mathematical function that is only defined at certain 
points in time. An example of a discrete system is a digital computer. It carries out 
calculations at fixed intervals governed by an electronic clock. 

Suppose we have a continuous signal f(t), defined for t > 0, which we sample, that 
is measure, at intervals of time, T. We obtain a sequence of sampled values of f(t), 
that is f[O], f[1], [2], ..., f[A], .... Returning to the example of the oven temperature 
signal, a discrete signal with a time interval of 5 seconds can be obtained by noting 
the value of the electronic thermometer display every 5 seconds. Some textbooks use 
the notation f[kT] as a reminder that the sequence has been obtained by sampling at 
an interval T. We will not use this notation as it can become clumsy. However, it is 
important to note that changing the value of T changes the z transform as we shall see 
in Example 22.13. It can be shown that sampling a continuous signal does not lose the 
essence of the signal provided the sampling rate is sufficiently high, and it is in fact 
possible to recreate the original continuous signal from the discrete signal, if required. 
It is often convenient to represent a discrete signal as a series of weighted impulses. The 
strength of each impulse is the level of the signal at the corresponding point in time. We 
write 


f°) = Yo flkls¢ — kT) (22.8) 


k=0 


the * indicating that f(t) has been sampled. This representation is discussed in 
Appendix I. This is a useful mathematical way of representing a discrete signal as the 
properties of the impulse function lend themselves to a value that only exists for a short 
interval of time. In practice, no sampling method has zero sampling time but provided 
the sampling time is much smaller than the sampling interval, then this is a valid math- 
ematical model of a discrete signal. 
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We can apply the z transform directly to a continuous function, f(t), if we regard the 


function as having been sampled at discrete intervals of time. Consider the following 
example. 


Example 22.13 


(a) Find the z transform of f(t) =e sampled at t = 0, 0.1, 0.2,.... 


(b) Find the z transform of f(t) = e sampled at t = 0, 0.01, 0.02,.... 


(c) Express the sequences obtained in (a) and (b) as series of weighted impulses. 


(a) The sequence of sampled values is 


wm 


wm 


that is, 
e Olk keNandT =0.1 


From Table 22.2 we find the z transform of this sequence is 


—e01° 
The sequence of sampled values is 
{.e-00! 002 
that is, 
ek  keN and T=0.01 
From Table 22.2 we find the z transform of this sequence is —. We note 
ce. 


that modifying the sampling interval, T, alters the z transform even though we are 
dealing with the same function f(t). 


When T = 0.1 we have, from Equation (22.8), 


fO= » e 9 k5(¢ — 0.1k) 


k=0 


=e "SG =01) +6 "5G — 02) + 


Note that an advantage of expressing the sequence as a series of weighted impulses is 
that information concerning the time of occurrence of a particular value is contained 
in the corresponding 6 term. 

When T = 0.01, we have 


fO = doe °"S(t — 0.01k) 


k=0 


= (6) +e "'8¢ =0.01) +e “8 = 0,02) ++: 


Example 22.14 (a) Find the z transform of the continuous function f(t) = cos 3t sampled at t = kT, 


keEN. 


(b) Write down the first four terms of the sampled sequence when T = 0.2, and express 


the sequence as a series of weighted impulses. 
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Solution (a) The sampled sequence is 

Fk] = cos 3kT = cos((3T )k) 

The z transform of this sequence can be obtained directly from Table 22.2 from 
which we have 
z(z — cosa) 

Zz A} = ———____. 
songs) 2 —2zcosa+1 
Writing a = 3T we find 

z(z — cos 3T) 
2 —2zcos3T +1 
(b) When T = 0.2, the first four terms are 


Z {cos 3kT} = 


1 cos 0.6 cos 1.2 cos 1.8 


From Equation (22.8), the sequence of sampled values can be represented as the 
following series of weighted impulses: 


f°) = > cos 3kT5(t — kT) 


k=0 
= Fy cos 0.6kd(t — 0.2k) 
=0 


k 
= d(t) + cos 0.66 (t — 0.2) + cos 1.25(t — 0.4) + cos 1.86(¢ — 0.6) + --- 


| 22.9 | THE RELATIONSHIP BETWEEN THE z TRANSFORM 
AND THE LAPLACE TRANSFORM 


We have defined the z transform quite independently of any other transform. However, 
there is a close relationship between the z transform and the Laplace transform, the z 
being regarded as the discrete equivalent of the Laplace. This can be seen from the fol- 
lowing argument. 

If the continuous signal f(t) is sampled at intervals of time, T, we obtain a sequence 
of sampled values f[k], k ¢ N. From Section 22.8 we note that this sequence can be 
regarded as a train of impulses. 


f°O = >_ flkS@ — kT) 


k=0 


Taking the Laplace transform, we have 


cirion= f eS ° flk]8(t — kT) dt 
k=0 


= ye fo e "S(t — kT) dt 
k=0 0 


Example 22.15 


Solution 
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assuming that it is permissible to interchange the order of summation and integration. 
Noting from Table 21.1 that the Laplace transform of the function 5(t — kT) is e~**", we 
can write 


Afi =) sine” (22.9) 


k=0 


Now, making the change of variable z = e’, we have 


Lip oy= > fac 


k=0 
which is the definition of the z transform. The expression £{f*(t)} is commonly written 
as F*(s). 

In Appendix I it is shown that the continuous function f(t) can be approximated by 
multiplying the function f*(t) by the sampling interval 7. Correspondingly, TF*(s) is 
an approximation to the Laplace transform of f(t). This is illustrated in the following 
example. 


1 
Consider the function f(t) = u(t) e~ which has Laplace transform F (s) = 1 Sup- 
Ss 


pose f(t) is sampled at intervals T to give the sequence f[k] = e*’, for 
k=0,1,2.... 

(a) Use Table 22.2 to find the z transform of f[]. 

(b) Make the change of variable z = e*” to obtain F*(s). 

(c 


~ 


wm 


Show that provided the sample interval T is sufficiently small, 7 F'*(s) approximates 
the Laplace transform F(s). 


(a 


~ 


From Table 22.2 we find Z{f[k]} = F(z) = 


z-eT 
sT 
(b) Letting z =e gives F*(s) = >: Dividing numerator and denominator by 
e* —e7 
sl gt F* _ 1 
e gives (s) = [—e-rd+s)" 
2 
(c) Using the power series expansion for e* we can write eX = 14+ x-+ i +.---.So we 
can approximate e~7“'+») for sufficiently small T as 1 — T(1 +s). Hence 
1 
F*(s) & 
1-—(d-T(Q+s)) 
_ 1 
~ Td +s) 


1 
and so TF*(s) © Tas that is the Laplace transform of f(t). 
Ss 


We have illustrated the connection between the two transforms and shown how the z 
transform can be regarded as the discrete equivalent of the Laplace transform. 
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22.9.1 


Example 22.16 


Solution 


Mapping the s plane to the z plane 


When designing an engineering system it is often useful to consider an s plane repre- 
sentation of the system. The characteristics of a system can be quickly identified by the 
positions of the poles and zeros as we saw in Chapter 21. Engineers will often modify the 
system characteristics by introducing new poles and zeros or by changing the positions 
of existing ones. Unfortunately, it is not convenient to use the s plane to analyse discrete 
systems. For a sampled signal Equation (22.9) yields 


P(g =L@)}=> > fle * 


k=0 


The continuous signals and systems that were analysed in Chapter 21 had Laplace trans- 
forms that were simple ratios of polynomials in s. This was one of the main reasons for 
using Laplace transforms to solve differential equations; the problem was reduced to one 
of reasonably straightforward algebraic manipulation. Here we have a Laplace transform 
that is very complicated. In fact it can have an infinite number of poles and zeros. To see 
this consider the following example. 


The continuous signal f(t) = cos() is sampled at 1 second intervals starting from 
t=0. 

(a) Find the Laplace transform of the sampled signal f*(t). 

(b) Show that F*(s) has an infinity of poles. 


(c) Find the z transform of the sampled signal and show that this has just two poles. 


Tt 
(a) The continuous signal f(t) = cos() sampled at 1 second intervals gives rise to 


the sequence 1, 0, —1, 0, 1, 0, —1,.... Consequently, from Equation (22.9) 


F*(s) =L{f*(} = a fikle™ = e fikle — since T =1 
k=0 


k=0 


that is, 


F*(s) =1+0-e%+0+e%+40-e% 4... 


—2s 


This is a geometric progression with common ratio —e~~ and hence its sum to 


infinity is 
1 
that is, 
MOA Tyee 
~2s 


(b 


we 


Poles of F*(s) will occur when | + e~* = 0. Writing s = o + jw we see that poles 
will occur when e~?°°+@) = —1. Since —1 can be written as @"-)", n € Z (see 
Chapter 9), we see that poles will occur when 


e727 -2je = el@n-l)n 
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-1 


Figure 22.17 
The sampled signal has an infinite Figure 22.18 
number of poles. There are two poles at z = +). 
that is, when o = 0 and w = —(2n — |)7t/2. Thus there exist an infinite number of 


poles occurring when 
Tt 
i cae neZ 


Some of these are illustrated in Figure 22.17. 


(c) The z transform of the sampled signal is 


: 0 1 0 1 
ZAtf(o}= 14+ oP Sr ae 
z & ‘6 z 
7 1 
~ 1-(-1/2) 
~ 4] 


which has just two poles at z = -+j as shown in Figure 22.18. 


It is possible to show in general that the result of sampling a continuous signal is 
to convert each simple pole of the Laplace transform into an infinite set of poles. Sup- 
pose that the Laplace transform of the signal f(t) can be broken down by using partial 
fractions into a series of n + | terms with simple poles do, a,,...,a,. For simplicity, 
repeating poles will not be considered but the proof for such a case is similar. So, 


A A A A 
Of 4 4 Ft 


S—d S—-a s—a, s—a 


F(s) = 


In the time domain this corresponds to 


f@—Age™ + Ayer +s FA 
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Now we consider the Laplace transform of the sampled signal f*(t): 


F*(s) = Lif} =) fle 
k=0 
= Yo eokT +A, et kT dees +A,e%'") ewok 
k=0 
— Ao > e kT (sag) +A, ~ e fT (s~a,) ienee" 
k=0 k=0 


lo) 
—kT(s—a_) 
ce. wee 
k=0 


Now each of the summations can be converted into a closed form. For example, 


[ore] 
pee a 1 a ef 4) fe e727 (8-4) 4: e 3F 6-4) as on 
k=0 
1 
_ 1 _ e P49) 
Therefore, 
A A A 
* = 0 1 i 

F (s) = 1 _ eT (sa) = 1 _ eT (s-a) + at 1 _ eT (s-4,) 


It is possible to show that for each simple pole in F(s) there is now an infinite set of 
poles. Consider the pole at s = a). This contributes the term 


Ao 
= ef 4) 


to F*(s). This term has poles whenever 1 — e~7°~4) = 0, that is e~7°~%) = 1. This 
corresponds to T (s — ay) = 271mj, m € Z. Therefore, 


T(s — dy) = 27mj 


27m 
S—h= ad 
27™™. 
s=a +4 meZ 


The effect of sampling is to introduce an infinite set of poles. Each one is equal to the 
pole of the original continuous signal but displaced by an imaginary component. This 
is illustrated in Figure 22.19 for a real pole a). However, the proof is equally valid for 
a complex conjugate pair of poles but the diagram is more cluttered and has, therefore, 
not been shown. 

Clearly discrete systems are not amenable to s plane design techniques. Fortunately, 
the z plane can be used for analysing discrete systems in the same way that the s plane 
can be used when analysing continuous systems. It is possible to map points from the s 
plane to the z plane using the relation z = e*? which gives rise to the definition of the z 
transform as described previously. 
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jo s plane 

X dag t+ Br j 
X dg + & j 
X ay + Fj 


27; 
x = 
ag + Fj 


jo s plane ] z plane 


a9 ——- 
o 
Xa 2a; 
0-7 J 
4a: 
xX dy— 7 J 


X dg - © j 


8a; 
X dg - Td N 
Figure 22.19 Figure 22.20 
The effect of sampling is to introduce an The left half of the s plane maps to the inside of the unit 
infinite set of poles. circle of the z plane. 


The great advantage of the z plane is that it eliminates the problem of infinitely re- 
peating poles and zeros when analysing discrete systems. This can be illustrated by con- 
sidering how the imaginary axis of the s plane maps to the z plane. 

Referring to Figure 22.20, we see that s = o + jw. On the imaginary axis 0 = 0 


: : ™ Tt . : 
and therefore z = e*’ = e'”". As w varies between -F and rT the locus of z is a circle 


Tl 
of radius |, centred at the origin (see Section 9.10). As w is increased from 0 to T the 


T 
upper half of the unit circle is traced out, while as w is decreased from 0 to FP the 
T 
lower half of the unit circle is traced out. Increasing w above T or decreasing it below 


Tt 
“FF leads to a retracing of the unit circle. In other words, the repeated s plane points are 


superimposed on top of each other. This is the reason why the z plane approach is much 
simpler than the s plane approach when analysing discrete systems. 

The z transforms of discrete signals and systems are, in many cases, simple ratios of 
polynomials. We shall see shortly that this means the process of analysing difference 
equations which model these signals and systems is reduced to relatively simple alge- 
braic manipulations. 


(eG) PROPERTIES OF THE z TRANSFORM 


Because of the relationship between the two transforms we would expect that many of 
the properties of the Laplace transform would be mirrored by properties of the z trans- 
form. This is indeed the case and some of these properties are given now. These are: 
(1) linearity; 

(2) shift theorems; 


(3) the complex translation theorem. 
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22.10.1 


Example 22.17 


Solution 


Example 22.18 


Solution 


Example 22.19 


Solution 


Linearity 


If f[k] and g[k] are two sequences then 
ZA fk] + sik]} = Z{f1K]} + Z {sik} 


This statement simply says that to find the z transform of the sum of two sequences we 
can add the z transforms of the two sequences. If c is a constant, which may be negative, 
and f[k] is a sequence, then 


Z{cflk]} = cZ{fIk]} 


Together these two properties mean that the z transform is a linear operator. 


Find the z transform of e~* + k. 


From Table 22.2 we have 
Ze = — 


z—e! 
and 

Zz 
g= 1)? 


Zk = 


Therefore, 
Z ii Z 
—e!  (z-—1) 


Z{e* +k} = : 


Find the z transform of 3k. 


From Table 22.2 we have 
Zz 
Z{k} = 
{k} G@- 1D? 
Therefore, 


3z 


Zz 
Z{3k} = 3Z{k} =3 x G=-i2 “Goi 


Find the z transform of the function f(t) = 2t? sampled at t = kT, k € N. 


The sequence of sampled values is 

fk] = 2(kT)* = 27° 
The z transform of this sequence can be read directly from Table 22.2 using the linearity 
properties. We have 


2T72(z + 1) 


ZQT?R) = 27’ Z2{k} = @-1 
—_ 


22.10.2 


Example 22.20 


Solution 


0 
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First shift theorem 


If f[k] is a sequence and F(z) is its z transform, then 

Z2(fik-+il} =zF@)—@flOl+z “flll+---+2fle—1l),1eN* (2.10) 
In particular, if i = 1 we have 

Ziv le i) =r @) — 2/10) Coa) 
If i = 2 we have 


2 ple 21) oe) Ola 


The sequence f[k] is defined by 


0 £50, 1,2;3 
ftel= {i ba 5, 6:5 


Write down the sequence f[k + 1] and verify that 
Z{f[k + 1]} = zF(z) — zf[0] 


where F(z) is the z transform of f[k]. 


The graph of f[k] is illustrated in Figure 22.21. The sequence f[k + 1] is defined as 
follows: 


] = f[U1] which is 0. 

] = f[2] which is 0. 
When k = 2, f[k + 1] = f[3] which is 0. 

] = f[4] which is 1, and so on. 
Consequently, 


fik+ n= {1 : 


as illustrated in Figure 22.22. We see that the graph of f[k + 1] is simply that of f[k] 
shifted one place to the left. More generally, f[k-+ i] is the sequence f[k] shifted i places 
to the left. Now 


S{k + 1) 
1 


123 45 678 9 k 0123 45 678 9 &k 


Figure 22.21 Figure 22.22 
The sequence of Example 22.25. A shifted version of the sequence of Figure 22.21. 
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22.10.3 


F(z) = 2{f1k} = D0 flkle* 


k=0 


The same argument shows that 


1 1 
Z{f[k + 1} = ae 
It then follows that 
1 
Z{f[k + 1]} = >= ZF (z) — zf[0] 
zez—1 


since f[0] = 0. This illustrates the first shift theorem. 


Second shift theorem 
The function f(t)u(t) is defined by 


f@O 


f(u(t) = ie foe 


t<0O 


where u(t) is the unit step function. The function f(t —iT)u(t —iT), where 7 is a positive 
integer, represents a shift to the right of i sample intervals. Suppose this shifted function 
is sampled; then we obtain 


Sk — iJul[k — i] keN 
The second shift theorem states: 
2( fk — iju{k — i)}} = 7‘ F() oe IN 


where F(z) is the z transform of f[k]. 


Example 22.21 


Solution 
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The function ¢ u(t) is sampled at intervals T = | to give k u[k]. This sample is then 
shifted to the right by one sampling interval to give 


(k — 1)u[k — 1] 
Find its z transform. 
Figure 22.23(a) shows t u(t) and Figure 22.23(b) shows the sampled function. Fig- 


ures 22.23(c) and (d) show (¢ — 1)u(t — 1) and (k — 1)u[k — 1], respectively. From 
Table 22.2, we have 


& 
(g=1) 


Zk = 


and so, from the second shift theorem with i = 1, we have 


1 


_— _ =z! . = 
Z{(k — 1)ulk — 1]} =z (z—1)? (z— 1)? 


(b) 


Figure 22.23 
Graphs for Example 22.21: (a) t u(t); (b) k u[k]; (c) (¢ — 1)u(t — 1); (d) (Kk — Du[k - 1). 


Example 22.22 Find the z transform of the unit step function u(t) and the shifted unit step u(t — 2T), 


Solution 


sampled at intervals of T seconds. 


If the function u(t) is sampled at intervals T then we are concerned with finding the 


z transform of the sequence u[k]. This has been derived earlier: Z{u[k]} = 7 ‘i If 
—_ 


u(t — 2T) is sampled, we have 
1 k=2,3,4,... 
u[k — 2] = ; 
0 otherwise 
Therefore, by the second shift theorem, 


Z{u[k — 2]} = 2? Z{u[k]} 
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1 
Example 22.23 Find the sequence whose z transform is i: 
z— 


1 z 4.2 


1 
Solution = = 
z-l1 zz—l z-l 


From Table 22.2 we have 
Z(ulk]} = — 
z= 1 
So from the second shift property, we have 
Zz 
—1 


The required sequence is therefore u[k — 1]. 


Z{ulk —1]} =z! 
Zz 


1 
Example 22.24 Find the sequence whose z transform is Zend 
al ie a 
: 1 ‘ 
Solution The expression Pe=Te does not appear in the table of transforms, but we observe 
i Ce 
that 
1 1 z 


Z(z—1)2 a 2 (z—1)2 


and ein does appear. It follows from Table 22.2 that 
Zz 


st) 


G— i 


From the second shift property, z~7 aay is the z transform of (k — 3)u[k — 3]. 
Z— 


22.10.4 The complex translation theorem 


Zhew Sik} =F (e’z) where F(z) is the z transform of SI. 


Example 22.25 Given that the z transform of cos(ak) is 
z(z — cos a) 
2 —2zcosat+ 1 


—2k 


find the z transform of e~“ cos(ak). 


Solution Since b = 2, the complex translation theorem states that we replace z by e”z in the z 
transform F(z). 


e?z(e?z — cosa) 


F(e’z) = 
ie) e472 —2e2zcosa+1 


is therefore the required transform. 


1 


EXERCISES 22.10 


Use Table 22.2 to find the z transforms of 
(a) 3(4)* + 7k?,k > O 
(b)3e~* sin4dk —k,k > 0 
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Prove the first and second shift theorems. 


Use the complex translation theorem to find the z 
transforms of 


2 Find the z transforms of the following continuous (a) ke—k (b) e~* sink 
ena SPE See ree oan If fk] = 4(3) find Z{f[k]}. Use the first shift 
(a) t (b) 4t (c) sin 2r theorem to deduce Z{f[k + 1]}. Show that 
(d) u(t —4T) (e) e* Z{f[k + 1]} — 32Z{f[A]} = 0. 

3. Find the z transform of (k — 3)u[k — 3] by direct use Write down the first five terms of the sequence 


of the definition of the z transform. Hence verify the 
result of Example 22.24. 


4 Prove that the z transform of e~“ f(t) is F(e“” z). 


defined by f[k] = 4(2)*—!u[k — 1], k > 0. Find its z 
transform directly, and also by using the second shift 
theorem. 


Solutions 
3z Tz(z + 1) 1 
1 (a) —+——_ — 
sm) z-4  (-1)3 2(z—1)2 
3ze! sin4 
(b) 2 2 = ay =) ( re (a) 
Ze~ ~ COS e i (ze? — 1)? 
T 1 4T. : 
Bo = (b) z . (b) ezsin 1 
(z—1) (z—-1) e222 —2ezcos1+1 
in2T 1 
Oa Ag We 
Z Pe z(z—1) 723 ps3 
e) —— 4 
©) z-er 0, 4, 8, 16, 32. —— 
z—2 


INVERSION OF z TRANSFORMS 


Just as it is necessary to invert Laplace transforms we need to be able to invert z trans- 
forms and as before we can make use of tables of transforms, partial fractions and the 
shift theorems. In very complicated cases more advanced techniques are required. 


Example 22.26 If F(z) = = find f[k]. 


> 


Solution Note that we can write F(z) as 


oo 2 i. 3 
ee ee ee ee 


The reason for this choice is that quantities like the first on the r.h.s. appear in Table 22.2. 
From this table we find 


Zz 
=2 


Zi) = ; 
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and write 


ess] 
: 


where Z~! denotes the inverse z transform. Also, 


3 38 Zz 
e=2 77=2 
Z 
= 37! 
7 z—-2 


Using the second shift property we see that 


Zz 
=2 


2{3(2) fk — 1]} = - 


so that 


z+3 Zz v4 
go Sv a ot 2113771 
{35 tsa} + {32 z-2 


= 2 4.3(2)* luk — 1] 


my k=0 
2k 4.3(2)-! k= 1,2,... 


k=0 
a 14.3(2)! k= 1,2,... 


=0 
oe rae) oe 


Often it is necessary to split a complicated expression into several simpler ones, using 
partial fractions, before inversion can be carried out. Also, if we examine the z transform 
table, Table 22.2, we notice that nearly all of the entries have a z term in the numerator. 
For this reason it is convenient to divide a complicated expression by z before splitting 
it into partial fractions. We illustrate this technique by means of an example. 


Example 22.27 Find the sequence whose z transform is 


22 —z 


P= Gea) 


Solution The first stage is to divide the expression for F(z) by z. We have 


227 —z 
ae ee 
_ z(2z — 1) 
= Be ha 
F(z) - 2z—1 


z (z—5)(z+4) 


22.11.1 


Example 22.28 


Solution 


Example 22.29 


Solution 
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We now split the r.h.s. expression into partial fractions using the standard techniques 
discussed in Section 1.7. This gives 


PO) A 1 
zg <-5 z+4 
Multiplying by z gives 
Zz Zz 
F() =— s+ 
@) z—-5 z+4 


It is now possible to invert this expression for F(z) using the z transform table, 
Table 22.2. This gives 


fk] = 5* + (—4)4 


Direct inversion 


Sometimes it is possible to invert a transform F(z) directly by reading off the coeffi- 
cients. 


Find f[k] if 
FQSlte se oe" 4s 
Using the definition of the z transform we see that 


Occasionally it is possible to rewrite F(z) to obtain the required form. 


1\> I 
Use the binomial theorem to expand (1 - =) up to the term —. Hence find the 
z z 


3 


ee ate 
Gal) 


sequence with z transform F(z) = 


Using the binomial theorem, we have 


=9 2 
(22) 90S) 
Zz Zz 2! Zz 


= ae = 3 = Ss = = 4 
4! 3)(—4)( >(-2) 4S 3)(—4)(—5)( >(-2) 4 


3! Z 4! Zz 
3 6 10 = #15 
= be ory eae 
Z Zz Zz Zz 


provided |z| > 1. Since 


bag e=1\7 t\? 
FO= gay =( z ) = ee, 
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we have 


2 6 10°, 15 
F@=1+-+5+ 5+ 
& Fé z 


Thus F(z) can be inverted directly to give 


_ k+2)K +1) 


Fk) =1,.3,:6,.10, 15,...... that is, f[<] = ———, k>0 
EXERCISES 22.11 
1 Find the inverse z transforms of the following: (e) 22° 
if ae (z — 1)(z — 0.905) 
@ ®) == Er 
z-4 37° —4z-—7 xpress 
ctl ad +2 @+1Qz-3)@-2) 
© G2 © G=3%EG-n n= ZB 
2 Find the inverse z transforms of in partial fractions and hence obtain its inverse z 
22 es transform. 
® G-DE-3) 0) Gem Bl ir 
2 z z 10z 
| F = 
ce z (¢—3)(z—4) a (22 — §) " (z— 1-2) 
find f[k]. 
Solutions 
1 (a) 444) (b) e*k 
e 13(7/3)k (1) (c) d[k] — 28[k — 1] + 4* — 34 
” ms q@ GBk+ C13) 
(c) uk 24) — d[k — 2] 2 


3 


(e) 21.05u[k] — 19.05(0.905)* 
May be written as: 


5 1 6 
3k-2y[k — 2] + 3*-3ulk — 3] m?-~-2t3 
@ 19k(3*) : 3u[k]  5(3*) flO] = 2, fl] = —5, f[2] = -1, 
6 4 4 fI3]=6, f[k]=0 k>4 
2 (a) —2(2*) +2(3*) & 10(2* — ufk]) 


Sey THE z TRANSFORM AND DIFFERENCE EQUATIONS 


In Chapter 21 we saw how useful the Laplace transform can be in the solution of linear, 
constant coefficient, ordinary differential equations. Similarly the z transform has a role 
to play in the solution of difference equations. 
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Example 22.30 Solve the difference equation y[k + 1] — 3y[k] = 0, y[0] = 4. 


Solution 


Example 22.31 


Solution 


Taking the z transform of both sides of the equation we have 
Z{y[k + 1] — 3y[k]} = Z{0} = 0 
since Z{O} = 0. Using the properties of linearity we find 
Ztylk + 1} — 32 (y[k]} = 0 
Using the first shift theorem on the first of the terms on the 1.h.s. we obtain 
zZ{y[k]} — 4z — 32 {ylk]} = 0 
Writing Z{y[k]} = Y(z), this becomes 
(z — 3)Y(z) = 4z 


so that 
Az 
Z=3 
The function on the r.h.s. is the z transform of the required solution. Inverting this, from 
Table 22.2 we find y[k] = 4(3)*. 


Y(z) = 


Higher order equations are treated in the same way. 


Solve the second-order difference equation 
yA + 2] —Sy[k+ 1] + 6y[k]=0 yO] =O  yll]=2 
Taking the z transform of both sides of the equation and using the properties of linearity 
we have 
Z{y[k + 2]} — 5Z{y[k + 1]} + 6Z{y[k]} = 0 
From the first shift theorem we have 
LY (z) — 2y[0] — zy[1] — 5(z¥ (z) — zy[0]) + 6Y (z) = 0 
where 
Y(z) = Zyl} 
Substituting values for the conditions gives 


LY (z) — 02 — 2z — 5(z¥ (z) — 0z) + 6Y(z) = 0 
2Y (z) — 52z¥(z) + 6Y (z) = 2z 


Hence 
(22 —5z+6)Y(z) = 2z 


so that 


2z 
Y(z) = Z{y[k]} = 


(g— 2) = 3) 
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Dividing both sides by z gives 
Y@)_ 2 
z  (z—2)(z—3) 
Expressing the r.h.s. in partial fractions yields 


Y@) 2 2 
 g=a goo 
2z 2z 
Y(z) = ae 
@) z-3 z-2 


Inverting gives the solution to the difference equation: 


ylk] = 23°) — 24) 


EXERCISES 22.12 
1. Use z transforms to solve the following difference 2 Solve the difference equation 
tions: 
nae afk +2] — 3x[k + 1] + 2x[k] = 31K] 
(a) x[k + 1] — 3x[k] = —6, x[0] = 1 subject to the conditions x[0] = x[1] = 0. 
(b) 2xfk +1] — x1] = ok. x{0] =2 3 Solve the difference equation 


yk + 2] + 3y[k + 1] + 2y[k] = 0 
(c) x[k +1] +4x[k] = 2k + 1, x[0] = 0 . Ae 
subject to the conditions y[0] = 0, y[1] = 1. 


(d) x[k + 2] — 8x[k + 1] + 16x[k] = 0, 4 Solve the difference equation 
x[0] = 10, x[1] = 20 
xfk + 2] — 7xtk + 1] + 12xfk] = k 


(e) x[k +2] — x[k] = 0, x[0] = 0, x[1] = 1 subject to the conditions x[0] = 1, x[1] = 1. 
Solutions 
k k a 
1) (@) xk} =3-26%) a + ee in 
(©) k (a) 10(4*) — 5(k4*) m (-1* - (2) 
ulk] — (—1)* Ke? eg Dae yak 
a aan 4 67 ag ult ba er) 
May be expressed as x[k] = { ’ : a 
REVIEW EXERCISES 22 
1” State for each of the following equations: 
(i) the order (a) x[n] +2[n — 2] =6 
(i1) the independent variable (b) ykK+ 1] +hky[k—-1]-k=0 
(iii) the dependent variable ©) old+Dyxk+0=2 


(iv) whether linear or non-linear (d) z[n] — e[n — 1] = n?z[n — 2] 


(e) glk +3) + Valk + 2] = qk] — 1 
For each linear equation state whether it is 


homogeneous or inhomogeneous. 


Given 


3(x[n +1])? —2x[n]J =n? ~—x[0] = 2 


find x[1], x{2] and x[3]. 


Rewrite each equation so that the highest argument of 

the dependent variable is as specified. 

(a) 3ny[n+ 1] —y[n- 1] = nr, highest argument of 
the dependent variable is to be n. 

(b) zk +2] + (3 +k/2)z[k] = Vk zk — 1], highest 
argument of the dependent variable is to be k + 1. 

(c) x[3]x[n] — x[2]x[n — 1] = (1+ 1)’, highest 
argument of the dependent variable is to ben + 1. 


4 Find f[k]if 


1 


2 


3 


z(1 — a) 


ae ee 


Solutions 


(a) Second order, n, x, linear, inhomogeneous 
(b) Second order, k, y, linear, inhomogeneous 
(c) First order, z, y, non-linear 

(d) Second order, n, z, linear, homogeneous 


(e) Third order, k, g, non-linear 


1.1547, 1.0503, 1.4260 


(a) 3(n— 1)y[n] — y[n — 2] = (n— 1)° 
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Find the inverse z transform of 


3z(z + 2) 

(z — 2)(z- 3)? 
+32 

322 +2z-—5 


(a) 


(b) 


The sequence 5[k — i] is the Kronecker delta sequence 
shifted 7 units to the right. Find its z transform. 


Show that sin ak can be written as 


etki = ea 
2) 
Given that 
Ze“ a, 
z—e4 
show that 
Shine = * 


2 —2zcosa+1 


(b) zlk+ 1] + (3+ 5k 0) z[k — 1] 
= Jk = z[k - 2] 

(c) x[3]a{n + 1] — x[2]}x[n] = (n + 2)? 

u[k] — ak 

(a) 12(2*) — 1203") + 543* 


ulk] — (—5/3)*/3 


(b) 5 


NL 
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INTRODUCTION 


The ability to analyse waveforms of various types is an important engineering skill. 
Fourier analysis provides a set of mathematical tools which enable the engineer to break 
down a wave into its various frequency components. It is then possible to predict the 
effect a particular waveform may have from knowledge of the effects of its individual 
frequency components. Often an engineer finds it useful to think of a signal in terms of 
its frequency components rather than in terms of its time domain representation. This 
alternative view is called a frequency domain representation. It is particularly useful 
when trying to understand the effect of a filter on a signal. Filters are used extensively 
in many areas of engineering. In particular, communication engineers use them in signal 
reception equipment for filtering out unwanted frequencies in the received signal; that is, 
removing the transmission signal to leave the audio signal. We shall begin this chapter 
by reviewing the essential properties of waves before describing how breaking down 
into frequency components is achieved. 
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[eee PERIODIC WAVEFORMS 


In this chapter we shall be concerned with periodic functions, especially sine and cosine 
functions. Let us recall some important definitions and properties already discussed in 


Section 3.7. The function f(t) = Asin(@t + ¢) = Asinw| t + — } is a sine wave of 
a) 
: o : 27 
amplitude A, angular frequency w, frequency a period T = — and phase angle @. 
T 2) 


The time displacement is defined to be . These quantities are shown in Figure 23.1. 


Similar remarks can be made about the function A cos(wt + @) and together the sine 
and cosine functions form a class of functions known as sinusoids or harmonics. It will 
be particularly important for what follows that you have mastered the skills of integrating 
these functions. The following results can be found in Table 13.1 (see page 431): 


’ cos nwt sin nwt 
sin nat df = —————_ +c cos nowt dt = +c 


nw nw 
forn = +1,+2,... 


Sometimes a function occurs as the sum of a number of different sine or cosine compo- 
nents such as 


f(t) =2sinaw,t + 0.8 sin2@,t + 0.7 sin 4a,t (23.1) 


The r.h.s. of Equation (23.1) is a linear combination of sinusoids. 

Note in particular that the angular frequencies of all components in Equation (23.1) 
are integer multiples of the angular frequency w,. Functions like these can easily be 
plotted using a graphics calculator or computer graph-plotting package. The component 
with the lowest frequency, or largest period, is 2 sinw,t. The quantity w, is called the 
fundamental angular frequency and this component is called the fundamental or first 
harmonic. The component with angular frequency 2, is called the second harmonic 
and so on. In what follows all angular frequencies are integer multiples of the fundamen- 
tal angular frequency as in Equation (23.1). A consequence of this is that the resulting 
function, f(t), is periodic and has the same frequency as the fundamental. Some har- 
monics may be missing. For example, in Equation (23.1) the third harmonic is missing. 
In some cases the first harmonic may be missing. For example, if 


f(t) = cos 2w,t + 0.5 cos 3m,t + 0.4cos 4a,t + --- 


ui) 


Figure 23.1 
The function: f(t) = Asin(@t + ¢). 
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Example 23.1 


Solution 


Example 23.2 


Solution 


Example 23.3 


Solution 


all angular frequencies are integer multiples of the fundamental angular frequency @,, 
which is missing. Nevertheless, f(t) has the same angular frequency as the fundamen- 
tal. A common value for w, is 1007t as this corresponds to a frequency of 50 Hz, the 
frequency of the UK mains supply. 


Describe the frequency and amplitude characteristics of the different harmonic compo- 
nents of the function 


f(t) = cos 207t + 0.6 cos 607t — 0.2 sin 1407¢ 


The fundamental angular frequency is 207 arising through the term cos 207tt. This cor- 
responds to a frequency of 10 Hz. This term has amplitude 1. The second, fourth, fifth 
and sixth harmonics are missing, while the third and seventh have amplitudes 0.6 and 0.2, 
respectively. 


If f(t) = 2sint + 3cost, express f(t) as a single sinusoid and hence determine its 
amplitude and phase. 


Both terms have angular frequency w = 1. Recalling the trigonometric identity (Sec- 
tion 3.7) 


Rceos(wt — 8) = acosat + bsinat 
b 
where R = a? + b*, tan@ = —, we see that in this case R = /32 +2? = /13 and 
a 
2 
tan@ = 3° that is 9 = 0.59 radians. Therefore we can express f(t) in the form 


f(t) = V13 cos(t — 0.59) 


We see immediately that this is a sinusoid of amplitude 13 and phase angle —0.59 
radians. 


Find the amplitude and phase of the fundamental component of the function 


f@) =0.5 sinw,t + 1.5 cos @,t + 3.5 sin2w,t — 3cos3a,t 


Contributions to the fundamental component — that is, that with the lowest frequency — 
come from the terms 0.5 sinw,t and 1.5 cos q@,t only. To find the amplitude and phase 
we must express these as a single component. Using the trigonometric identity 


Rceos(wt — 8) = acosat + bsinat 
b 

where R = Ja? + b*, tan@ = —, we find 
a 


R= v1.52 + 0.52 = 1.58 
tan@ = ote! that is, 9 = 0.32 radians 
Ts 3 areal 


Therefore the fundamental can be written 1.58 cos(w,f — 0.32) and has amplitude 1.58 
and phase angle —0.32 radians. 


SO 
1 
-l 0 1 
Figure 23.2 
Graph for Example 23.4. 


Example 23.4 


Solution 


Example 23.5 


Solution 


EXERCISES 23.2 


1. Describe the frequency and amplitude characteristics 
of the different harmonic components of the following 


waveforms: 
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2 3 t -l 0 1 2 3 4 5 6 t 


Figure 23.3 
Graph for Example 23.5. 


Many other periodic functions arise in engineering applications as well as the more fa- 


miliar harmonic waves. Remember, to be periodic the function values must repeat at 
27 
regular intervals known as the period, 7. The angular frequency w is given by w = T: 


To describe a periodic function mathematically it is sufficient to give its equation over 
one full period and state that period. From this information the complete graph can be 
drawn as Examples 23.4 and 23.5 show. 


Sketch the graph of the periodic function defined by 
fit) =t O<t<l period | 


To proceed we first sketch the graph on the given interval 0 < t < 1| (Figure 23.2), and 
then use the fact that the function repeats regularly with period | to complete the picture. 


Write down a mathematical expression for the function whose graph is shown in 
Figure 23.3. 


We first note that the interval over which the function repeats itself is 2; that is, period 
= 2. It is then sufficient to describe the function over any interval of length 2. The 
simplest interval to take is 0 < t < 2. We note in this example that a single formula is 
insufficient to describe the function for 0 < t < 2 since different behaviour is exhibited 
in the two intervals 0 < tf < land1 <t <2. For0 <t < | the function is a ramp with 
slope | and passes through the origin, that is it has equation f(t) = ft. For | < ft < 2 the 
function value remains constant at 1. Therefore this periodic function can be described 
by the expression 


i period 2 
ae 


(b) f(t) = sin 40r — 0.5 cos 120t 
+ 0.3 cos 240t 


(a) f(t) = 3 sin 1007¢t — 4 sin 2007 t Use a graph-plotting computer package or graphics 
+ 0.7 sin 3007tt calculator to graph these waveforms. 
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2 Express each of the following functions as a single 
sinusoid and hence find their amplitudes and phases. 


(a) f(t) =2cost —3sint 

(b) f(t) =0.5 cost + 3.2 sint 
(c) f(t) =3cos 3t 

(d) f(t) = 2cos2t +3 sin 2r 


3. Sketch the graphs of the following functions: 
(a) f(t) =t?,-1 <1 <1, period 2 


0 
(b) ro={" 


sint m/2<t<7 


0<t<7/2 periodz 


—t —2<t<0O period3 
0<t<il 


Figure 23.4 


(ce) f(y = 
4 Write down mathematical expressions to describe the 
functions whose graphs are shown in Figure 23.4. 
Solutions 


1 (a) Fundamental frequency is 50 Hz, amplitude 3. (c) 3.cos.3¢; amplitude = 3, phase = 0 


Second harmonic has frequency of 100 Hz, (d) V13cos(2t — 0.983); amplitude = /13, 
amplitude 4. Third harmonic has frequency of phase = —0.983 
150 Hz, amplitude 0.7. i @<7<i 
20 = SES 
(b) Fundamental frequency is ao amplitude 1. mg @ fe) ( l<t<2 period =2 
Second a is missing. Third harmonic has m4 0<r<1/2 
frequency of —Hz, amplitude 0.5. Fourth and (b) ff) = ; = Ge? 21 
T 
fifth harmonics are missing. Sixth harmonic has period = | 
120 
frequency of ——Hz, amplitude 0.3. 0<t<l 
. IO) ij teres 
2 (a) V13cos(t + 0.983); amplitude = J 13, . 
phase = 0.983 period = 3 
(b) 3.24 cos(t + 4.867); amplitude = 3.24, 
phase = 4.867 


GEE] opp and EVEN FUNCTIONS 


The functions sint and cost each possess certain properties which can be generalized to 
other functions. Figure 23.5 shows the graph of f(t) = cost. Itis obvious from the graph 


: ; 7 ‘ : 
that the function value at a negative f value, say — ri will be the same as the function 


value at the corresponding positive ¢ value, in this case ay This is true because the 


graph is symmetrical about the vertical axis. We can therefore state that for any value 
of t, cos(—t) = cost. 


Example 23.6 


Solution 
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Figure 23.5 
The function: f(t) = cost. 


fo) fa) 
3 
(a) (b) : 


Figure 23.6 
Examples of even functions: (a) f(t) = 17; (b) f(t) =3. 


More generally, any function with the property that f(t) = f(—t) for any value of 
its argument, f, is said to be an even function. 


If f(t) = f(—t) then f is an even function. 


In particular, the set of functions cos nwt, for any integer n, is even. The graphs of all 
even functions are symmetrical about the vertical axis — or equivalently, the graph on 
the left of the origin can be obtained by reflecting in the vertical axis that on the right. 
Some other examples of even functions are shown in Figure 23.6. 

Sketching a graph shows up the required symmetry immediately. However, even func- 
tions can be identified by an algebraic approach as shown in Example 23.6. Given any 
function f(t), we examine f(—r) to see if f(t) = f(t). 


Show that f(t) = ?? is even. 


We can argue as follows. If 


fpSr 
then 

f(-t) = (-t)y° 
— 2 


=fO 


so that f(t) is even, by definition. 
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Example 23.7 


Solution 


Example 23.8 


Solution 


Test whether or not the function f(t) = 4¢° is even. 


If 
f(t) =4° 
then 
f(—t) = 4(-1)’ 
= —4° 
=-f@® 


so that f(—t) is not equal to f(t) and therefore the given function is not even. 


Let us turn now to the graph of f(t) = sint in Figure 23.7. It is obvious from the 


™ 
graph that the function value at a negative t value, say — ry will not be the same as the 
T 
function value at the corresponding positive f value, in this case ae This graph is not 


symmetrical about the vertical axis. However, we can state something else. The function 
value at a negative ¢ value is minus the function value at the corresponding positive f 


Tl ™ 
value. For example, sin( - *) =— sin( 7) . In fact, for all values of t we can state that 


sin(—t) = — sint. More generally, any function with the property that f(—t) = —f(t) 
for all values of its argument, ft, is said to be an odd function. 


If f(—t) = —f(t) then f is an odd function. 


In particular, the set of functions sinnwf, for any integer n, is odd. In Example 23.7, 
f(t) = 40, we found that f(—t) = —f(t) so this function is odd. The graph of an odd 
function can be obtained by reflection first in the horizontal axis and then in the vertical 
axis. Some more examples of odd functions are shown in Figure 23.8. 

There are some functions that are neither odd nor even — for example, the exponential 
function (see Chapter 2). 


Show that any function, f(t), can be expressed as the sum of an odd component and an 
even component. 


We can write 


= fi) — fc 
= 5 


_f@ ,f®@ , fC) fh 
=o ToT 3 2 

Rearranging gives 

_f@+fCt) 4fOaFER 

- 2 2 


f@ 
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fo) 


(b) 


Figure 23.7 Figure 23.8 

The function: f(t) = sint. Examples of odd functions: 
(a) ff) =2; 
b) ft) = 1 t>0 
( —l t <0. 


Example 23.9 


Solution 


Now it is easy to check that the first term on the r.h.s. is even and the second term is 
odd, so that we have expressed f(t) as the sum of an even and an odd component as 
required. 


Show that the product of two even functions is itself an even function. Determine whether 
the product of two odd functions is even or odd. Is the product of an even function and 
an odd function even or odd? 


If f(t) and g(t) are even then f(—t) = f(t) and g(—t) = g(t). Let P(t) = f(t)g(t) be 
the product of f and g. Then 
P(-t) = f(—t)g(-t) by definition 
= f(t)g(t) since f and g are even 
= P(t) 


Therefore, P(—t) = P(t) and so the product f(t)g(t) is itself an even function. On the 
other hand, if f(t) and g(t) are both odd we find 


P(—t) = f(—t)g(—t) 
= (-f(t))(-g()) 
= fst) 
= P(t) 


so that the product f(t)g(t) is even. 
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fO 
Figure 23.9 Figure 23.10 
A typical odd function, f(t). A typical even function, f(t). 
If f(t) is even and g(t) is odd, we find 
P(—t) = f(-tg(t) 
= f(@)(-s@)) 
= —f(@)g() 
= =P() 
so the product is an odd function. These rules are obviously analogous to the rules for 
multiplying positive and negative numbers. 
The results of Example 23.9 are summarized thus: 
(even) x (even) = even 
(odd) x (odd) = even 
(even) x (odd) = odd 
23.3.1 Integral properties of even and odd functions 
Consider a typical odd function, f(t), such as that shown in Figure 23.9. Suppose we 
wish to evaluate ‘i f(t) dt where the interval of integration [—a, a] is symmetrical 
about the vertical axis. Recall from Chapter 13 that a definite integral can be regarded as 
the area bounded by the graph of the integrand and the horizontal axis. Areas above the 
horizontal axis are positive while those below are negative. We see that because positive 
and negative contributions cancel, the integral of an odd function over an interval which 
is symmetrical about the vertical axis will be zero. 
Example 23.10 Evaluate {”_f cos not dt. 
Solution The function ¢ is odd. The function cos nwt is even and hence the product f cos not is 


odd. The interval [—71, 7t] is symmetrical about the vertical axis and hence the required 
integral is zero. 


Consider now a typical even function, f(t), such as that shown in Figure 23.10. Suppose 
we wish to evaluate ie f(t) dt. Clearly the area bounded by the graph and the f axis in 


1 


23.3 Odd and even functions 731 


the interval [—a, 0] is the same as the corresponding area in the interval [0, a]. Hence 


we can write 


[oa = 2 [fea 
—a 0 


Example 23.11 Evaluate f”_fsin¢ de. 


Solution The functions ¢ and sint are both odd, and hence their product is even. Therefore, using 
the fact that the integrand is even we can write 


™ Kad 
i rsint dt = 2 [ t sint dt 
—7 0 


So, integrating by parts, 


™ 7 
2 | tsintdt = 21-1 cos t]> +f costar) 
0 0 


= 2((—7cos 7) — (0) + [sint]>) 


= 27 


EXERCISES 23.3 


Determine by inspection whether each of the 
functions in Figure 23.11 is odd, even or neither. 


iG) fo) 
t 
(a) (b) 
fo) fo 
t 
(c) (d) 
Figure 23.11 


(a) The function: f(t) = —r; 
) fO =F +1 


(ORG ear 
(d) f(t) = cost + 0.122. 


By using the properties of odd and even functions 
developed in Example 23.9 state whether the 
following are odd, even or neither: 


(b) tcos 2t 


(d) cos@t sin 2wt 


(a) 1 sin wt 
(c) sinf sin 4t 


(e) e! sint 


Evaluate the following integrals using the integral 
properties of odd and even functions where 
appropriate: 


(a) [2513 de 
(c) f™ 07 sint de 


(e) fi, |tlde 


(b) [513 cos 3¢ dt 
(d)_[?, t cosh 3¢ dt 


(fy ft, tltlde 
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Solutions 
1 (a) odd (b) neither 3 (a) 0 (b) 0 
(c) odd (d) even (c) 0 (d) 0 


(e) 1 (f) 0 


2 (a) even (b) odd (c) even 
(d) odd (e) neither 


Example 23.12 


Solution 


ORTHOGONALITY RELATIONS AND OTHER 
USEFUL IDENTITIES 


Recall from Chapter 16 that two functions f(t) and g(t) are said to be orthogonal on the 
intervala <t < bif 


b 
/ f(t)g(t) dt = 0 


Show that the functions cos mat and cos nwt with m, n positive integers and m + n are 
: T 
orthogonal on the interval -— <t< —. 
a) o 


We must evaluate 


Tt/@ 
/ cos mot cos not dt 


Tt/w 
Using the trigonometric identity 2 cos A cos B = cos(A + B) + cos(A — B), we find the 


integral becomes 


| Tt/w 
= / cos(m + n)@t + cos(m — n)at dt 
2 —Tt/w 


_ Ee +n)ot — sin(m— mary 
~ 2 


(m+ n)o (m—n)w ae 


=0 


since sin(m + n)7t = 0 for all integers m, n. It was necessary to require m + n since 
otherwise the second quantity in brackets becomes undefined. 
Hence cos mot and cos not are orthogonal on the given interval. 


A number of other functions regularly appearing in work connected with Fourier analysis 
are orthogonal. The main results together with some other useful integral identities are 
given in Table 23.1. 
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Table 23.1 
Some useful integral identities. 


slant . 
[ sin =0 for all integers n 
T 
fe dt = 0 [a eee Pea 
0 
T 
[cos ia—/ i n=O 
0 
T mnt  2nnt 0 m#én 
[ i ay n=ne0 
T 
2 t He t 
/ i mmt . 2nmt = 0 mseén 
0 T T/2 m=n4#~0 


T mnt a 2nitt . 
sin OS 7 dt =0 for all integers m and n 
0 


FOURIER SERIES 


We have seen that the functions sinwf, sin2wt, sin3wt,...,cos@t, cos2wf,... are 
periodic. Furthermore, linear combinations of them are also periodic. They are also con- 
venient functions to deal with because they can be easily differentiated, integrated, etc. 
They also possess another very useful property — that of completeness. This means that 
almost any periodic function can be expressed as a linear combination of them and no 
additional functions are required to do this. In other words, they can be used as build- 
ing blocks to construct periodic functions simply by adding particular multiples of them 
together. 

We shall see, for example, that the sawtooth waveform with period 27t, shown in 
Figure 23.12, is given by the particular combination 


1 1 1 1 
f@® = 2{ sint alas es Foe 5 sins ~~) 


This is an infinite series which can be shown to converge for almost all values of ¢ to the 
function f. This means that if any value of t is substituted into the infinite series and the 
series is summed, the result will be the same as the value of the sawtooth function at 
that value of t. There is an exception: iff is one of the points of discontinuity the infinite 
series will converge to the mean of the values to its left and right, that is 0. 

To obtain a feel for what is happening consider Figure 23.13. Graphs (a), (b) and (c) 
show the effect of including more and more terms in the series. As more terms are taken 


Figure 23.12 
Sawtooth waveform. 
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fp) fo 
KO 
t t t 
(a) 
(b) (c) 


Figure 23.13 
Fourier synthesis of a sawtooth waveform: (a) f(t) = 2 sint; 
(b) f(t) = 2(sint — } sin 2r + } sin 3r); 


(c) f(t) = 2(sint — 4 sin 2t + 3 sin3¢ — j sin4r + 5 sin5¢). 


we see that the series approaches the desired sawtooth waveform. The process of adding 
together sinusoids to form a new periodic function is called Fourier synthesis. We see 
that the sawtooth waveform has been expressed as an infinite series of harmonic waves, 
sint being the fundamental or first harmonic, and the rest being waves with frequencies 
that are integer multiples of the fundamental frequency. This infinite series is called the 
Fourier series representation of f(t) and what we have succeeded in doing is to break 
down f(t) into its component harmonic waveforms. In this example, only sine waves 
were required to construct the function. More generally we shall need both sine and 
cosine waves. 

Suppose the function f(t) is defined in the interval 0 < t < T and is periodic with 
period 7. Then, under certain conditions, its Fourier series is given by 


(@ = “0 + SS a, cos anu +b, sin elas (23.2) 
=e en Ao fe f ie ; 


or equivalently 


(C= = + SG cos not + b, sinnot) 


al 


where a,, and b,, are constants called the Fourier coefficients. These are given by the 


formulae 
5} T 
dy = ; f(t) dt (C3r3)) 
T Jo 
Q fp! 2ntt en 
i, = Tr f(t) cos TT dt for n a positive integer (23.4) 
0 
ay pe _ 2nttt ee 
= r f(t) sin T dt for n a positive integer (235) 
0 


a 
The term a represents the mean value or d.c. component of the waveform (see Sec- 


tion 15.2). The derivation of these formulae appears in Example 23.16. It is important to 


Example 23.13 


Solution 
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point out that the integrals in Equations (23.3), (23.4) and (23.5) can be evaluated over 
any complete period, for example from t = — 5 tor = Pie Prudent choice of the interval 


of integration can often save effort. The expression appearing in the r.h.s. of the Fourier 
representation, Equation (23.2), is an infinite series. We list conditions, often called the 
Dirichlet conditions, sufficient for the series to converge to the value of the function 
f(t). The integral f | f(t)| dt over a complete period must be finite, and f(t) may have 
no more than a finite number of discontinuities in any finite interval. Fortunately, most 
signals of interest to engineers satisfy these conditions. At a point of discontinuity the 
Fourier series converges to the average of the two function values at either side of the 
discontinuity. 


1 
Find the Fourier series representation of the function with period T = 50 given by 


a5 1 0<1t<0.01 
~ 10 0.01 <tr < 0.02 


The function f(t) is shown in Figure 23.14. Note that f(t) is defined to be zero between 
t = 0.01 and t = 0.02. This means we need only consider 0 < t < 0.01. Using 
Equations (23.3)-—(23.5) we find 


0.02 0.02 


0.01 
dy = 100 f(t) dt = 100 [ 1ldt+ 100 [ O dr 
0 0 0 


O01 


= 1000 R= 1 


i. sin 100n71]°"! 
a, = 100 cos 100n7t dt = 100} ———_—_— 
- 0 100n7 J 
=0 
[- : —~cos 100n7t 1°"! 
b, = 100 sin 100n7tt dt = 100) ———____— 
i 0 100n7t 0 


1 
= ——<(cosn7t — cos 0) 
nt 


Noting that cos n7t = (—1)"” we find 


1 
b, = —(1 - (-1)") 
Tl 


n 


Figure 23.14 
Graph for Example 23.13. 
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Example 23.14 


Solution 


2 
If n is even b, = 0. If nis odd b, = —. Therefore the Fourier series representation of 
™ 
f(t) is . 
1 2 in 3007 t in 5007 t 
f@= is = (sin 100 + = ae Maia - di +) 


The average value of the waveform is S This is the zero frequency component or d.c. 


value. We note that in this example only odd harmonics are present. 


Find the Fourier series representation of f(t) = 1 +t, —7 <tf < 7, period 27. 


As usual we sketch f(f) first as this often provides insight into what follows (see Fig- 
ure 23.15). Here T = 27, wm = 1, and for convenience we shall consider the period of 
integration to be [—7t, 71]. Using Equation (23.3) we find 


1 7 1 tr?" 
~| tera 145] 
T J on Tt Pal ee 
i ve ae 
 o- 2 B 


a 


Similarly, using Equation (23.4) we find 


1 vis 
a= =f (1 + t) cos nt dt 
T Jn 


Integrating by parts gives 


1 sin nt ]™ ™ sin nt 
a, =—(|(+n) dt 
T n j|_a _x on 
1 cos nt ]™ . ; 
=—(0+ 5 since sin -En7t = 0 
T n og 


1 
=, (cos n7t — cos(—nr7t)) 
Tn 


‘Figure 23.15 


Graph for Example 23.14. 
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but cos(—n7t) = cosn7t and hence a, = 0, for all positive integers n. Using Equa- 
tion (23.5) we find 


1 TT 
b, = ~| (1 +f) sinnt dt 
T Jon 


1 cos nt |™ ™ cos nt 
= (1 +f) + dt 
™ We Nene ae a 


_ ~(-« ise CON ae pe lial 4 [=] ) 
™ n - 


n n 
1 
= —(—27cos nT) 
Tn 
since sin -En7t = 0. Hence, 


n 


2 2 
b, = —-cosnt = —-(-1)" 
n n 


2 
We find b, = 2, b, = —1, b, = =,.... Thus the Fourier series representation is given 
from Equation (23.2) as 


2 
f(t) = 1+2sint — sin 2t + Fg eas 


which we can write concisely as 


lo. e) 


f(tj=1 221)" sin 


n=1 
Example 23.15 Find the Fourier series representation of the function with period 27 defined by 
fH =P,0<t < 20. 


Solution As usual we sketch f(r), as shown in Figure 23.16. Here T = 271 and we shall integrate, 
for convenience, over the interval [0, 27t]. Using Equation (23.3) we find 


ae ie a 
AO | a a 
0 


uC) 


4? 


Figure 23.16 
aT ! Graph for Example 23.15. 
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Using Equation (23.4) we have 


1 27 
2 
~ | t~ cos nt dt 
Tt Jo 


Integrating by parts, we find 
a(| i , sin nt 
a, = t 2t 
TU 0 0 n 
2 27 
—— if t sin nt dt 
nT Jo 
(| cos nt oe he cos nt 
= + 
n 0 0 n 


2 (—2ncos 2nt sin nt ]°" 
5 2 
ni7t n 0 


7 Sin nt 


n 


’ 


: 


n 


4 
Hence a, = 4, a, = l,a,; = ge Similarly, 


1 27 
ae 
b, = -{ t sin nt dt 
™ Jo 


1 5 COs nt “ an cosnt 

== | =e——_— + 2t dt 
Tl n 0 0 
Lf 47 a ia 

= —| ——— cos 2n7m + — t cos nt dt 
T™ n nN Jo 

7 a Ar? 2(([ se] " PP sin nt ir)) 
Tt n n n |o 0 n 

i ( 4? 2 cos ")") 
7 n n n Io 

_ 4a 

3 i 

Thus b, = —4n, b, = —2m,.... Finally, the required Fourier series representation is 


given by 


Ar? 4 
{QS aot POSE COREE anaes 


4 sin 3t 


=n (4sin +2 sin 2t + 


+ -) 
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Example 23.16 Obtain the expressions for the Fourier coefficients a), a, and b, in Equations (23.3), 


Solution 


(23.4) and (23.5). 


Assume that f(t) can be expressed in the form 


~ 2nrt 2nnt 
FoO= oy + Y(« cos = + b,, sin ay 


n=1 


(23.6) 


ot cd i 2mm t 
Multiplying Equation (23.6) through by cos 


- 2mrTtt y ao 2mTtt 
f(t) cos dt = — cos ——— dt 
0 T o 2 T 


+f Qn t +b si 2nit 
a, cos sin cos 
0 n=1 " r 7 T 


If we now assume that it is legitimate to interchange the order of integration and 
summation we obtain 


ig . 2mm t 
f f(t) cos F 


and integrating from 0 to T we find 


2mm t 
dt 


2mm t 


ii 
dt = i a) cos dt 
o 2 T 


> f( cos ceil +b. sin ) cos 
a acne 
0 " T " T 


n=1 


2mm t 
dt 


The first integral on the r.h.s. is easily shown to be zero unless m = 0. Furthermore, we 
can use the previously found orthogonality properties (Table 23.1) to show that the rest 
of the integrals on the r.h.s. vanish except for the case when n = m, in which case the 


An 


T 
r.h.s. reduces to 5 Consequently, 


2mm t 
dt 


2 T 
“== f f(t) cos 7 


as required. When m = 0 all terms on the r.h.s. except the first vanish and we obtain 


T T aq 
/ f(t) dt =} = gt 
0 0 2 


AT 


2 


for all positive integers m 


so that 


a) Y i 
a= 7 if foe 


2mm t 
To obtain the formula for the b, multiply Equation (23.6) through by sin . and 
integrate from 0 to T. 
- _ 2mnt r dy. 2mrTt 
f(t) sin dt = — sin dt 
0 T 0 2 
+f> Qnty gig ATI 5 moet 
d,, COS sin —— ] sin 
6 n TF n T 


n=1 
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23.5.1 


Example 23.17 


Solution 


Again assuming that it is legitimate to interchange the order of integration and sum- 
mation, we obtain 


z _ 2mnnt si dy. 2mTt 
f(t) sin dt = — sin dt 
0 0 2 T 


T 
ef ae ap _ 2ntt\ | 2mrTtt dy 
cos sin sin 
a, T n TT T 


The first integral on the r.h.s. is easily shown to be zero. Furthermore, we can use the 
properties given in Table 23.1 to show that the rest of the integrals on the r.h.s. vanish 


m 


except for the case when n = m, in which case the r.h.s. reduces to . Hence we find 


2mm t 
dt 
T 


2 T 
b= t) si 
m=z f FOsin 


as required. 


Fourier series of odd and even functions 


Let us now consider what happens when we determine Fourier series of functions which 
are either odd or even. 


Find the Fourier series for the function with period 27t defined by 


0 -m<t<-—m7/2 
f@= 44 -n/2<t<n/2 
0 m/2<t<7 


As usual we sketch the function first (Figure 23.17). Inspection of Figure 23.17 shows 

that the Dirichlet conditions (page 735) are satisfied. We note from the graph that the 

function is symmetrical about the vertical axis; that is, it is an even function. We shall 

see shortly that this fact has important implications for the Fourier series representa- 
T 

tion. For convenience we consider the period of integration to be l-F: >| Hence 


Equation (23.4) becomes 


ae cls 2ntt 
a4,=s f(t) cos 
D Jer 


dt for all positive integers n 
In this example the period T equals 27. The formula for a, then simplifies to 


1 77 
a, = -{ f(t) cos nt dt 
T Jon 


The interval of integration is from f = —7t to t = 7. However, a glance at the graph 


Tt Tt 
shows that the function is zero outside the interval — 3 <t< > and takes the value 4 
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Figure 23.17 
Graph for Example 23.17. 


inside. The integral thus reduces to 


1 7/2 
a — i 4 cos nt dt 
Tl J—7/2 


4 7/2 
= / cos nt dt 
T J 7/2 


4 Ea 
oe ae er 


8 | nt 
— sin — 
nT 2 


8 8 
We obtain a, = —, a, = 0,a, = =a etc. We find a) using Equation (23.3), again 
Tt . Tt 
nat fi EE 
integrating over | —~, — |: 
g g ip) 
T/2 


2 
aq==f fea 


ll 
| 
— 
a 
~~, 
Ne 
KR 
Q 
~ 
| 
= 
a 
as 
i} 


Similarly, to find the Fourier coefficients, b,, we use Equation (23.5): 


n? 


2 = fey sin 
== f@) sin 
ie -T/2 T 


which reduces to 


1 7/2 
b= =| 4 sin nt dt 
T J 7/2 


4 [=e] 
™ n ie 
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Example 23.18 


Solution 


that is, all the Fourier coefficients, b,, are zero. Finally we can gather together all our 
results and write down the Fourier series representation of f(t): 


8 8 8 
f(t) =2+ — cost — — cos 3t + —cos5t —--- 
mt 37 5m 


In this example we see that there are no sine terms at all. In fact, whenever a function 
is even its Fourier series will possess no sine terms. To see this we note that b, can be 
found from 

ie 2ntt 


2 
b=s f(t) sin 
T Jory 


dt 


2Qnit . _ 2nnt 
is odd, the product f(t) sin 


Since f(t) is even and sin is odd also. Now the 


integral of an odd function on an interval which is symmetrical about the vertical axis 
was shown in Section 23.3 to be zero. Hence whenever f(t) is even we can immediately 
assume b,, = 0 for all n. 
Correspondingly, when a function is odd its Fourier series will contain no cosine or 
constant terms. This is because the product 
2ntt 


f(t) cos 


is odd also and so the integral 


pa 2ntt 
a4,=—> f(t) cos 
T Jory 


dt 


will equal zero. We conclude that when f(t) is odd, a, = 0 for all n. These facts can 
often be used to save time and effort. Knowing the function in Example 23.17 was even 
before we started the Fourier analysis, we could have assumed that the b, would all be 
Zer0. 


Find the Fourier series representation of the sawtooth waveform described at the begin- 
ning of this section (see Figure 23.12). 


This function is defined by f(t) = t, —7 < t < 7, and has period T = 27t. It is an odd 
function and hence a,, = 0 for all n. To find the b,, we must evaluate 


1 ™ 
b --| t sin nt dt 
T JL 


n 
TT 


_{ | econ)” f cos nt 
aoa(e | 4 dt 
Tt n ee ee 


1 
= —{-—71cosnt — 7Icos nT} 
nt 


since the last integral vanishes. Therefore, 


2 2 
b, = —-cosnm = ——(-1)" 
n n 
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2 : ‘ 
We conclude that b, = 2,b, = —1, b, = 37 .... Therefore f(t) has Fourier series 


1 1 1 1 
fO= 2 {sin - Fone a _ fee go -..| 


Engineering application 23.1 


Spectrum of a pulse width modulation controlled solar charger 


Pulse width modulated (PWM) signals are found in a wide variety of electronics 
applications. A PWM signal is simply a rectangular wave with a fixed time period, 
T, where the ‘on’ time of the signal, known as the mark time, m, can be varied. The 
ratio of the ‘on’ time, m, of the signal to the period, T, is known as the duty cycle, 
denoted by d: 
duty cycle (d) = ue = 
period T 
The frequency of the PWM signal is f = = 
Solar charge controllers, which often use pulse width modulation, are devices that 
protect the batteries in the system from becoming overcharged or from becoming 
fully discharged, both of which can cause permanent damage. Pulse width modu- 
lation is used to vary the charge rate to the batteries. Charge controllers and other 
devices that employ pulse width modulation generate high-frequency harmonics due 
to their switching. 
Consider the PWM signal shown in Figure 23.18. It has an amplitude of A volts. 


: m 
It has period T and mark time m, so that the duty cycle is d = ve 
Note that the function is even because of the symmetry about the vertical axis. 

This means that the Fourier series will contain only cosine terms, that is all the b,, 
values are zero. The a, values can be shown (see Exercises 23.5, Question 8) to be 

2A. 
a, = — sin(ntd) 

nit 


n 


Figure 23.18 
A PWM signal of period T and duty cycle m/T. 
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Note that the value of a,, is the amplitude of the mth harmonic. Further, the frequency 
of the nth harmonic is nf, that is integer multiples of the frequency of the PWM 
signal. A graph of |a,,| against fn is often referred to as the frequency spectrum. This 
concept is discussed further in Chapter 24. 

Consider the specific case of a 12 V signal at a frequency of f = 300 Hz. 
Figures 23.19(a)—(d) show the frequency spectra when d = 0.01, 0.15, 0.30 and 
0.50 respectively. 

It can be seen that the frequency spectrum becomes narrower for higher values 
of the duty cycle. The sinc function (see Section 3.5) forms the envelope for the 
spectrum, which is most clearly evident for d = 0.15. For low values of the duty 
cycle the harmonics may extend up to radio frequencies and hence there is a potential 
for interference to occur. 

Note that when d = 0.5 

2A . -) 

a, = — sin( — 

0 pins 2} 
When n is even the value of sin (=) is zero and so all the even harmonics are 


absent. In other words, when d = 0.5 the PWM signal possesses only odd 


harmonics. 
lanl (V) 0.24 ° la,|(V) 35 
TT 3.0+| % 
0.22 * 
2.5 
0.20 
2.0 
0.18 . 
0.16 - 
0.14 as 
° : ° * f(kHz) : : : §F(kHz) 
(a) (b) 
@ 
la,1(V) 6 la, (V) 
5 6 
4 
4 
3 
: 2 
1 
2 4 6 8 2 4 v4 3 
f (kHz) f (kHz) 
(c) (d) 
Figure 23.19 


Spectrum for (a) d = 0.01, (b) d = 0.15, (c) d = 0.30, (d) d = 0.50. 
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EXERCISES 23.5 
1. Find the Fourier series representation of the function 5 Find the Fourier series representation of the function 
0 -5<t<0O_ period 10 211+t) -l1<t<0O_ period2 
f= : fO= - 
1 0<t<5 0 O0<r<l 
2 Find the Fourier series representation of the function 6 Find the Fourier series representation of the function 
with period 27t given by 
—t -m<t<0 iod 27 
ro=| ; i ee ee des 2? O<t<na 
t TT ‘4 — 
o_o i 0 nm<t<2n 


3 Find the Fourier series representation of the function 


7 Find the Fourier series representation of the function 


2. . 
f@)=t +t —m<t<7 period 27 f(t)=2sint 0<t<2n_ period 2x 
4 Find the Fourier series representation of the function 8 For the signal in Engineering application 23.1, show 
. that 
w) —4 -m<t<0 period 27 2A 
tf) => i 
f 4 O<t<m a> ae) 


Solutions 
2 i gin eS ain 8 (2sinr + } sin 3¢ + 3 sin 5 +--+) 
2 5 3m 5 Si 5 4 
2 . 7Tnt Tt 
+ — sin — 1 1 
7 5 5 = +2{(2/m) cost — sinner — = sin 2nt 
n 2 ieee all > ls 2 1 . 
2 ri noe a no querer +(2/9r) cos 3ret — 5 sin 30 +--+} /n 
re jm m4. nm. 1 
3 3 + 27 sint — 4cost — 7 sin 2t + cos 2f+ 6 6 = sint — 2cost 5 nee tg OSE 
2 
2m 4 one —4 2 
“gq Sindh — 5 cos3e--- Sin 3t — 5 008 3 bos 
7 2sint 


HALF-RANGE SERIES 


Sometimes an engineering function is not periodic but is only defined over a finite 
interval,0 < ft < 5 say, as shown in Figure 23.20. In cases like this Fourier analy- 
sis can still be useful. Because the region of interest is only that between t = 0 and 
t= we may choose to define the function arbitrarily outside the interval. In partic- 


ular, we can make our choice so that the resulting function is periodic, with period T. 
There is more than one way to proceed. For example, we can reflect the above function 
in the vertical axis and then repeat it periodically so that the result is the periodic even 
function shown in Figure 23.21. We have performed what is called a periodic exten- 
sion of the given function. Note that within the interval of interest nothing has altered 
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_— | 
> a a 
0 T t T 
2 0 7 t 


Figuns 23:20 Ze Figure 23.21 
Function defined over interval 0 < t < a" An even periodic extension. 


Oo 
NIN 
~ 


Figure 23.22 Figure 23.23 
An odd periodic extension. A periodic extension that is neither even nor odd. 


but we have now achieved our objective of finding a periodic function. We can find 
the Fourier series of this periodic function and within the interval of interest this will 
converge to the required function. What happens outside this interval is not important. 
Moreover, since the periodic function is even the Fourier series will contain no sine 
terms. 

An alternative periodic extension is that shown in Figure 23.22, which has been ob- 
tained by reflecting in both the vertical and t axes before repeating it periodically to give 
a periodic odd function. Its Fourier series will contain no cosine terms and within the 
interval of interest will converge to the function required. 

A third alternative periodic extension is shown in Figure 23.23. However, this exten- 
sion is neither odd nor even and so it has none of the desirable properties of the other two. 
Whichever extension we choose, the resulting Fourier series only gives a representation 


T 
of the original function in the interval 0 < t < a and as such is termed a half-range 


Fourier series. Similarly we have the terminology half-range sine series for a series 
containing only sine terms and half-range cosine series for a series containing only co- 
sine terms. The Fourier series formulae then simplify to give the following half-range 
formulae: 


Half-range sine series: 


dy = 0, a, = 0 for n a positive integer 
Al fp eit ee 
QD, = 7 f(t) sin 7 dt for n a positive integer (@3.7) 
0 
and f(t) is given by 


2ntt 


FO) — ) bys 
i=l 
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Half-range cosine series: 


4 hfe 

Cy = a f(t) dt (23.8) 
T Jo 
A ple 2ntt alae 

“== f(t) cos dt for n a positive integer (23.9) 
W dh ap 

in, =O for n a positive integer 


and then f(t) is given by 


GQ) = = Ee. 


2ntt 


Example 23.19 By defining an appropriate periodic extension of the function illustrated in Figure 23.24, 
find the half-range cosine series representation. 


Solution 


The function illustrated in Figure 23.24 is given by the formula f(t) = ft forO <t <7 


and is undefined outside this interval. Since the cosine series is required an even peri- 
odic extension must be formed. This is illustrated in Figure 23.25. Taking T = 27 in 
Equations (23.8) and (23.9), we find a, and a,,. 


a 


Figure 23.24 
Graph for Example 23.19. 
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2 
mL n> do 


—27 2n 


Figure 23.25 
Graph for Example 23.19. 
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Now cosn7t = (—1)", so that 


2((-1)" 1 
a 2 2 | ee eee 


Of course, all the b, are zero. Therefore the half-range cosine series is 


ee ee 
=> cos cos ee 
2 #7 On 


and this series converges to the given function within the interval 0 < tf < 7. 


EXERCISES 23.6 
1. Graph an appropriate periodic extension of 4 Graph an appropriate periodic extension of 
f(t) = 3t O<t<7 fi =e O0<t<l 
and hence find its half-range cosine series and find its half-range cosine series. 
representation. 


. ; . . 5 Find the half-range sine series representation of 
2 Find the half-range sine series representation of the i st—70< 7422 
: = »URSRER 4. 


function given in Example 23.19. 


3 Find the half-range cosine series representing the 
function 


f(t) = sint O<t<T7 


Solutions 


3n 6 Caf cosnnt—1 ecosn7t — | 
1 aD ( 5 ) om 4 ieee (Se Beat ) sos 
CO oe) . 
cosn7t\ . 4 sin(n7tt/2) 
—2 t a — —_— 


1 
2 2 Sfcosnn+1 
3 cos nt 
Tt =>( n2—1 ’ 


yx ~=PARSEVAL’S THEOREM 


If the function f(t) is periodic with period T and has Fourier coefficients a, and b,,, then 
Parseval’s theorem states: 


le,¢) 


ears 1 
al (f())2 dt = 5% + D0 (an +B) 


7 


It is frequently useful in power calculations as the following example shows. 


23.8 Complex notation 749 


Engineering application 23.2 


Average power of a signal 


Find the average power developed across a 1 Q resistor by a voltage signal with 
period 27 given by 


1 1 
u(t) = cost — = sin 2t + — cos 3t 
3 2 
Solution 
We note that v(t) is periodic with period T = 27; v(t) is already expressed as a 
1 1 
Fourier series with a, = 1,a, = 5 and b, = — 3° All other Fourier coefficients are 0. 
The instantaneous power is (v(t))? and hence the average power over one period is 


given by 


av on 
Therefore, using Parseval’s theorem we find 


pee eae 1 (1) )\ coeew 
5) 3 2 ee 


1 2m 5 
P,=— (u(t))° dt 
0 


COMPLEX NOTATION 


An alternative notation for Fourier series involving complex numbers is available which 
leads naturally into the more general topic of Fourier transforms. Recall from Chapter 9 
the Euler relations 


e+? — cos@ +jsind 
from which we can obtain expressions for cos @ and sin 6: 
el? 4 ei? ei? — e-i? 
cos @ = ——_—— sin@ = ——_—— 
2 2j 
which enable us to rewrite the Fourier representation 


fl = 2nttt : 2ntt 
ft) = — 5 : a,, COS + b. sin 
T T 


n=1 


as 
Ay oe ei2nmtt/T ae ea i2nmt/T ei2nmt/T _ e i2nmt/T 
t= b 
fon 4 aa : +. - ) 
_ 4% <. ay, a, — jb, jn7t/T , Cn IP, ~j2n7t/T 


n=1 
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Example 23.20 


Solution 


which we can write equivalently as 


f(t) = a ee e2nt/T 


n=—Oo 


—jb jb 
i ed i292 er ee 
2 2 


and cy = d)/2. It can be shown that the Fourier coefficients, c,, are then given by 


n? 
nD 


1 ; 
== Gym ads 
1p =I) 


The integral can also be evaluated over any complete period as convenient. Further, if 


2n , 
we write T = — then this complex form can be expressed as 


@) 
CO 
iOo= yo 46" 
n=—CO 
where 
Tt/w 


1 ‘ 
c= — the "i! dt 
at Lo 


Find the complex Fourier series representation of the function with period T defined by 


1 -T/4<t<T/4 
ft) = 
0 otherwise 
We find 
1 T/4 
c= zi le i2n71/T dt 
. T Jorja 
1 e i2nTt/T T/4 
iT eae -T/4 
= a 
= (e jnm/2 el™/?) 
2nt j 
1 ein /2 _ enn /2 
a 
1 . nt 
= — sin — 
nT 2 
Therefore, 


aoe oi nm, 
fQ®O= ~ — sin oat 


n=—OO 
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The observant reader will note that the expressions for c, appear invalid when n = 0, 
since the denominator is then zero. We can compute cy, in either of two ways. Using an 
integral expression we see that 


an 1 
= 7 ldt = — 


Also, using a Taylor series expansion it is possible to show 


17 eci2are/T T/4 1 
im =| | at 
n>0 T | —j2nn/T _T/4 2 


giving a consistent result. 


EXERCISES 23.8 
1. Find the complex Fourier series representation of 2 if 
© . 
i Heres FO =D cern 
(a) f(t) = period 4 =a 
0 2<t<4 . : 
show that the coefficients, c,,, are given by 
(b) ff =e! -l<t<1 period 2 1 ot 
c= a f(t) e 20m /T dt 
Asinot 0<t<T7/w oF Jo 
(c) f(t) = F . j2mrct/T : 
0 m/w <t < 2n/w [Hint: multiply both sides by e~!?”*‘/7 and integrate 
period 27t/w over [0, T].] 
Solutions 
OO oo —jn7t) ,jnwt 
J jnret/2 A(l +eJ"™) e! 
1 (a) ep oosmt— De! (c) 2 saga 
—oO —oo 
1a feninm! — ec ltinn : 
b ent 
© (= 


—o 


| 23.9 | FREQUENCY RESPONSE OF A LINEAR SYSTEM 


Linear systems have the property that the response to several inputs being applied to 
the system can be obtained by adding the effects of the individual inputs. Another useful 
property of linear systems is that if a sinusoidal input is applied to the system then the 
output will also be a sinusoid of the same frequency but with modified amplitude and 
phase. This is illustrated in Figure 23.26. 

In Section 9.7 we saw that sinusoidal signals can be represented by complex numbers 
and that an a.c. electrical circuit can be analysed using complex numbers. This is true 
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Aj 
Linear 
t system 


Figure 23.26 
The response of a linear system to a sinusoidal input is also sinusoidal. 


for linear systems in general. It is possible to define a complex frequency function, 
G(jw), where @ is the frequency of the input; G relates the output and the input of a 
linear system. 

If a sine wave of amplitude A, is applied to the system then the amplitude, A,,, of the 
output is given by 


A, = IGGo)|A; 
The phase shift, ¢, is given by 
$ = LG(jw) 


Note that A, and ¢ depend upon a. It is important to note that G(jw) is a frequency- 
dependent function. Although the notation for G(jw) may seem slightly odd it arises 
because one method of obtaining the frequency function for a linear system is to substi- 
tute s = jw in the Laplace transform transfer function, G(s), of the system. 

It is now possible to analyse the effect of applying a generalized periodic waveform to 
a linear system. The first stage is to calculate the Fourier components of the input wave- 
form. The amplitude and phase shift of each of the output components is then calculated 
using G(j@). Finally, the output components are added to obtain the output waveform. 
This is only possible because of the additive nature of linear systems. An example will 
help to clarify these points. 


Engineering application 23.3 


Analogue low-pass filter 


Recall from Engineering application 22.2 that a low-pass filter is a filter that has a 
tendency to attenuate high frequency signals. If the filter consists of electronic com- 
ponents and operates on analogue signals directly then it is known as an analogue fil- 
ter. A simple circuit that acts as an analogue low-pass filter is shown in Figure 23.27. 
Derive the frequency response for this circuit and draw graphs of its amplitude and 
phase characteristics. 

Consider the circuit of Figure 23.27. Using Kirchhoff’s voltage law and Ohm’s 
law we obtain 


v, = iR+ v, 
For the capacitor, 


i 
Vv, ==> 
°  jaC 
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Eliminating 7 yields 
v; = VJOCR + v, = v,(1 + jaRC) 


Vv 1 
== ———— 23.10 
v; 1 +ja@RC ( ) 


Equation (23.10) relates the output of the system to the input of the system. 
Therefore, 


C(@) = ——= 
02) enn 
It is convenient to convert G(j@) into polar form: 
120 
Go) = 
JV 1+ (@RC)2Z tan“! w RC 
! =il 
= Z —tan”~ wRC 
JV 1+ (@RC)* 
Therefore, 
1 
|GGo)| = —===—— (23.11) 
JV1+ (@RC) 
/G(jw) = — tan“! w RC (23.12) 


The amplitude and phase characteristics for the circuit of Figure 23.27 are shown in 
Figure 23.28. These show the variation of |G(jw)| and 7G(Qjqw) with angular 
frequency o. 


IG(jo)| 
1 


0.707 


Figure 23.27 
Circuit for Engineering Figure 23.28 
application 23.3. Amplitude and phase characteristics for the circuit 


of Figure 23.27. 


Note that the circuit is a low-pass filter; it allows low frequencies to pass eas- 
ily and rejects high frequencies. The cut-off point of the filter, that is the point at 
which significant frequency attenuation begins to occur, can be varied by changing 


754 


Chapter 23 Fourier series 


the values of R and C. The quantity RC is usually known as the time constant for the 
system. Consider the case when RC = 0.3. Equations (23.11) and (23.12) reduce to 


1 

|G(§jo)| = ———————— (23.13) 
0 V¥1+0.09a? 

LG(jw) = — tan! 0.3@ (23.14) 


Let us examine the response of this system to a square wave input with fundamental 
angular frequency | and amplitude 1. This waveform is shown in Figure 23.29(a). 
We note that T = 27. The waveform function is odd and so will not contain any 
cosine Fourier components. It has an average value of 0 and so will not have a zero 
frequency component; that is, there will be no d.c. component. Therefore calculating 
the Fourier components reduces to evaluating 


aes Qn nt 
fF) = 0, sin 


=i 


9) 18/2) 
i —— f(t) sin 
© Sap 


27 nt 


dt for positive integers n 


= 


Since T = 271, we find 


1 0 aE 
db. = -({ =Isinnedr +f sinnt dt) 
7™ —1 0 
2(| =] =“) ) 
=— ar | = 
™ n = n 0 


1 
= —/(cos 0 — cos7tn — cos 7 + cos 0) 
nm 


= 


1 
= —(2 —2cos7n) 
nt 


2 
= —(1 —cos7tn) 
nt 


os 


(a) 


Figure 23.29 
(a) Input to low-pass filter; (b) output from low-pass filter. 


(b) 
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The values of the first few coefficients are 


& 
II 


2 4 
—(1 —cos7) = — 
7 1 


oo 
II 


2 
—(1 —cos27) =0 
Die 


2 4 
dy = pa — cos 37) = a 


The next stage is to evaluate the gain and phase changes of the Fourier components. 
Using Equations (23.13) and (23.14): 


n=1 
@, = Il 
(409) = — tie 
J1+0.09 x 1 
/G(jo,) = — tan! 0.3 = —16.7° 
p=3 
a, =3 
| —— te 
JT +0.09 x 9 
/G(jo@;) = — tan 0.9 = —42.0° 
R= s 
@; =5 
ce 
y IEs009ba25 


LG(jo;) = — tan! 1.5 = —56.3° 


It is clear that high-frequency Fourier components are attenuated and phase shifted 
more than low-frequency Fourier components. The effect is to produce a rounding 
of the rising and falling edges of the square wave input signal. This is illustrated 
in Figure 23.29(b). The output signal has been obtained by adding together the 
attenuated and phase-shifted output Fourier components. This is possible because 
the system is linear. 
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1 


Find the half-range Fourier sine series representation = 0<t< i 

of f(t) =tsint,0<t< 7. rw) T 4 
a — 

Find the half-range sine series representation of = fi : a <t<t 

f(t) =cos2r,0<t<n. 3 i 


Find (a) the half-range sine series, and (b) the 4 Find the Fourier series representation of the function 
half-range cosine series representation of the function with period T defined by 


defined in the interval [0, t] by 


ey V(constant) |t| < 7/6 
i= 0 T/6<\t|<7/2 
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5 The output from a half-wave rectifier is given by 


‘(t) Ismot O0<t<T/2 
ut) = 
0 T/2<t<T 


27 
and is periodic with period T = —. 
a) 


Find its Fourier series representation. 


6 Find the complex Fourier series representation of the 


function with period T = 0.02 defined by 


w) V(constant) O0<t < 0.01 
va) = 
0 0.01 <t < 0.02 


7 Find the Fourier series representation of the function 


with period 8 given by 


w 2-t O<t<4 
hh= 
f t-—6 4<t<8 


8 Therm-s. voltage, v,.,. 
u(t), with period T, is given by 


ie ia 4 
Ur m.s. = zi (v(t)) dt 


Solutions 


sin(nt) 


" a An(1 + cos nz) 
2 ; m(n + 1)2(n — 1)2 


pe n(l—cosnm) , 
2 =o ae sin(nt) 
32 SS sin(nm/4) sin(n7t/T) 
3 
(a) 372 X n2 
1 8 
(b) > + =) 
oe) 
4cos(n7t/4) — cosnmt — 3 nit 
x X 3n2 cs( _ 
me 2a, vainbvne eos (enti) 
3 Tt n 


1 


of a periodic waveform, 


10 


10 


If v(t) has Fourier coefficients a, and b,, show, using 
Parseval’s theorem, that 


1 i 
2 2 2 
Ur ms. = 4% + 2 Via + b;,) 


n=1 


If f(t) has Fourier series 


a as 2nitt 2nrt 
{®HO= $4 So(oqor r + b, sin 7 


n=1 


prove Parseval’s theorem. 
[Hint: multiply both sides by f(t) to obtain 


(¢f)) = “of “Ylaro cos 


n=1 


2Qnnt 


. 2nttt 
+b, f(t) sin 
T 
and integrate both sides over the interval [0, T] using 
Equations (23.3)—(23.5).] 


Find the half-range cosine series and the half-range 
sine series for the function 


f(t) = sinh att 0<t<l 


I i: I. é I 5 cosnm + 1 ’ 
sin w cos nw 
nm 2 Tt n2—1 


= 1 
v _COS NT — @100nmjt 


am 2 
8 S 1—cosnn nit 
ar aa cy 


; feel 
cosine series: —(cosh7 — 1) 
T 


= 2 
{ > ((—1)" cosh z — 1) cosn7tt 
m1 +72) 


n=1 


sine series: 


2n oni F 
~)*> 1) sinh 1 sinn7t 
m(1 +n?) 


n=1 


t = | 
“7 x PAF 4 

a AY i 

on } 
iO » ay a 
j =. | 
ate: 
“hol eae SPN ss 
: eh a 
; | 
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INTRODUCTION 


We have seen that almost any periodic signal can be represented as a linear combination 
of sine and cosine waves of various frequencies and amplitudes. All frequencies are inte- 
ger multiples of the fundamental. However, many practical waveforms are not periodic. 
Examples are pulse signals and noise signals. The function shown in Figure 24.1 is an 
example of a non-periodic signal. 
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24.2 


Figure 24.1 
A non-periodic 
function. 


We shall now see how Fourier techniques can still be useful by introducing the Fourier 
transform which is used extensively in communications engineering and signal process- 
ing. For example, it can be used to analyse the processes of modulation, which involves 
superimposing an audio signal onto a carrier signal, and demodulation, which involves 
removing the carrier signal to leave the audio signal. 


THE FOURIER TRANSFORM — DEFINITIONS 


Under certain conditions it can be shown that a non-periodic function, f(t), can be 
expressed not as the sum of sine and cosine waves but as an integral. In particular, 


foO= [ AeV00s ot + B(w) sinwt dw (24.1) 
0 
where 
A(@) = =f. f(t) cos wt dt and Bw) = =f. f@)sinatdt (24.2) 


Provided 
(1) f(t) and f’(t) are piecewise continuous in every finite interval, and 
(2) i | f(t)| dt exists 


then the above Fourier integral representation of f(t) holds. At a point of discontinuity 
of f(t) the integral representation converges to the average value of the right- and left- 
hand limits. As with Fourier series, an equivalent complex representation exists which 
is, in fact, more commonly used: 


Fourier integral representation of f(t): 


1 fe 
(C= — Hl F(w) e dw (24.3) 
Die Saws 
where 
F(w) = i f the dt (24.4) 


There is no universal convention concerning the definition of these integrals and a num- 


: . ; . I, 
ber of variants are still correct. For instance, some authors write the factor — in the 


second integral rather than the first, while others place a factor in both, giving 
Tt 


some symmetry to the equations. There is also variation in the location of the factors 
e J" and e'. We shall use definitions (24.3) and (24.4) throughout but it is important to 
be aware of possible differences when consulting other texts. 

Equations (24.3) and (24.4) form what is called a Fourier transform pair. The 
Fourier transform of f(t) is F(@) which is sometimes written F{f(t)}. Similarly 
f(t) in Equation (24.3) is the inverse Fourier transform of F(@), usually denoted 
FMF (o)}. 


Example 24.1 


Solution 


fo 


Figure 24.2 


Graph of u(t) e~’. 


Example 24.2 


Solution 
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The Fourier transform of f(t) is defined to be 


PHO) Sa) = / fie dt 


You will also note the similarity between Equation (24.4) and the definition of the Laplace 
transform of f(t): 


Lif} = i: f(tye™ dt (24.5) 


We see that, apart from the limits of integration, the substitution ja = s in Equa- 
tion (24.4) results in the Laplace transform in Equation (24.5). There is indeed an impor- 
tant relationship between the two transforms which we shall discuss in Section 24.7. We 
note that Equation (24.3) provides a formula for the inverse Fourier transform of F(a), 
although the integral is frequently difficult to evaluate. 


Find the Fourier transform of the function f(t) = u(t)e“ 


function. 


, where u(t) is the unit step 


The function u(t) e~ is shown in Figure 24.2. Using Equation (24.4), its Fourier trans- 
form is given by 


F(@) = / * f thes” dt 


= i ee i” dt since f(t) = 0 fort < 0 
0 


me . 
= / e tient qt 
0 


e Ute 7° 
- Ee = 


1 


= a since e+" _» 0 ast > 00 
that is, 
Fo) =—— 
1+jo 


Use Equation (24.3) to find the Fourier integral representation of the function defined 
by 


i eel 
r= | It] >1 


Using Equation (24.3) we find 


Lf? ; 
fOH= =f F(a)e™ dw 
27 af 6s 
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where 


F(o@) 


/ _ f(t)e'™ dt 
ai. 

i, 
ever! 
Ene 


ele — ele 


lei dt 


jo 


since f(t) is zero outside [—1, 1] 


Using Euler’s relation (Section 9.6) 


sind = 
we find 
F(@) = 


so that 


f= =f. 2sinw 
mT JE o 


is the required integral representation. Note that F(@) = 


of f(t). The function 


ec’ —ed 


0 jo 
2j 


2sin@w 


CO 


sin w 


(see Section 3.5). 


el da 


2sinw , . 
is the Fourier transform 


occurs frequently and is often referred to as the sinc function 


As with Laplace transforms, tables have been compiled for reference. Such a table of 
common transforms appears in Table 24.1. 


EXERCISES 24.2 


1 


Find the Fourier transforms of 


_fi/4 itl<3 
@F0=| It] > 3 
ia. G2729 
CP ad) Ce ee re a 
0 otherwise 
e@ +30 a>0 
FO =| oa t<0 
_fetcost t20 
@sro={5 rb 


(e) f@) =u(tye/™ 


where T is a constant 


2 Find 


(a) the Fourier transform, and 


(b) the Laplace transform of 
a>0O 


ft) =une™ 


Show that making the substitution s = jw in the 
Laplace transform of f results in the Fourier 


transform. 
_ fl Ith<2 
m ffe)= | 0 otherwise 


and g(t) = el’, find F{ f (t)g(t)}. 


Solutions 


(a) 


(d) 


sin 3w 
20 
1l+jo 
(1+jo)?+1 


(b) 


24.3 


24.3.1 


24.3 Some properties of the Fourier transform 


Table 24.1 
Common Fourier transforms. 
f(t) F(a) 
f(t) =Au(the“,a>0 a 
, , atjo 

1 -a<t<a 2 sin wa 
I= ie otherwise wo 
fO=HA constant 271A6 (a) 
f(t) =u(tyA a(nsi - 1) 

oO 
f(t) = 50) 1 
f(t) = S(t —a) e jwa 
f(t) = cos at T1(d(w +a) + 46(w —a)) 
f(t) = sinat 7 (8(w — a) —5(@ +a) 
J 


ft) =e a>0 


f(t) = sen(t) = | Pee 
1 
f(t) = : 
fnaee 
1 —cos2w (c) 2a 
2 ee SI 
T 
© Ty jor 


2a 
Fae 
2 
jo 


—jmsgn(@) 


us ee /4a 
a 


1 
at+jo 


a @) Sta 


2 sin2(1 — @) 


l-@ 


SOME PROPERTIES OF THE FOURIER TRANSFORM 


A number of the properties of Laplace transforms that we have already discussed hold 
for Fourier transforms. We consider linearity and two shift theorems. 


Linearity 


If f and g are functions of ¢ and k is a constant, then 


Shiela aey 


Fikf} = kF{f} 
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Both of these properties follow directly from the definition and linearity properties of 
integrals, and mean that F is a linear operator. 


Example 24.3 Find F{u(t)e + u(t) e"~}. 


Solution We saw in Example 24.1 that 


Flue} = itis 


Furthermore, 


Flu(the“} = / ude di 


—cO 


ee . 
= / e CHey dy 
0 


e- 2+ie)t i 
~ L-2+jo) Jo 


1 
~ 24+jo 
Therefore, 
Flu(t)et +u(the"} = : + : by linearity 
l+jo 2+ jo 
_ 2+jo+1+jo 
(1 + jo)(2 + jo) 
_— 3 4+2jo 
~ 2-0? + 3jo 


24.3.2 First shift theorem 


If F (@) is the Fourier transform of f(t), then 
Fie!” f (t)} = F(w — a) where a is a constant 


Example 24.4 (a) Show that the Fourier transform of 


fo= 3 —2<t<2 
~ | 0. otherwise 
at 6 sin 2H 
is given by F(w) = : 
0) 


(b) Use the first shift theorem to find the Fourier transform of e~" f(t). 
(c) Verify the first shift theorem by obtaining the Fourier transform of e~ f (t) directly. 
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20 —jot 72 
Solution (a) Fo)=3 [ e !” dt =3[° 


=) —J® j-2 


—2jo _ a2djw 
= a(S e ) 
2jo _ a—2jo 
=6( e ) 
2jw 


= — sin2@ 
o 
6 sin 2@ ; ; : 
(b) We have F{f(t)} = F(w) = . Using the first shift theorem with a = —1 
we have 
—jt 6 ‘ 
Fle*fO}=F@+l)= sin 2(w + 1) 
ot+l1 


Be 2<t<2 
0 otherwise 


c) e*foO= | 


So to evaluate its Fourier transform directly we must find 


9 


Fe tyoy=3 / i aa 


—2 


2 
-2 
eo (+a) jt 2 
-|5a50)] 
—ji+o) 2 
6 e2(l+e)j = e 21 +@)j 
~ l+@ ( 2j ) 


sin2(1 +) 


6 
l+o 


as required. 


1 
Example 24.5 Use the first shift theorem to find the function whose Fourier transform is 34 ](@ 2)” 
JLo — 
1 
given that F{u(t)e7""} = —,m> 0. 
m+ jo 


Solution From the given result we have 
F{u(t)e"*} = eee F(a) 
34 jo 
Now 


1 


ag 
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Therefore, from the first shift theorem with a = 2 we have 


; 1 
Fle" u(t) e*} = ——_—_ 
3+ j(@ — 2) 
1 : 
Consequently the function whose Fourier transform is ——————— is u(t) ee BA, 
3+ j(@ — 2) 
Example 24.6 Find the Fourier transform of 
e* t>0 
f@) = {fs <0 
: ; , 6 
Deduce the function whose Fourier transform is G(@) = ———_—___.. 
10+ 2@ + w? 
Solution F(w)= / f(t)e dt 
0 . ~ . 
= : e eet dt zi / et e Jt dt 
—0o 0 
0 co 
=f eGov dt a / e Gtie)t dt 
—0o 0 
eG-joyt 0 e Grier 7% 
SST Lsas) 
3-jol_. L-G+jo) Jo 
ol 2 1 
~~ 3-jo 3+jo 
_ 6 
~ 9+ a? 
Now 
6 6 
G(@) = = =F(@+1) 


10+20+a2 (w9+1)?4+9 


Then, using the first shift theorem F(w + 1) will be F{e~ f(t)}; that is, the required 
function is 


a it 450 
BY) =) Git tt <0 


24.3.3 Second shift theorem 


If F (@) is the Fourier transform of f(t) then 
Fif(tt —a)} =e °F) 
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1 |t)}<1 
0 |tl>1 
to find the Fourier transform of 


si 
Example 24.7 Given that when f(t) = | ,F(@)= clad apply the second shift theorem 
o 


@= lL. iet<3 
= 10. otherwise 
Verify your result directly. 
g(t) 
Figure 24.3 
The function 
1 l<t<3 
f= 
at) 0 otherwise. 


Solution The function g(t) is depicted in Figure 24.3. Clearly g(t) is the function f(t) translated 


2 
2 units to the right, that is g(t) = f(t—2). Now F(a) = “eo is the Fourier transform 


of f(t). Therefore, by the second shift theorem 


2e~7/° sin w 


Fig@t)} = Ff — 2} =e PF) = 


To verify this result directly we must evaluate 
3 


—jot ; —jot oo 
g(the 1" dt = el dt = - 
—oo 1 JO 41 


Fi{gt)} 


_ 2e7 sinw 
~ wo 
as required. 
EXERCISES 24.3 
A —te1 _ 72 < 
1. Prove the first shift theorem. (by h(t) = 4 © d-f) |th<l 
0 \t}>1 
2 Find the Fourier transform of 
3 Find the inverse Fourier transforms of 
fa) = 1-7? |th<1 
~ 10 \t} > 1 tas 1 
a) ——____ 
Use the first shift theorem to deduce the Fourier (o+7)j+1 
transforms of (b) 2 
wera eit(1—12) |t}<1 14+2(@— 1)j 
eG It} >1 


4 Prove the second shift theorem. 
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1 : . ' 
5 Given F{u(t)e~} = ert use the second shift 6 Find, using the second shift theorem, 
jo 
theorem to find SaBiy 
1) ¢.—4jo 
Flu(t+4)e"9} a {s o 
Verify your result by direct integration. 
Solutions 
my 4cosm | asin 3 (a) wt)ete7# =~ (b) u(t) et? 
=e oe 
F etio 
(a) —4cos(w—3) 4sin(w—3) 5 
a : 
(w — 3)2 (w — 3)3 1+jo 
) —4cos(w—j) 4sin(w — j) 6 3for2 <t < 6,0 otherwise 
i 
(@ —j)? (@—j) 


In the Fourier analysis of periodic waveforms we stated that although a waveform physi- 
cally exists in the time (or spatial) domain it can be regarded as comprising components 
with a variety of temporal (or spatial) frequencies. The amplitude and phase of these 
components are obtained from the Fourier coefficients a, and b,. This is known as a 
frequency domain description. Plots of amplitude against frequency and phase against 
frequency are together known as the spectrum of a waveform. Periodic functions have 
discrete or line spectra; that is, the spectra assume non-zero values only at certain fre- 
quencies. Only a discrete set of frequencies is required to synthesize a periodic wave- 
form. On the other hand when analysing non-periodic phenomena via Fourier transform 
techniques we find that, in general, a continuous range of frequencies is required. Instead 
of discrete spectra we have continuous spectra. The modulus of the Fourier transform, 
|F (@)|, gives the spectrum amplitude while its argument arg(F (w)) describes the spec- 
trum phase. 


Example 24.8 In Example 24.2 the Fourier transform of 


fil <i 
f= {4 It] >1 


2 sin w 
was found to be F(@) = 


. Sketch the spectrum of f(r). 


Solution F() is purely real. The spectrum of f(t) is depicted by plotting |F(w)| against w as 


illustrated in Figure 24.4. Note that lim aaa 1. (See Review exercises in 
o- @M 
Chapter 18.) 


24.4 Spectra 767 


|F()| 


= ca Figure 24.4 


An -3 -20 aw 2m 30 44 w Spectrum of f(t) -| : Hels 


0 |t)>1. 


Engineering application 24.1 


Amplitude modulation 


Amplitude modulation is a technique that allows audio signals to be transmitted as 
electromagnetic radio waves. The maximum frequency of audio signals is typically 
10 kHz. If these signals were to be transmitted directly then it would be necessary 
to use a very large antenna. This can be seen by calculating the wavelength of an 
electromagnetic wave of frequency 10 kHz using the formula 


C= iA 
Here c is the velocity of an electromagnetic wave in a vacuum (3 x 10° ms7'), f is 


the frequency of the wave and A is its wavelength, and hence A = —_ 30000 m. 


It can be shown that an antenna must have dimensions of at least one-quarter of the 
wavelength of the signal being transmitted if it is to be reasonably efficient. Clearly a 
very large antenna would be needed to transmit a 10 kHz signal directly. The solution 
is to have a carrier signal of a much higher frequency than the audio signal which is 
usually termed the modulation signal. This allows the antenna to be a reasonable size 
as a higher frequency signal has a smaller wavelength. The arrangement for mixing 
the two signals is shown in Figure 24.5. 


Antenna 


x(t) 


X(t)COsw,t 


Figure 24.5 


le Amplitude modulation. 


Let us now derive an expression for the frequency spectrum of an amplitude- 
modulated signal given by 


b(t) = x(t) cos w,t 


where w, is the angular frequency of the carrier signal and x(t) is the modulation 
signal. Now $(t) = x(t) cos w,t can be written as 


el2.t —jo,t 
o() =x) 
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X(w)| 
|P(o)| 


(a) (b) . 


Figure 24.6 
Amplitude modulation. (a) Spectrum of the modulation signal; (b) spectrum of the 
amplitude-modulated signal. 


Taking the Fourier transform and using the first shift theorem yields 


Fi{o(t)} = 0(w) = F| x(t) (ee! 2} 


2) 
a el@'x(t) e I'x(t) 
ley il 
= 5X —o,)+X(w@+@,)) 


where X(@) = F{x(t)}, the frequency spectrum of the modulation signal. 

Let us consider the case where the frequency spectrum, |X (w)|, has the profile 
shown in Figure 24.6(a). The frequency spectrum of the amplitude-modulated sig- 
nal, |®(@)|, is shown in Figure 24.6(b). All of the frequencies of the amplitude- 
modulated signal are much higher than the frequencies of the modulation signal thus 
allowing a much smaller antenna to be used to transmit the signal. This method of 
amplitude modulation is known as suppressed carrier amplitude modulation be- 
cause the carrier signal is modulated to its full depth and so the spectrum of the 
amplitude-modulated signal has no identifiable carrier component. 


EXERCISES 24.4 


Plot a graph of the spectrum of f(r) for 

—27m < w < 2z. Write down an integral expression 
f= t+1l -1<r<0O for the pulse which would result if the signal f(t) 

: 1-t O<t<l were passed through a filter which eliminates all 


1. Show that the Fourier transform of the pulse 


Sanibe waittencak angular frequencies greater than 271. 


F(@)= 4 — cosw) 
o 


Solutions 


1 2m alot 
1 mall 2(1 — cos w)—~ dw 
2n w 


27 


THE t—w DUALITY PRINCIPLE 


We have, from the definition of the Fourier integral, 


f®H= al F(w) da (24.6) 
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FO F(w) 2 sint 2uf(@) 
F t F 
1 — — 
-l i: 2 0) t 
Figure 24.7 Figure 24.8 
Illustrating the t—-w duality principle. Illustrating the t—@ duality principle. 
where 
F(@) = / f(t)es™ dt (24.7) 
is the Fourier transform of f(t). In Equation (24.6), w is a dummy variable so, for ex- 
ample, Equation (24.6) could be equivalently written as 
1” ss 
fO == F(z) e* dz (24.8) 
2Tt Joo 
Then, from Equation (24.8), replacing t by —@ we find 
f(—@) = = i” F(zje dz = Es 2 F(t)e” dt 
27 Joo “Ort Jog 
l 
which we recognize as oe times the Fourier transform of F(t). 
Tt 
We have the following result: 
If F (@) is the Fourier transform of f(t) then 
1 
f(—@) is a x (the Fourier transform of F (t)) 
which is known as the t—q duality principle. 
We have seen in Example 24.2 that if 
_fi isi 
f= It} >] 
2sinw a : ee : a, 3 
then F(@) = . This is depicted in Figure 24.7. From the duality principle we can 
immediately deduce that 
2 sint 
PG ee) =a 
since f is an even function (Figure 24.8). Unfortunately it is very difficult to verify this 
result in most cases because while one of the integrals is relatively straightforward to 
evaluate, the other is usually very difficult. However, we can use the result to derive a 
number of new Fourier transforms. 
Example 24.9 Given that the Fourier transform of u(t) e~ is ini Us the duality principle to deduce 


1 
the transform of 
1+ 
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Solution 


24.6 


24.6.1 


Example 24.10 


Solution 


We know F(w) = 


is the Fourier transform of f(t) = u(t)e™‘. Therefore 


27t(u(—a@) e®) is the Fourier transform of 


1+jt 


FOURIER TRANSFORMS OF SOME SPECIAL FUNCTIONS 


We saw in Section 24.4 that the Fourier transform tells us the frequency content of a sig- 
nal. If we were to find the Fourier transform of a signal composed of only one frequency 
component, for example f(t) = sint, we would hope that the exercise of finding the 
Fourier transform would result in a spectrum containing that single frequency. 

Unfortunately if we try to find the Fourier transform of, say, f(t) = sint problems 
arise since the integral 


as « 
/ sinte!” dt 
—0o 


cannot be evaluated in the usual sense because sint oscillates indefinitely as |t| — oo. 
In particular, Condition (2) of Section 24.2 fails since Fe | sin t| dt diverges. There are 
many other functions which give rise to similar difficulties, for instance the unit step 
function, polynomials and so on. All these functions fail to have a Fourier transform 
in its usual sense. However, by making use of the delta function it is possible to make 
progress even with functions like these. 


The Fourier transform of 5(¢ — a) 


Use the properties of the delta function to deduce its Fourier transform. 


By definition 
F{s(t —a)} = / 5(t —a)e'™ dt 


—oo 


Next, recall the following property of the delta function 
i f(t)d(t — a) dt = f(a) (24.9) 


for any reasonably well-behaved function f(t). Using Equation (24.9) with f(t) = ee 
we have 


F{6(t —a)} = i: e!'5(t — a) dt = eI" 


—oO 


In particular, if a = 0 we have F{d(t)} = 1. This result is depicted in Figure 24.9. 
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f(t) F() F(t) 2775 (w) 
1 
o t o 


Figure 24.9 Figure 24.10 
F{8(t)} = 1. F{1} = 275(). 


Example 24.11 Apply the t—w duality principle to the previous result. Interpret the result physically. 


Solution We have f(t) = 5(t) and F(w) = 1. The duality principle tells us that 
f(-—a@) = 6(-a) which equals Fill 
that is, 
F(l} = 2n85(o) 


(since 6(—w) = 6(@)). This is illustrated in Figure 24.10. Physically F(t) = | can be 
regarded as a d.c. waveform. This result confirms that a d.c. signal has only one frequency 
component, namely zero. 


Example 24.12 Given that F{5(t — a)} = e J find F{e“"4}. 
Solution We have f(t) = 5(t — a), F(w) = e- 1. Applying the t—w duality principle we find 
1 ; 
—wek—bu0=— Fic" 
f(-@) = 8(—w — a) a eee 


Therefore 


Fle} = 275(— — a) 
= 2nd(—(@ + a)) 
= 2715(w + a) 


since 5(@) is an even function. 


24.6.2 Fourier transforms of some periodic functions 


From Example 24.12 we have F{e—"“} = 276(w + a) and also, replacing a by —a, 
F {ei} = 275 (w — a). Adding these two expressions we find 


Fle} + Fie} = 2n(5(w +a) + 5(@ — a)) 
Recalling the linearity properties of F we can write 

Fle" + el} = 2n(5(w +. a) + 6(w — a)) 
and using Euler’s relations we find 


F {cos at} = 71(6(w + a) + 6(@ — a)) 
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Example 24.13 


Solution 


24.7 


Example 24.14 


F(t) = cos at 


Figure 24.11 
The spectrum of cos at. 


We see that the spectrum of cos at consists of single lines at @ = ta corresponding to 
a single frequency component (Figure 24.11). 


Find F {sin at}. 
Subtracting the previous expressions for F {el} and F{e—"“} and using Euler’s relations 
we find 
Fie} — Fle} = 2n(5(@ — a) — 5(@ +.4)) 
that is, 


ela _ —jta 

{Ss} 2 Shaw) = 
2j j 

so that 


F {sin at} = Flo ~a)—8(@ +2)) 


THE RELATIONSHIP BETWEEN THE FOURIER 
TRANSFORM AND THE LAPLACE TRANSFORM 


We have already noted (Section 24.2) the similarity between the Laplace transform and 
the Fourier transform. Let us now look at this a little more closely. We have 


FAFO} = fjemde and Lf} =) fae dr 
—00 0 
In the definition of the Laplace transform, the parameter s is complex and we may write 
5 =o +ja, so that 
L{f)} = i fOe"e dt 
0 


Thus an additional factor, e~°’, appears in the integrand of the Laplace transform. For 
o > Othis represents an exponentially decaying factor, the presence of which means that 


the integral exists for a wider variety of functions than the corresponding Fourier integral. 


Find, if possible, 


(a) the Laplace transform 


(b) the Fourier transform 


of f(t) = u(t) e*. Comment upon the result. 


Solution 


Example 24.15 


Solution 
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(a) Either by integration, or from Table 21.1, we find 


1 
L{u(t)e*} = a provided s > 3 


s-— 


~ ne (3-jw)t 7° 
(b) F{u(t)e*} =} e* Tier a= eG-ioyt dy — E 
7 0 3 —jo Jo 


Now, ast > oo, e — ov, so that the integral fails to exist. Clearly, u(t) e* has a 


Laplace transform but no Fourier transform. 


Suppose f(t) is defined to be 0 fort < 0. Then its Fourier transform becomes 


FifO}= [ fae dt 


and its Laplace transform is 


Lf} = i Fie di 


By replacing s by jw in the Laplace transform we obtain the Fourier transform of f(t) if 
it exists. Care must be taken here since we have seen that the Fourier transform may not 
exist for a function that nevertheless has a Laplace transform. 


Find the Laplace transforms of 
(a) u(t)e-7 
(b) u(t) e* 
Let s = jw and comment upon the result. 
(a) L{u(t)e-7} = 7 
s+2 
(b) L{u(t)e"} = > 


5° 
Replacing s by jw in (a) gives - . Similarly, replacing s by jw in (b) gives - : 
Now 
F{u(t)e*} = = 
jo+2 


so that replacing s by jw in the Laplace transform results in the Fourier transform. How- 
ever, F{u(t) e7} does not exist and even though we can let s = jw in the Laplace trans- 


form and obtain 


5 we cannot interpret this as a Fourier transform. 


The Fourier transform does possess certain advantages over the Laplace transform. While 
the Laplace transform can only be applied to functions which are zero for t < 0, the 
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24.8 


24.8.1 


Fourier transform is applicable to functions with domain —oo < t < oo. In some ap- 
plications where, for example, f represents not time but a spatial variable, it is often 
necessary to work with negative values. 

The inverse Fourier transform is given by 


lo, e) 


FMF (o)} = = / F(w) & da (24.10) 
~~ Ont . 


—o 


The corresponding inverse Laplace transform requires advanced techniques in the theory 
of complex variables which are beyond the scope of this book. The existence of Equa- 
tion (24.10) is not quite as advantageous as it may seem because it is often difficult to 
perform the required integration analytically. 


CONVOLUTION AND CORRELATION 


Convolution is an important technique in signal and image processing. It provides a 
means of calculating the response or output of a system to an arbitrary input signal if 
the impulse response is known. The impulse response is the response of the system to 
an impulse function. Convolving the input signal and the impulse response results in the 
response to the arbitrary input. Correlation is a second important technique. It can be 
used to determine the time delay between a transmitted signal and a received signal as 
might occur in radar or sonar detection equipment. 


Convolution and the convolution theorem 


If f(t) and g(t) are two real piecewise continuous functions, their convolution, which 
we denote by f * g, is defined as follows: 


The convolution of f(t) and g(t): 


ue |) fA)get —A)da 


J * gis itself a function of f, and to show this explicitly we sometimes write (f * g)(t). 
Note that convolution is an integral with respect to the dummy variable A. In general, 
as can be seen from the limits of integration, A varies from —oo to ov, but in particular 
cases we will see that this interval of integration can be reduced. In the examples which 
follow the precise meaning of the terms f(A) and g(t — 2) will become apparent. 

Because convolution is commutative, that is f * g = g « f, the convolution can be 
defined equivalently as 


/ F(t — A)g(a) da 


In cases when f(t) and g(t) are zero for t < 0 this expression reduces to that defined for 
the Laplace transform in Section 21.9. 


Example 24.16 


Solution 


f(t) =u(te+ 
g(t) = u(t)e- 


Figure 24.12 


Graphs of 
f(t) = u(t) e~ and 
g(t) =u(the ~". 
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The convolution theorem states that convolution in the time domain corresponds to 
multiplication in the frequency domain: 


The convolution theorem: 
If F{f(t)} = F(@) and F{g(t)} = G(@) then 


Ff * g} = F(@)G(@) 


This theorem gives us a technique for calculating the convolution of two functions using 
the Fourier transform, since 


f*xg=F '{F(@)G(o)} 


So, it is possible to find the convolution of f(t) and g(t), by 


(1) finding the corresponding Fourier transforms, F(w) and G(w), 
(2) multiplying these together to form F(w)G(a), 


(3) finding the inverse Fourier transform which then yields f x g. 


This is a process often used for finding convolutions using a computer as will be ex- 
plained in Section 24.15. 

A graphical representation of convolution is useful as it can throw light on the under- 
lying process and help us to determine appropriate limits of integration. We will illustrate 
this in the following example. 


(a) Using the definition of convolution, calculate the convolution f « g when 
f(t) =u(t)e™ and g(t) = u(t) e*', where u(t) is the unit step function. 


(b) Verify the convolution theorem for these functions. 


(a) The convolution of f and gis given by 


fee= | SA)gt — A) da 


Graphs of f(t) and g(t) are shown in Figure 24.12. 

Evaluating a convolution integral can be difficult, so we will develop the solu- 
tion in stages. First of all it is necessary to be clear about the meaning of the different 
terms in the integrand. Note that if f(t) = u(t)e~ then f(A) = u(A) e*. Similarly 
g(A) = u(A) e~*. The function g(—A) is found by reflecting g() in the vertical axis 
as shown in Figure 24.13(a). In signal processing terminology this is also known as 
folding. The folded graph can be translated a distance f to the left or to the right by 
changing the argument of g to g(t — A). If t is negative the graph moves to the left as 
shown in Figure 24.13(b) whereas if ¢ is positive it moves to the right as shown in 
Figure 24.13(c). In Figure 24.13 we have superimposed the graphs of f(A), shown 
dashed, and g(t — A), for t being negative, zero and positive. Where the graphs do 
not overlap, the product f(A)g(t — 4), and hence the convolution, must be zero. We 
examine separately the domains ¢ < 0 andt > 0 corresponding to where the graphs 
do not overlap and where they do overlap respectively. 
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fA) = uwaje* 


. 
- 


g(-A) 


Sine 


(a) 


g(t—A) when t<0 


(b) t r 


g(t—A) when t>0 Figure 24.13 
The function g(t — 4) for various values 
(c) ‘ A pte 


Whent <0 
When t < 0 there is clearly no overlap and it follows that f * g = 0. 
Whent > 0 


When t > 0 there is overlap for values of A between 0 and tf, that is when0 < 1 <t, 
and hence 


t 
f*xg= / ere 20-4 gy 
0 


t 
= / e e 2 e2* di 
0 


t 
= on) eda 
0 


= eo [e* i 


=e “(e'-1) 


Finally the complete expression for the convolution is 


—e when t > 0 


when t < 0 


et 
(f #g)(t) = | 
This may also be written as 


(f *g)(t) =u(t)(e! —e”) 


(b) Using Table 24.1 the Fourier transforms of f and g are 


1 
F(@)= T+jo G(@) = 2+ jo 
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and their product is 
1 
F(@)G(@) = @+ia@ +0) (24.11) 
The Fourier transform of the convolution is, using linearity and Table 24.1, 
1 1 
1+jo ~ 24 jo 
(2+ jo) — (1+jo) 
~ (1+ jo) 2+ jo) 
_ 1 
~ (1+ jo)2 + jo) 
which is the same as Equation (24.11). We have shown that F{f *« g} = F(w)G(@) 
and so the convolution theorem has been verified. 


Flu(ty(et-—e*)} = 


Example 24.17 (a) Using the definition of convolution find the convolution, f * g, of the ‘top-hat’ 
function 

1 -l<t<l 

0 otherwise 


ro={ 


and the function g(t) = u(t)e 


~, where u(t) is the unit step function. 


(b 


wm 


Verify the convolution theorem for these functions. 


Solution (a 


ae 


The functions f(t) and g(t) are shown in Figure 24.14. 
The convolution of f(t) and g(t) is given by 


fess | FA)g(t — A) da 


Note that since g(t) = u(t)e~, it follows that g(A) = u(A)e™* as shown in Fig- 
ure 24.15(a). The function g(—A) is found by reflecting, or folding, g(A) in the 
vertical axis. This folding is shown in Figure 24.15(b). The folded graph can be 
translated a distance ¢ to the left or to the right by changing the argument of g to 
g(t — A). Ift is negative the graph in Figure 24.15(b) moves to the left, whereas if t 
is positive it moves to the right. Study Figures 24.15(c—g) to observe this. 

Convolution is the integral of the product of f(A) and g(t — 2). We have super- 
imposed f(A) on the graphs in Figure 24.15. For values of 2 where the graphs do 
not overlap, this product must be zero. 

Inspection of the graphs shows that when f is less than —1 (Figure 24.15(c)) there 
is no overlap and hence f(A)g(t — 4) = 0. So: 


fra! 


fxg=0 


1 g(t) = u(tye ' 


Figure 24.14 
The ‘top-hat’ function f(r), 
=1 1 t t and g(t) = u(t)e. 
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g(a) 


g(-A) 


(b) A 


g(t—A) when t< —-1 1 


ae ee 


g(t—A) when t= -1 


Se dloc 5 


mj----- 
> 


(d) =f 


i) 


g(t—A) when t = 1 


(f) -1 1 Xr 


g(t-A) when t>1 = i------> 


(g) -1 1 ¢ A 


Figure 24.15 
The function g(t — 2) for various 
values of f. 
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When f is greater than —1 but less than 1, as in Figure 24.15(e), there is an overlap, 
and hence a non-zero product. This occurs for values of A between —1 and f, that is 
in the interval —1 < A < t. Within this interval f(A) = 1 and g(t — A) =e *”. 
So: 


if -l<t<1 
t t 
f[*g= i eV) d= al eda =e“[e*}._, =e“[e’' —e ']=1-e'* 
-1 -1 
When ¢ is greater than | the graphs overlap, but only for values of A between —1 


and 1, that is for —1 < A < 1. So: 


fre 
1 
f*g= / ed, =e“fe*]!, =e“(e' —e7!) 
-1 
Putting all these results together 
0 t<-l 


(fegQ=,1-e -l1<t<1 
e‘(e!—e!) t3>1 


(b) Using Table 24.1 the Fourier transforms of f and g are 
F(w) = 2sin@ Glo) = 
1+jo 
and their product is 
Pei (24.12) 
ow +jo) 


Since the convolution is defined differently on each part of the domain, then the 
Fourier transform of the convolution must be found by integrating over each part of 
the domain separately. You will also need to recall that 


ele _ ee 
2j 


sinw = 


So, 
FIf+oO) = / (f * g)(t) eat 


-1 a 
=) O-etart fl=e "jes" a 
—oo -1 
+ i e7(e! —e!)e dt 
dl 


1 
=|, eit — elie q¢ 4+ (el — 2M eet dt 


1 


[ elt _e -1 e (tie) dt + (e! -e fo e tHe) g 
1 


e-ier 7! ent (Hie) 1 e(Hie) 7° 
*[E-° asal_te- a 
—jo jy —CU+jo) J, —CU+jo) J; 
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ee ee '( e7 (tie) ) 
=—tT+3 e- : 
—jo jo =(14- Ja) 


4 etic) : es e7 Ute) 
o (<i) + © -< (Te) 


dsj 
The first two terms simplify to “ = By simplifying and rearranging the remain- 
jo _ pa-jw 
der becomes SS Putting all this together we find 
1+jo 


2sinw ed? —e i 
o 1+jo 

2sinw 2jsinw 
o 1+jo 
2 sinw 

o(1+jo) 


FAS * g)(t)} 


which is the same as Equation (24.12). We have shown that F{f « g} = F(w)G(@) 
and so the convolution theorem has been verified. 


24.8.2 Correlation and the correlation theorem 


If f(t) and g(t) are two piecewise continuous functions their correlation (also called 
their cross-correlation), which we denote by f x g, is defined as follows: 


The correlation of f(t) and g(t): 


fes= | fA) g(A — t) da 


This formula is the same as that for convolution except for the important difference that 
the function g is not folded; that is, we have g(A — f) here rather than g(t — A). 

It is possible to show that the correlation of f(t) and g(t) can be written in the alter- 
native form 


fre | S(t + A)g(A) da 


Some texts use this form, but note that it is the argument of f which is t + A (see Ques- 
tion 8 in Exercises 24.8). 
We can now state the correlation theorem. 


The correlation theorem: 
If F{f(@)} = F(@) and F{g(t)} = G(w) then 


FA f x 3} = F(w)G(-a) 


Example 24.18 


Solution 


f(t) = ule" 


a(t) = u(t)e 7" 


Figure 24.16 
Graphs of f(t) = u(t)e~“ 
and g(t) = u(t)e~!. 
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(a) Using the definition of correlation, calculate the correlation of f(t) = u(t) e~ and 
g(t) = u(t) e~~, where u(t) is the unit step function. 


(b) Verify the correlation theorem for these functions. 


(a) The correlation of f and g is given by 


fem | S(A)g(A — t) da 


Graphs of f(t) and g(t) are shown in Figure 24.16. 

In Figure 24.17 we have superimposed the graphs of f(A), shown dashed, and 
g(A — t), for t being negative, and then positive. Where the graphs do not overlap, 
the product f(A)g(A — t), and hence the correlation, is zero. 


When t < 0 


When t < 0 the graphs overlap for 0 < 4 < oo. Hence 


fre= f f(A) g(A — t) da 


CO 
/ er e724) ay 

0 

CO 
of e 4 dx 
0 

CO 

|, 


—3A 


(a) 


g(A— t) when t < 0 


© Siinw 


(b) ; 


g(A— t) when t > 0 


Figure 24.17 
Graphs of f(A), and g(A — ft) for (b) t < 0, 


(c) (c)t > 0. 


A 
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Whent > 0 
When t > 0 the graphs overlap for tf < 1 < oo. Hence 


fee= | fA)g(A — t) da 


= Le 


3 


Finally the complete expression for the correlation is 


e* whent <0 


1 
(feoO=) 4 


5 e’ whent >0 


(b) Using Table 24.1 the Fourier transforms of f and g are 


1 
F — iar G — 
Ne eae = Fee 
and hence 
1 
F(w)G(—q@) = 


(1 + jw) — jo) 


Furthermore, taking the Fourier transform of f * g, obtained in part (a), we have 


0 il ; oO] : 
Fifxgh= / —e% eI” dt +f ~e%e™ dt 
66 3 0 3 
ef2-io) 7° ef(-l-ja) 7% 
- [se-0|_.* Les] 
S210) | ng, LO(e 1 18) to 
1 1 


~ 3@—ja) * 30 +40) 

_ 3 +43j0 +6 = Jo 

~ 9(2 — jo) (1 + jo) 

_ 1 

~ (2 —jo)(1 + jo) 
We have shown that F{f * g} = F(w)G(—q@) and so the correlation theorem has 
been verified. 


EXERCISES 24.8 


1. Find the convolution of 


ron {s — 
0 otherwise 
and 
4 -1<t<3 
8) = { 0 otherwise 


2 The convolution of a function with itself is known as 
autoconvolution. Find the autoconvolution f « f 
when 

1 -l<r<l 
iQ) = to otherwise 


3. Find the correlation of f(t) = 1 for —1 <t < 1 and 
zero otherwise, and g(t) = u(t) e~’. Verify the 
correlation theorem for these functions. 
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Prove that convolution is commutative, that is 
f *g = 9% f. Note that correlation is not. 


Show that if f(t) and g(t) are both zero, fort < 0, 
then (f * g)(t) = fo fA)g(t — A) da. 


Prove the convolution theorem. 


Show that f(t) * g(t) = f(t) * g(—t). Deduce that a 
correlation can be expressed in terms of a convolution. 
Show also that f(t) * g(t) = f(t) * g(—t). 


Prove that the correlation integral 
fxeg= hie f)g(A — t) da can also be written in 


the form [°° g(a) f(t + A) da. 


Prove the correlation theorem. 


Solutions 
0 t<-l t+2 —-2<t<0 
4 at? “p292> 2 fxf= 42-1 O0<1t<2 
3 0 otherwise 

1 f*xg= } 12 2<t<3 

4 e(e!—e!) t<-1 

12- 5-3) 3<t<6 B® ofege die! =i<#<1 
0 t>6 0 oe 
Ye €6oTHE DISCRETE FOURIER TRANSFORM 


For most practical engineering problems requiring the evaluation of a Fourier transform 
it is necessary to use a computer and so some form of approximation is needed. In this 
section we show how a function f(t), with Fourier transform F'(@), can be sampled at 
intervals of T to give a sequence of values f(0), f(T), f(2T) .... The discrete Fourier 
transform (d.f.t.) takes such a sequence and processes it to produce a new sequence 
which can be thought of as a sampled version of F (w). The d.f.t. is important as it is the 
basis of most signal and image processing methods. The use of a computer is essential 
because of the enormous number of calculations required. 

We start by stating the transform and provide a simple example to show how it is 
calculated. The interested reader should refer to Section 24.10 for a derivation which 
shows the relationship between the Fourier transform and the d.f.-t. 
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24.9.1 


Example 24.19 


Solution 


Definition of the d.f.t. 
We consider a sequence of N terms, f[0], f[1], f[2],..., f[N — 1]. 


The d.f.t. of a sequence f[n],n = 0, 1, 2,..., N—1, is another sequence Fk], also 
having N terms, defined by 
N-1 
Flk|— ye fies we tork:—(O0sle27a N= 
n=0 


We write F[k] = D{f[n]} to denote the d.f.t. of the sequence f[n]. 
There are a number of variations of this definition and you need to be aware that 
different authors may use slightly different formulae. This can cause confusion when 


1 
the d.f.t. is first met. In particular, some authors include a factor W in the definition, and 


others use a positive exponential term instead of the negative one given above. What is 
crucial is that you know which formula is being used and you apply it consistently. 
We now give an example to show the application of the formula. 


Find the d.f.t. of the sequence f[n] = 1, 2, —5, 3. 


We use the formula 


N-1 
D{f[n}} = Fik] = Y- fin e2inkr/N fork =0,1,2,...,N—1 


n=0 
Here the number of terms, N, is four: 


3 
Fikl=>_ flnle 2" ~— fork = 0, 1,2,3 


n=0 


So, when k = 0, 


3 
F(0] = D7 flnle® 
n=0 


=1+2+(-5)+3 
= 


When k = 1, 


3 
F{1) _ ria eo 2int/4 
n=0 
=| ae Qe 2i7/4 4. (—5) e 2i27/4 4 3e7 2371/4 
= 14 2e°% — 57% 4 3e 3? 
= 142(-j) -5(-1) +3j 
=6+j 


24.9 The discrete Fourier transform 785 


When k = 2, 


3 
F[2] = DU flnje mnt 


n=0 


3 
=) flle™ 


n=0 
= 1+2e37 + (—5) 47 4 3073" 
14+ 2(-1) —5(1) + 3(-1) 
= —9 


Finally, when k = 3, 


3 
F[3] = i eo 2n3/4 


n=0 


3 
= S- flr) enn /2 


n=0 
=| <t Je 3iT/2 of (—5) ei" 4 3797/2 
=1+2q) -—5(-1) +3(-j) 
=6-j; 
So, the d.f.t. of the sequence 1, 2, —5, 3 is the sequence 1, 6 +j, —9, 6 —j. 


You will note from this simple example that a significant amount of calculation is nec- 
essary even when there are just four points in the sampled sequence. This is why it is 
necessary to use a computer program to find the transform. Even then, the computation 
can be very time consuming. For this reason, much effort has gone into developing fast 
algorithms known as fast Fourier transforms (f.f.t.s). For details of these algorithms 
you should refer to one of the many specialist texts on the subject. You should also inves- 
tigate which packages are available to enable you to perform this sort of calculation. For 
example, the computer package MATLAB® has built-in commands for finding a d.f.t. 
using an f.f.t. For example, the MATLAB® command fft (f) calculates the d.f.t. of a 
sequence f as shown below: 


f=[1 2 -5 3]; 
y=fft (£) 
y = 
1.0000 6.0000+ 1.0000i -9.0000 6.0000- 1.00001 
In MATLAB*® either j or i can be used in input to represent /—1, for example 
1455 
or 


1+5i 
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and they are both equivalent. However when variables are printed, i is always used. 
Hence both of these inputs if typed at the command line would give 


ans = 1+ 5i 


this should be borne in mind when viewing the results of the f-f.t. 


EXERCISES 24.9.1 


Use the definition to find the d.f.t. of the sequences 
film] = 1,2, 0, —1 and g[n] = 3,1, -1, 1. 


Calculate the d.f.t. of the sequence f[n] = 5, —1, 2. 


Use a technical computing language such as 
MATLAB® to verify your answers to Questions 1 


pe flnje%"k"/" — FTK] where the overline 
denotes the complex conjugate of F'[k]. 


For a sequence of complex numbers FA], let FIk] 
represent the sequence obtained by taking the 
complex conjugate of each term in the sequence. 
Show that if f[n] is a sequence of real numbers, then 


and 2. F(N —k] = F[k] fork =0,1,2,...,N/2 if N is 
even, and fork = 0,1, 2,..., (N — 1)/2 if N is odd. 
This can be seen in Example 24.20 where N = 4 and 


F([3] = F[I]. 


4 From the definition of the d.f.t. show that if f[m] is a 
sequence of real numbers with d.f.t. F[k], then 


Solutions 


1 F[k] = 2,1 — 3j,0, 1+ 3). G[k] = 4, 4, 0,4 | 2 6, 4.5 + 2.5981j, 4.5 — 2.5981j 


24.9.2 The inverse d.f.t. 


Just as there is an inverse Fourier transform which transforms F (w) back to f(t), there 
is an inverse d.f.t. which converts F[k] back to f[n]. 


The inverse d.f.t. of the sequence F[k], 
k=0,1,2,...,N—1, is the sequence f[n], also having N terms, given by 
eee! 
D'{F[k}} = fn} = — YS Fike" n=0,1,2,...,N—-1 
{FIA} = fla] = = ye [k] 


Example 24.20 Using the definition, find the inverse d.f.t. of the sequence F[k], for k = 0, 1, 2, 3, 
given by 


F[k] = —4, 1,0, 1 


Solution In this sequence there are four terms and so N = 4. From the definition 


3 
D"'(F[k]} = fin] = ; Yi Fike"/* ~~ n=0,1,2,3 
k=0 


3 
1 ; 
=] Fine’ 1=0,1,2,3 
k=0 
1 . ; ; 
= gi-4+ 1ei"7/2 + 0e”"" ac 1e7i"7"/7) 
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Then, taking n = 0, 1, 2, 3 gives the sequence: 


flr] 1 : 1 
nj=—7, ’ ’ 
2 2 
EXERCISES 24.9.2 

1. Use the definition to find the inverse d.f.t. of the 3 Prove that f[n] = Es yee F[k] e2"7/N is indeed 

sequence F[k] = 6, —2, 2, —2. : N N-1 dikmrt/N 
the inverse of F[k] = )°),-9 flmle"7 mm/N by 

2 Investigate whether you have access to a computer substituting the expression for F'[k] from the second 
package which will calculate an inverse d.f.t. Use the formula into the first, interchanging the order of 
package to verify your answer to Question 1. summation and simplifying the result. 


Solutions 


Mm f(r) =1,1,3,1 


72a = DERIVATION OF THE D.F.T. 


24.10.1 Some preliminary results 


A number of results discussed earlier in this book will be required in the development 
which follows. To assist in this development we remind you of these now. 

Euler’s relations have been discussed in Section 9.7. Recall that 
—jo 


e ” =cos@ —jsin@ 


and in particular that 
e 7" — cos 2nn — j sin 2nn 


Furthermore, when 7 is an integer cos 2n7t = | and sin 2n7t = 0 and so erm = |, 
The following integral property of the delta, or impulse, function will be needed: 


/ f(t)5(t — a) dt = f(a) 


So, multiplying any function, f(t), by the delta function 6 (t—a), representing an impulse 
occurring at ¢ = a, and integrating, results in f(a). This sifting property of the delta 
function, so called because it sifts out the value of f(t) at the location of the impulse, 
has been discussed in Section 16.4. In particular, when f(t) = e~)® we note 


/ e/’'§(t —a)dt =e 
When a continuous function f(t), which is defined for t > 0, is evaluated at times 
t = 0,7,2T,...,nT,... we obtain the sequence of values f(0), f(T), f(2T),..., 
f(aT),..., which we denote more concisely by the sequence f[0], [1], f[2],-.-, 
f[{n],.... This sequence can be expressed in the form of a continuous function, /(f), 
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24.10.2 


Example 24.21 


in the following way: 


f)=T > flan) 


n=0 


Note that whereas f(t) is the original continuous function of f, f (t) is an approximation 
obtained using only the sampled values. This representation is discussed in greater detail 
in Appendix I. 


Derivation 


This derivation is based upon the fact that a function f(t), defined for t > 0, and having 
been sampled at intervals of JT, can be represented by the function f(t) where 


fOQ)=TY- flab — nT) 


n=0 


In any real problem it is only possible to sample a signal over a finite time interval. 
Suppose we obtain N samples of the signal at times t = 0, T, 2T,..., (N — 1)T. Then 
the sampled signal can be represented by amending the limits of summation and writing 


N-1 


f)=T >> flab nT) 


n=0 
Taking the Fourier transform of both sides gives 


N-1 


Ff} = rf eo S- flnlsee —nT)dt 


n=0 


If we make the assumption that it is permissible to interchange the order of integration 
and summation we find 


N-1 = 
FROr=T Yo sfim [ema —nryar 

n=0 =< 00 

N-1 


=f) face 


n=0 


where we have used the sifting property of the delta function. The quantity on the r.h.s. 
is a continuous function of w derived using the values in the sequence f[n]. Write this as 
F (@) and note that F (@) is an approximation to the Fourier transform F'(w) = F{f(t)}. 
It can also be thought of as a Fourier transform for sequences. An important point to 
note about this function of q is that it is periodic with period alg This is proved in the 
following example. : 


Show that the function 
N-1 


Fo) =T > finje"” 


n=0 


: eee 3 2 
is periodic with period T° 
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i : s 27 
Solution Consider F{ @ + a for any w. 


N-1 


y 2 ; 
F( 4 =) =T y~ fla] eg i(ot2n/T nT 


n=0 
N-1 
— -jonT .—j22nT D Pe" 

=f finle e/T using the laws of indices 


=f fin] e lent ens 


Now we saw in Section 24.10.1 that 
e773 — cos nn — jsin2nn = 1 


Finally we have 


27 = 
i = —jonT 
(w + =| =T d finje? 
which equals F (w). 


ms OT & re 
We have shown that F (© + =) = F(a) for any w. Hence F(q) is periodic with 


‘od 27 
eriod —. 
P T 


For computational purposes it is necessary to sample F (@) at discrete values. Suppose 
27 
we choose to sample at N points over the intervalO < w < Tr: By choosing this 
interval we are sampling over a complete period in the frequency domain. So we choose 
27 ~( 27 
the sample points given b = k— fork = 0,1,2,..., (VN — 1). Writing F| k— 
‘ ple p 8g yo NT ( ) 8 ( a) 
as F'(@,) we have 


N-1 


F() =T D0 fle" 
n=0 
N-1 
=7y fae (24.13) 


n=0 


This last sum, without the factor 7, is usually taken as the definition of the d.f.t. of f[n], 
written F'[k]. To denote the d-f.t. of the sequence f[n] we will write D{f[n]}. 


The d.f.t. of the sequence f[n],n = 0,1, 2,...,N — 1, is the sequence given by 
N-1 
Diill) — Hl — ye files ec erton c= 10)1 2 eV all 


=O 
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24.11 


Example 24.22 


Solution 


When we use this transform to approximate the Fourier transform of the function f(t) 
sampled at intervals 7’, it follows from Equation (24.13) that we must multiply by the 
factor T to obtain our approximation. An example of this is given in the next 
section. 


USING THE D.F.T. TO ESTIMATE A FOURIER TRANSFORM 


One application of the d.f.t. is to estimate the continuous Fourier transform of a signal 
f(t). The following example shows how this can be done. 


The signal f(t) = u(t)e~~%, where u(t) is the unit step function, is shown in 
Figure 24.18(a). Its Fourier transform, which can be found from Table 24.1, is 


F(@) = 


24 jo and a graph of |F (w)| is shown in Figure 24.18(b). 
Suppose we obtain 16 sample values of f(t) at intervals of T = 0.1 from t = 0 to 
t= 1.5. 
(a) Obtain the d.f.t. of the sampled sequence. 
(b) Use the d.f.t. to estimate the true Fourier transform values. 


(c) Plot a graph to compare values of |F[k]| and |F(@)|. 


(a) If f(t) = u(t) e~*' is sampled at t = nT, that is t = 0, 0.1, 0.2,..., 1.5, we obtain 
the sequence f[n] = e~7"7 = e~°* forn = 0, 1,2,..., 15. This is the sequence 
given in the second column of Table 24.2. 


(b 


we 


The calculation of the d.f.t. is far too laborious to be done by hand. Instead we have 
used the MATLAB® command fft() to do the calculation and the results, F [Kk], 
have been placed in the third column. 


(c 


Ym 


27 
From Section 24.10 we know that the d.f.t. samples at intervals of NT in the fre- 


quency domain. So for comparison the fourth column shows the true values of F (w) 


LY] f= u(new*! 


(a) 


Figure 24.18 
The signal f(t) = u(t) e~! and its Fourier transform. 
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Table 24.2 
n,k jfiia=s"* FIk] F (a) 0.1F [k] 
0 1.0000 5.2918 0.5000 0.5292 
1 0.8187 1.4835 — 1.9082; 0.1030 — 0.2022; 0.1484 — 0.1908; 
2 0.6703 0.7882 — 1.0837; 0.0304 — 0.1196; 0.0788 — 0.1084; 
3 0.5488 0.6311 — 0.6952; 0.0140 — 0.0825} 0.0631 — 0.0695 
4 0.4493 0.5743 — 0.4702; 0.0080 — 0.0626; 0.0574 — 0.0470} 
5 0.3679 0.5485 — 0.3159j 0.0051 — 0.0504; 0.0548 — 0.0316j 
6 0.3012 0.5355 — 0.1964) 0.0036 — 0.0421) 0.0536 — 0.0196j 
7 0.2466 0.5293 — 0.0944) 0.0026 — 0.0362; 0.0529 — 0.0094; 
8 0.2019 0.5274 0.0020 — 0.0317} 0.0527 
9 0.1653 0.5293 + 0.0944) 0.0016 — 0.0282; 0.0529 + 0.0094) 
10 0.1353 0.5355 + 0.1964) 0.0013 — 0.0254) 0.0536 + 0.0196j 
1 0.1108 0.5485 + 0.3159j 0.0011 — 0.0231) 0.0548 + 0.0316 
12 0.0907 0.5743 + 0.4702; 0.0009 — 0.0212; 0.0574 + 0.0470} 
13 0.0743 0.6311 + 0.6952; 0.0008 — 0.0196; 0.0631 + 0.0695; 
14 0.0608 0.7882 + 1.0837; 0.0007 — 0.0182; 0.0788 + 0.1084) 
15 0.0498 1.4835 + 1.9082; 0.0006 — 0.0170} 0.1484 + 0.1908; 


27 
also sampled at intervals of NT’ Recall that to obtain an estimate of the Fourier 


transform of f(t) we must multiply the d.f.t. values by T = 0.1. This is shown in 
the fifth column of the table. 


Figure 24.19 shows graphs of |0.1F [A]| and |F (w)|. Values of |F[k]| obtained beyond 
k = 8 are mirror images of the earlier values. This is because F[N —k] = F[k] as proved 
in Question 5 in Exercises 24.9.1. This is a phenomenon of the d.f.t. and arises because 
of its periodicity and symmetry properties, further details of which are beyond the scope 
of this book. The interested reader is referred to a text on signal processing for further 


details. 


0.6 
0.5 
0.4 2k 
— > |F(,)| = FFP) 
0.3 
s=-= 0.1 /FTRI| 


Figure 24.19 
Comparison of true values of a Fourier 


transform and approximate values obtained 
0 2 4 6 8 10 12 14 k  ysing a df. 
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724 ~MATRIX REPRESENTATION OF THE D.F.T. 


Example 24.23 


Solution 


When it is necessary to develop computer code for performing a d_f.t., an understanding 
of the following matrix representation is useful. 
We have seen that the d.f.t. of a sequence f[n] is given by 


N-1 
F[k] — ze fila] e2inkt/N 
n=0 
Consider the term e~7"*"/" 
Define W = e~2i7/" so that 


which can be written using the laws of indices as (e~7)7/" )"*, 


N-1 
F{k] = 90 flnlw™ 


n=0 
Note that W does not depend upon n or k. For a fixed value of N we can calculate W 
which is then a constant. Writing out this sum explicitly we find 
F[k] = fl0}W° + fUJW* + fi21W™ + fI3]W* +--+ 
+f[IN-1we- = fork =0,1,2,...,.N—1 


Writing this out for each k we find 


F[0] = f[O]w° + fl1JW° + fI2]w° + fI3]JWo +---+ fIN—-1]W° 
F(1] = floyw° + fliyw' + fi2]w? + fi3w? + ---+ fIN-LW* 
F[2] = f[0]w° + FW? + f[2]W* + f[3]Wo +--+ fIN-Hw?e-) 


FIN —- 1] = flow? + ftlywr! + f(2]w2"-) 
+ fBIW28—) +--+ f[N - ywe-be-b 


These equations can be written in matrix form as follows: 


F[0] wo we we... =w f(0] 

F{1] Ww? wi! we... Wwr-! fUl] 

F227] |_|wWw° w wt ... wed f12] 
FIN — 1] we wr-l wv) . ww-bo-b FIN — 1] 


where W = ei7/", 
(a) Find the matrix representing a three-point d_f.t. 


(b) Use the matrix to find the d.f.t. of the sequence f[n] = 4, —7, 11. 


(a) Here N = 3 and so W = e-°/3, The required matrix is 


1 1 
-2nj/3  @—4ni/3 


—4nj/3 87/3 


le 
le 
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which using the Cartesian form can be written 


1 1 1 
twa 1 ow 
1 -=-] =Sor J 
2 2 2 2 
bee 2 
a. i a) 
1 1 1 
F[0] L We ot . ald 4 8 
(b) FOI) =]! 3-3 5435 | [7] =| 24 15.5885; 
F[2] 1 73 61) W354 11 2 — 15.5885; 
1 +j =] 
2 2 2 2 
So, the required d.f.t. is the sequence 
8, 2 + 15.5885j, 2 — 15.5885) 
EXERCISES 24.12 
1. Using the definition of the d.f.t. show that the matrix Use this matrix to find the d.f.t. of the sequences 
which can be used to implement the d.f.t. of a fla] = 01, 2,0, 1} and gin] = 13,1, —1, 1}. 
four-point signal is 
1 1 1 
1 -j -l j 
as 1 -l 1 -1 
{ j= 4 


Solutions 


1 F[k] = 2,1 —3j, 0, 1 +3). G[k] = 4,4, 0,4 


24.13 


24.13.1 


24.13.2 


SOME PROPERTIES OF THE D.F.T. 

Periodicity 

The d.f.t. F[k] is periodic with period N, that is 
F[k+N] = FI[k] 


This is why we take only NW terms in the frequency domain. Calculating further values 
will only reproduce the earlier ones. The inverse d.f.t. is also periodic with period N. 


Linearity 


The d.f.t. is a linear transform. This means that the d.f.t. of the sequence af[n] + bg[n] 
where a and bp are constants is aF [k] + bG[k]. 
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24.13.3 Parseval’s theorem 
If D{ f[n]} = F[k] and D{g[n]} = G[k] then 


N-1 N- 
> fin — as F[KIGIK] 
n=0 k=0 


where the overline indicates the complex conjugate. 


i 


24.13.4 Rayleigh’s theorem 


This theorem is obtained from Parseval’s theorem by letting g[n] = f[n]. 


N-1 1 N-1 
do lflel? = 5 Do IIAP 
n=0 k=0 


Example 24.24 Verify Rayleigh’s theorem for the sequence f[n] = 5, 4. 


Solution Here N = 2. We first calculate F[k]. 


1 
= Y- fin ev ink 
n=0 


= f(0l+ fle" 
=5+4+4e%" 


When k = 0, we have F[0] = 5 + 4e° = 9. 
When k = 1, we have F[1] =5+4e7% =5—4=1. 
For brevity we often write F[k] = 9, 1. 


Then )-\_o [fin]? = 5? +4? = 41. 
Also, > y—9 IF IKI? = 9? + 1? = 82. 


1 
We note that 41 = 5 x 82 and this verifies Rayleigh’s theorem. 


EXERCISES 24.13 
7) (a) Obtain the d.f.ts of f[n] = 1,2, 2, 1 and (d) Hence verify Parseval’s theorem for these 
gin] = 2, 1, 1,2. sequences. 
(b) Verify Rayleigh’s theorem for each of these 
sequences. 3. Show that if the d.f.t. of f[m] is F[k] then the d.f.t. of 
ag en 2mjki/N rar ‘ 
2) suppose f{n] = 3, 1,5, 4 and g[n] = 2, -1,9, 5. fin — ilis e777" F [kK]. This is known as the shift 
theorem. 


(a) Show that )~3_9 fln|gin] = 70. 
(b) Obtain the d.f.t.s F[k] and G[k]. 
(c) Calculate F[k]G[k]. 5 Prove Rayleigh’s theorem. 


4 Prove Parseval’s theorem. 
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Solutions 
mM (a) F[k]=6,—-1—-j,0,-14}. me (b) F[k] = 13, —2 +3), 3, —2 — 3}. 
eiesieot—j G[k] = 15, —7 + 6j, 7, —7 — 6j 


(b) 3° | flnl|? = 10, © |F LAI? = 40. 


(c) 195,32 —9j, 21, 32+ 9j 


Y |glnl|? = 10, © |GlkI? = 40 


24.14 


24.14.1 


THE DISCRETE COSINE TRANSFORM 


There is an alternative method to that of the d.f.t. for transforming a set of time domain 
samples of the signal f(t) into the frequency domain, F'[q]. It is known as the discrete 
cosine transform or d.c.t. We have already seen that the discrete Fourier transform takes 
N samples, f[0], f[1], f[2], ..., f[N — 1] and produces N output samples in the fre- 
quency domain F[0], F[1], F[2],..., F[N — 1]. The d.c.t. is calculated in a similar way. 
However, the d.c.t. of a set of samples has special properties which make it particularly 
suitable for image and audio processing. These will become apparent in the following 
examples. 

A detailed derivation will not be presented here for the d.c.t., although much of the 
mathematics of the d.c.t. and the d.f.t. is very similar. It should also be mentioned that 
very efficient methods for computing the d.c.t. are possible; this is beyond the scope of 
this book which will focus on the important properties of the d.c.t. as a transformation. 


Definition of the d.c.t. and its inverse 
We consider a sequence of N terms, f[0], f[1], f[2],..., f[N — 1]. 


The d.c.t. of a sequence f[n],n = 0, 1, 2,..., NM —1, is another sequence Fk], also 
having N terms, defined by 


N-1 
FIR = >" simeos| (+5) 4] fork =0,1,2,...,N—1 


There are several variants of the definition of the d.c.t. The one given above is probably 
the most commonly used and is usually termed d.c.t.-II The main mathematical differ- 
ence between this and the d_f.t. is that in the d.c.t. only real numbers are produced in 
F[k], assuming real-valued samples in f[n]. The same cannot be said for the d.f.t. be- 
cause even with real-valued samples in f[n] the output F'[k] in general contains samples 
which have both a real and imaginary part. 

The d.c.t. and its variants are popular choices for image and audio compression. 
Compression is the process of reducing the volume of data associated with a signal by 
removing redundant, partially redundant or repeated data. We will see in the following 
examples that the d.c.t. is very useful for compressing certain types of signal. 

In order to make use of the d.c.t. in these applications it is necessary to use the inverse 
operation to recover the original samples. The inverse of d.c.t.-II is another d.c.t. termed 
d.c.t.-TIL. 
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The inverse d.c.t. of a sequence F[k], k = 0, 1,2,...,N — 1, is another sequence 
f{n], also having N terms, defined by 


N-1 - : 
fin] = ql Fla +2y reson Ee (>+-5)]| 


k=1 
form =O, 1,2, coc oY = Il 


Again, a derivation of this inverse will not be presented here. However, we will demon- 
strate the inverse property for a specific example. 


Example 24.25 (a) Find the d.c.t., F[k], of the sequence f[n] = 2, 4, 6. 
(b) Apply the inverse d.c.t. to F[k] and show that the original sequence, f[n], is 
obtained. 


Solution (a) We use the formula 


Fik] = Fee Sin cos E (+5) 4 fork =0,1,2,...,N—1 


Here the number of terms, N, is three. 


When k = 0 
dP ae nt 1 
F[0] = Wp cos E (n+ 5) x 0| 
ap) 0+ 4cos0 + 6cos 0] Z 
= —[2cos cos cos 0] = — 
V3 V3 
When k = 1 
2 
1 1 
Fil] = i cos E (n+ ;) x 1 
SAileee ea OG 4 
Se — cos 5 cos 
3 6 =| 
i V3 
= —|2x —+0+6x 30/3| = —2 
alex tore (2) = als-14 
When k = 2 
i = Tn 1 
F[2] = F 2 Jt cos E (: + 5) x 2| 
1 ™ 51 
= Fz |2e08 F + 400s7+ 608 | 
3 3 3 
: 2 are (-1)+6 : 1-—4+3=0 
— ed x= x (— Xr l= = = 
J3 2 2 


12 
So the d.c.t. of the sequence 2, 4, 6 is —~, —2, 0. 
V3 
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(b) We now apply the inverse d.c.t. to F[k]. 
The inverse d.c.t. is 


fin] = = FO] + 25 Fld cos E (: 4 5) 
JN = N 2 


forn=0,1,2,...,N—1 


When n = 0 


f[0] = make Pri teeos] Fk (0+ 5)]| 


k=1 


=| Fe2xc 2) x cos (= t)+2x0| 
V3LV3 6 
1 | 12 J3 
— 4 0|}=4-2=2 
ala “a 
When n = 1 


mesf' obo] 
Bee 2) x eos (7) +20] 


-4x040]=4 


When nn = 2 


f2l= A en ejoos |e (2+ 5)]| 


1] 12 57 
= Ses (—2) x00 (Z) +20] 


i i ta V3 
=f 8 au (A) 40] estan 


So f[n] = 2, 4, 6, which was the original sequence as expected. 


If a d.f.t. had been performed on the sequence f[n], the output, F[k], would have been 
the sequence of complex terms 12, —3 + j1.7321, —3 — j1.7321. Compare this with 


12 
the d.c.t. which is F[k] = VF —2, 0. Note that this latter sequence contains only real- 


valued terms. 
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Engineering application 24.2 


Effect of truncating the d.c.t. and d.f.t. of a set of samples 


Now we turn our attention to a longer sequence and consider the effect of deleting 
samples in the d.c.t. 

In this example, we produce both a d.c.t. and a d.f.t. of a given set of samples 
and then set some of the samples in the latter part of the sequence to zero. Following 
this we carry out an inverse d.c.t. to return an approximation to the original sample 
set. 

We consider the sequence of samples 


Fi 0; 1 253,4,5,6,.77 8,2 
Using the methods described in the previous example we can show that the d.c.t. is 


Focrlk] = 14.2302, —6.3815, 0.0000, —0.6835, 0.0000, 
— 0.2236, 0.0000, —0.0904, 0.0000, —0.0254 


The notation Fpyc7 has been used to distinguish this sequence from the d.f.t. sequence 
that it will be compared to. The d.f.t. is 


Fyprlk] = 45.0000, —5.0000 + j 15.3884, —5.0000 + j6.8819, —5.0000 
+ 53.6327, —5.0000 + j1.6246, —5.0000, —5.0000 — j1.6246, 
— 5.0000 — j3.6327, —5.0000 — j6.8819, —5.0000 — j15.3884 


By definition, applying the appropriate inverse operations to either set of data would 
recover the original input samples. 

If instead we delete samples from the end of each sequence by setting the values 
to zero and then perform the inversion then the results are different. In this example 
we set the last five samples to be zero. So we perform the inversion on 


Focrlk] = 14.2302, —6.3815, 0.0000, —0.6835, 0.0000, 0, 0, 0, 0, 0 
Fyprlk] = 45.0000, —5.0000 + j15.3884, —5.0000 + j6.8819, —5.0000 
+ j3.63271, —5.0000 + j1.6246, 0, 0, 0, 0, 0 


The results of the inversion are shown plotted in Figure 24.20. The values obtained 
from the inverse d.c.t. are all real values. The values plotted on the graph for the 
inverse d.f.t. are the magnitudes of the complex values that are returned. This is nec- 
essary because now samples have been removed from the d_f.t., it can no longer be 
guaranteed that the solution consists of wholly real numbers. 

Notice that the samples in both cases are not exactly the same as the ones we 
started with. This is expected because in setting some of the sample values to zero in 
the d.c.t. we have destroyed some data. In fact we have destroyed half of the samples 
in this example. However, notice also that the d.c.t. appears to perform much better 
than the d.f.t. in recovering the input data. The reason the d.c.t. looks qualitatively 
better than the d.f.t. is that the information content or energy is concentrated in the 
lowest order samples. Hence when we set the later samples to zero they affect the 
d.c.t. far less than the d.f.t. 
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Figure 24.20 
Lossy compression using the d.c.t. and the d.f.t. Markers: circle = original data; square = 
d.c.t. derived values; diamond = d.f.t. derived values. 


This is an example of lossy compression, which is used extensively in audio, image 
and video compression. Because some of the samples are known to be zero they do 
not need to be stored and hence the file or data storage requirement is much lower. An 
approximation of the original data can be recovered even though not all of the original 
d.c.t. samples are provided. It can be shown that the more samples we retain, the better 
the quality of the reproduction. It is well known that the d.c.t. performs well on straight 
line data and is used extensively in compression standards for images such as JPEG. In 
image data which is viewed qualitatively a certain amount of data loss is tolerable, and 
this will be illustrated in the following example. 


Engineering application 24.3 


Two-dimensional d.c.t. and image compression 


So far we have only seen the d.c.t. applied to a one-dimensional data set. This could 
be used to handle the compression of audio data, for example. When applying the 
d.c.t. to a two-dimensional problem such as an image a slightly different form is 
required: 


1 N-1 (M-1 = 1 Pe 1 
Fk, 1] = INN [> som cos E (m+ 5) ‘fem E (n+ 5) c 


tora Ol, Be cas gl = Il 
m=0,1,2,...,M—1 
This formula can be applied to a 2D data set. We take the greyscale image given 


in Figure 24.21 as the source data. Each pixel in the image has assigned to it a 
numerical brightness value and it is to these values that we apply the d.c.t. 
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f(, 0) —> 


Figure 24.21 
Source image for d.c.t. 
—+— f(124, 124) application. 


We use the transform given above to produce an N x M matrix containing the 
real d.c.t. values, F[k, /]. Figure 24.22 is a visualization of F[k, /]. Dark pixels rep- 
resent large values, white pixels represent zero or close to zero elements. Notice the 
concentration of large values close to F[0,0], which is in the top-left corner. 


F(0, 0) 


F(124, 124) 


Figure 24.22 
The d.c.t. matrix. 


In the next step we compress the image by setting the array elements to zero at the 
right-hand side and bottom. Since the coefficients are small in this region the effects 
on the image after inversion are minor, unless a significant number of values on the 
right-hand side and bottom are set to zero. 

The results of applying the inverse transform are shown in Figure 24.23. We can 
see that the image on the right which contains the lowest number of d.c.t. coefficients 
has more distortion due to the compression. Such distortions are often termed visual 
artefacts. The advantage of having a highly compressed file is that it requires less 
space for data storage. 

In practice, the image is often broken into smaller blocks with the d.c.t. being 
carried out on these blocks. If a highly compressed image is viewed on a computer 
at a low resolution it is often possible to see these blocks, which typically measure 8 
by 8 pixels. 


24.15 


24.15.1 
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it a 


Figure 24.23 
Effect of setting elements to zero in the d.c.t. matrix. Moving from left to right a greater 
number of elements in the matrix are set to zero, resulting in more image artefacts. 


DISCRETE CONVOLUTION AND CORRELATION 


As stated at the beginning of Section 24.8, convolution and correlation are important 
techniques in signal and image processing. In this section we will describe their discrete 
representations. Both of these can be implemented efficiently using the d.f-t. 


Linear convolution 


The linear convolution of two real sequences f[n] and g[n] is another sequence, h[7] 
say, which we denote by f * g, which is defined as follows: 


Linear convolution of f[n] and g[n]: 


hin} = fxg= )~ flmigin—m] forn=...—3,-2,-1,0,1,2,3,... 


m=—OO 


Notice the similarity between this definition and that of convolution defined in the con- 
text of the continuous Fourier transform in Section 24.8. Notice also that in this for- 
mula the sequence g is folded and shifted. This will be illustrated in the example which 
follows. 

Frequently the sequences being considered will be finite. 

Assume now that f[1] is a finite sequence of N, terms, so that all terms other than 
F(0), f{1], ..., ALN, — 1] are zero. 

Suppose also that g[n] is a finite sequence of N, terms, with all terms other than 
g[O], g{1], ..., g[N, — 1] being zero. 

It can be shown that the sequence h[n] will have length N, +N,—1, and the convolution 
sum simplifies to the following: 


The linear convolution of the two finite sequences f[n] and g[n] is defined as 


hin] = fxg=)_ flmlgin—m] forn =0,1,2,3,...,(M +N, —2) 


m=0 
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Example 24.26 


Solution 


The terms of this sequence can be calculated by brute force as the following example 
will show, although in practice a convolution can be calculated much more efficiently 
using a d.f.t. 


Suppose f[n] is the sequence 3, 9, 2, —1, and g[m] is the sequence —4, 8, 5. 


(a) Find the linear convolution h[n] = f « g using the previous formula. 


(b) Develop a graphical interpretation of this process. 


(a) Note that both f and g are finite sequences of length 4 and 3 respectively. The con- 
volution f * g will be a sequence of length 4 + 3 — | = 6. By definition, 


hln| =) flmgin — m] forn=0,1,2,...,5 


m=0 


The first term in h[n] is obtained by letting n = 0: 


0 
h{O] = D> flmlgi0 — m| 


m=0 


= f[0]g{0] 
= (3)(-4) 


The second term is obtained by letting n = 1: 


1 
hE] = D2 flmigtl — m) 


m=0 


= f[O]gl1] + fl1g{9] 
= (3)(8) + (9)(-4) 
= 24 — 36 

=) 


Subsequent terms are calculated in a similar manner. You should obtain these for 
yourself to ensure that you understand the process. The complete sequence is 


hn] = f * g= —12, —12, 79, 65, 2, —5 


(b) The graphical interpretation is developed along the same lines as was done for the 
continuous convolution in Section 24.8. 


The sequences f[m] and g[m] are shown in Figure 24.24. 

The sequence g[—m] is found by reflection in the vertical axis, that is folding as 
shown in Figure 24.25(a). Then the folded graph can be translated n units to the left or 
the right by changing the argument of g from g[m] to g[n — m]. If n is positive the graph 
moves to the right. Study Figures 24.25(b—g) to observe this. Convolution is the sum 
of products of f[m] with g[n — m]. The graph of f[m] is superimposed. We are only 
interested in values of n for which the graphs overlap — otherwise each product is zero. 
For each value of n the superimposed graphs make it easy to see which values must be 


1 


2 


Figure 24.24 
The sequences f[m] = 3,9, 2, —1 and g[m] = —4, 8, 5. 
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multiplied together and added. If required, a table can be constructed which summarizes 
all the necessary information as shown below. 


m —2 
f{m] = 
n=0 gl—m] 5 
n=1 gil — m] - 
n=2 g[2 — m] - 
n=3 gl3 — m] - 
n=4 gl4 — m] - 
n=5 g[5 — m] - 
EXERCISES 24.15.1 
Given f[n] = 1, 2,2, 1 and g[n] = 2, 1, 1, 2, find the 
linear convolution f * g. 
(a) The linear convolution of the sequences 
3,9, 2, —1 and —4, 8, 5 was obtained in 
Example 24.27. Show that this convolution is 
equivalent to multiplying the two polynomials 
3 49x + 2x? — x3 and —4 + 8x + 5x7. 
(b) By using polynomial multiplication find the 
linear convolution of the sequences f[n] = 9, —8 
and g[n] = 1, 2, —4. 
Solutions 
2,5,7, 8,7, 5,2 
(b) 9, 10, —52, 32 (c) 63, 11, —17, 10, —12 


24.15.2 Circular convolution 


0 1 2 3 4 5 
3 2 =i 2 . 
ex a _ = = 7 
8 -4 - = = ™ 
5 8 -4 i - z 
= 5 8 -4 = 2 
2 2 5 8 tf 2 
7 Hs . 5 8 af 


(c) Use the polynomial method to find the linear 
convolution of f[n] = 9, —1,3 and 
gin] = 7,2, —4. 


Find the linear convolution of f[n] = 1, —1, 1,3 and 
gin] = 7, 2,0, 1. 


Prove from the definition that linear convolution is 
commutative, that is f * g = gx f. 


7, —5,5, 24,5, 1,3 


In this section we consider periodic sequences having period N. We can select one period 
for examination by looking at the terms f[0], f[1], f[2], ..., f[N—1], say. For example, 


the sequence 


fla) =. — 7,11, 2,-=7, 11, 2, =7, 11,2... 
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(b) -44 


Figure 24.25 


sum of products: 
(3)(-4) = -12 


sum of products: 
(3)(8) + (9)(-4) = —12 


sum of products: 
(3)(5) + (9)(8) + @)(-4) = 79 


sum of products: 
(9)(5) + (2)(8) + (-1(-4) = 65 


sum of products: 


(2)(5) + (-1)(8) = 2 


sum of products: 
(-D)G6) = 5 


The effect of translating g[—m] by n units, forn = 0,1,..., 5. 


Example 24.27 


Solution 


g[0] 


g(l] g[2] 
g{-2] glt-1 
Figure 24.27 


The periodic sequence 
gin] = 7,2, —4. 
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f(0]=f [3] Ff [0] 
fia pois) fee —ziay “Meee eeas a 
A periodic sequence can be visualized 
(a) (b) by listing its terms around a circle. 


is a periodic sequence with period N = 3. Wecan select the terms f[0], f[1], f[2], that is 
terms —7, 11, 2, and use these to study the entire sequence. 

Such a sequence can be represented graphically by listing its terms around a circle as 
shown in Figure 24.26. Doing this allows us to calculate further terms in the sequence 
as we require them. Rotation anticlockwise represents increasing n as in (a). Rotation 
clockwise represents decreasing n as in (b). 

Suppose f[n] and g[n] are two periodic sequences having period N. Their circular 
convolution, or periodic convolution, is the sequence h[n], which we denote by f @® g 
and which is defined as follows: 


The circular convolution of two periodic sequences each of period N is defined as 
N-1 


hin] = f ®g=)>_ flmlgin—m] forn=0,1,2,...,N-1 


m=0 


The sequence is periodic with period N and so we can state it forn = 0,1, 2,..., 
N-1. 


(a) Calculate the circular convolution, h[n] = f @ g, of the two periodic sequences 
f{n] = 9, —1, 3 and g[n] = 7, 2, —4. 


(b) Develop a graphical representation of this process. 


(a) The sequence g[n] is depicted in Figure 24.27. 
We use the formula given above. In this Example, N = 3. First let n = 0. 


h{O] =D) flmlglo — m] 


m=0 


= f[0]s(0] + fU.Je[—1] + fl2}s{—2] 
= (9)(7) + (—1)(-4) + G)(2) 
= 73 

Next letn = 1. 


h{1] =D) flmigit — m] 


m=0 


= flOlgl1] + fUIgl0] + f[2}l—1] 
OQ eD)+ 4 
—1 
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n=0 n= n=2 


A 


1 
"a =A 
-4 2 7 4 @ 


Sums of products: 


()7) + (-DG 4) + BQ) = 73 


O)- 4) + (-DQ@) + B)M) = -17 
(9)(2) + (-D@) + B\-4) = -1 


Figure 24.28 
The sequences f[m] drawn around the inner circle, and g[n — m] drawn around the outer circle, 
forn = 0, 1, 2. 


(b 


we 


Finally, let n = 2. 


2 


h{2] = D> flmgi2 — m) 


m=0 


= f[0]gl2] + FU Isf1] + f[21s0] 
(9)(—4) + (-1)(2) + G)(7) 
=-17 


So the circular convolution f @ g = 73, —1, —17. 

If required, a table can be constructed which summarizes all the necessary in- 
formation as was shown in Example 24.26. The sequences must be extended to show 
their periodicity, and this time we are only interested in generating the convolution 
sequence over one period, namely that section of the table for m = 0, 1, 2. 


me <2 <1 0 1 2 3 4 5 

fim) 1 9 of 3 oe <i 3 
n=0 g[-m]  —4 a % =a 2 7 =A 2 
n=1  gfl—ml 7 of , T. of 2 a th 
n=2 ~~ 9[2—ml 2 G =i 2 7 -4 3 q 


A graphical representation can be developed by listing the fixed sequence f[m], 
for m = 0, 1, 2, anticlockwise around an inner circle as shown in Figure 24.28. 
We list g[—m] around an outer circle but do so clockwise to take account of the 
folding. By rotating the outer circle anticlockwise we obtain g[1 — m] and g[2 — m]. 
By multiplying neighbouring terms and adding we obtain the required convolution. 
The result is 73, —1, —17 as obtained in part (a). 


In the following section you will see how circular convolution can be performed using 
the d.f.t. 


1 


1 
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EXERCISES 24.15.2 
Calculate the circular convolution of f[n] = 1, —1, 1, 3 and g[n] = 7, 2, 0, 1. 
Solutions 
12, —4, 8, 24 
24.15.3 The (circular) convolution theorem 
Suppose f[n] and g[n] are periodic sequences of period N. Suppose further that the d.f.t.s 
of f[n] and g[n] forn = 0,1,2,..., NM — 1 are calculated and are denoted by F[k] and 
G[k]. The convolution theorem states that the d.f.t. of the circular convolution of f[n] 
and g[n] is equal to the product of the d.f.t.s of f[m] and g[n]. 
The convolution theorem: 
Dif © g} = FIKIGIA] 

This is important because it provides a technique for finding a circular convolution. It 
follows from the theorem that 

f ®g=D"{FIKIG(K} 
So, to find the circular convolution of f[n] and g[m] we proceed as follows: 
(1) Find the corresponding d.f.t.s, F[k] and G[k]. 
(2) Multiply these together to obtain F[kK]G[k]. 
(3) Find the inverse d.f.t. to give f @) g. 
Whilst this procedure may seem complicated, it is nevertheless an efficient way of cal- 
culating a convolution. 

Example 24.28 Use the convolution theorem to find f @ g when f[n] = 5, 4 and g[n] = —1, 3. 
Solution — First we find the corresponding d.f.t.s, F[k] and G[k]. Using 


N-1 


F[k] = Y- fin eo 2ink7/N 


n=0 
with N = 2 gives 


1 


1 
F[0] = )> flnl = 9, FI => fine" =5+4e" =1 


n=0 n=0 


808 Chapter 24 The Fourier transform 


Example 24.29 


Solution 


and similarly, 


il | 
G[0] = >> gla] = 2, Gl] = )0 gine" = -14+3e" =-4 


n=0 n=0 


F[k] =9,1 G[k] = 2, —4 
These transforms are multiplied together, term by term, to give 
A[k] = F[RJG[k] = (9)(2), (1)(—4) = 18, —4 


Finally the inverse d.f.t. of the sequence 18, —4, is found using 


N-1 


h{n] _ 5 its e2inkr/N 


k=0 


to give 
1 
(F@De)= 508-4) =7 


1 1 . 
(f © gi] = 5 (18) + 5 (40) =11 


and so 
f@®g=7,1l 


The convolution could also be evaluated directly using the technique of Section 24.15.2. 
You should try this to confirm the result obtained using the theorem. 


Use the convolution theorem and a computer package which calculates d.f.t.s to find the 
circular convolution of the sequences f[m] = 1,2, —1, 7 and g[n] = —1, 3,2, —5. 


You will need access to a computer package such as MATLAB® to work through this 
example. The d.f.t. of f[m] can be calculated either directly, which is laborious, or using 
the MATLAB® command fft (). 


F=fft({1 2 -1 7]) 
ans = 
9.0000 2.0000+ 5.00001 -9.0000 2.0000- 5.0000i 


Hence F'[k] = 9,2 + 5j, —9, 2 — 5j. 
Similarly, 


G=fft([-1 3 2 -5]) 
ans = 
-1.0000 -3.0000- 8.0000i 3.0000 -3.0000+ 8.0000i 


Hence G[k] = —1, —3 — 8j, 3, —3 + 8j. Then, the product of these d_-f.t.s is calculated 
by multiplying corresponding terms together: 


F[k]G[k] = —9, 34 — 31j, —27, 34+ 31j 


Example 24.30 


Solution 
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In MATLAB®, 
product = F.*G 


product = 
-9.0000 34.0000-31.0000i1 -27.0000 34.0000+31.0000i 


Finally, taking the inverse d.f.t., using the MATLAB® command ifft (), gives 


ifft([-9 34-311 -27 34+31i]) 
ans = 
8 20 -26 -i1 


This is the circular convolution of f[n] and g[m], that is 
f{n] ® gin] = 8, 20, —26, -11 


This example has illustrated how circular convolution can be achieved through the use 
of the d.f.t. 


The convolution theorem applies to circular convolution but not linear convolution. How- 
ever, by modifying the procedure slightly, the linear convolution of two finite sequences 
can also be found. 

If f[n] is a finite sequence of length N, and g[n] is a finite sequence of length N, we 
know from Section 24.15.1 that their linear convolution is a sequence of length N, + 
N, — 1. 

First we extend both the sequences f[n] and g[n] to make each have length N, +N, —1. 
This extension is done by adding zeros. This process is known as ‘padding’ with zeros. 
Then the d.f.t.s of the padded sequences are calculated to give F[k] and G[k]. It can 
be shown that the linear convolution of the original sequences is equal to the circular 
convolution of the padded sequences. Hence the linear convolution f « g is then found 
from 


f*g=D "{F[KGIk} 


Consider the following example. 


If f[n] is the finite sequence 7, —1 and g[n] is the finite sequence 4, 2, —7 use circular 
convolution with padded zeros to obtain the linear convolution f * g. 


Their linear convolution is a sequence of length 2+3-—1=4. 
We pad f and g to give sequences of length 4. 


f{n] =7, -1, 0, 0 and g[n] = 4, 2, —7,0 
Then, either by direct calculation or by using a computer package you can verify that 


F[kKJ=6,7+j,8,7-j and  G[k]=—-1,11—2j,-5,1142j 
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Then 

F[k]G[k] = —6, 79 — 3j, —40, 79 + 3) 
Then taking the inverse d.f.t. gives 

D7! {F[K|G[k]} = 28, 10, —51,7 


which is the required linear convolution. You should verify this by calculating the linear 
convolution directly. 


Engineering application 24.4 


Convolution reverb 


Convolution can be used on an audio signal to simulate the echoing or reverberation 
of a real room or space. An application of such a method can be found in recording 
studios where it may be desirable to simulate the acoustics of a large room such as a 
concert hall on a recording. 

The first stage involves capturing the impulse response of the room to be simu- 
lated. An impulsive sound source such as a pistol or a small explosion can be used to 
excite a broad range of frequencies. It can be shown that a short-duration impulse has 
a very broad spectrum. A popular alternative is to use a sinusoidal signal source of 
time-varying frequency. In the latter case the output of the convolution process, h[n], 
is known because the spectrum can be measured directly. The second stage consists 
of applying the inverse process, termed deconvolution, to this data in order to obtain 
the impulse response of the room. 

When the impulse response has been found, whichever method has been used, it 
becomes a case of convolving the impulse response of the room or space with the 
signal. It is necessary to carry out a linear convolution process to obtain the desired 
effect. We do not wish to use circular convolution because the signal is not periodic. 
However, we may wish to make use of the circular convolution theorem 


f ®g=D {D{fin}} - P{gin}}} 


Recall, however, that this theorem relates only to circular convolution, not to linear 
convolution. We may still make use of the convolution theorem if we pad the signals 
with zero values in order to prevent unwanted overlap. This is best illustrated by use 
of an example. 

The problem of calculating the circular convolution h[n] = f «&) g for the signals 
f{n] = 9, —1, 3 and g[n] = 7, 2, —4 has already been explained (see Example 24.27). 
The result of A[n] = 73, —1, —17 could have been obtained either graphically or us- 
ing the convolution theorem. The linear convolution is h[n] = 63, 11, —17, 10, —12 
and can be found using the direct method, and this result is presented here for refer- 
ence (see Exercises 24.15.1, Question 2(c)). 

To use the convolution theorem it is first necessary to take the original signals 
and pad them with zero values so that the overall length of each is equal to the sum 
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of the two — 1. So in the simple example, both signals start with length 3, and after 
padding they must have a length of 3 + 3 — 1 = 5. The padded signals are 

f'[n] = %, =ll, ap 0, 0 

2 lee 4 OO) 


The calculation of the first term in the output signal is shown graphically in Fig- 
ure 24.29. 


i n=0 
0 2 
Figure 24.29 
The padded signals f’[m] (inner circle) and g’[m] (outer 
0 =o circle). 


The first element of h[n] is (9)(7) = 63. Notice how the padding is sufficient to 
ensure that all of the values in the linear convolution can be produced without overlap. 
In this position all of the other terms are multiplied by 0 and hence can be removed 
from the calculation. By rotating the outer circle anticlockwise four more times the 
other values can be calculated. It can be seen that this method is exactly equivalent 
to carrying out the linear convolution. 

Hence we can use the convolution theorem on the padded signals f’[n], g’[n] and 


f @ g=D "(Dif In} - Dien} 


This method is sometimes preferred because the d.f.t. values of the signals are readily 
and quickly calculated on a computer. 

The signals may be padded with additional zeros if desired with no effect on the 
result. In order to optimize the computation speed for the d.f.t. it is often desirable 
to use the f.f.t. algorithm mentioned earlier. One restriction of the f.f.t. is that the 
number of samples in the input signal must be a power of 2, so it has to contain 2, 4, 
8, 16, 32,... samples. Since padding is possible without affecting the result then the 
f.f.t. can be used. 

As an example we consider adding reverb to a signal g[n], which is a 30 second 
duration piece of music played on an electric guitar. The signal g[n] is very ‘clean’, 
obtained by plugging the guitar directly into the recording equipment. As a result, it 
contains no reverberant room effects that would have been present if a microphone 
and amplifier had been used instead. An impulse response f[7] is obtained by bursting 
a balloon in an environment to be simulated. The two signals prior to introducing 
padding are shown in Figure 24.30. 

The two padded Fourier transformed signals are multiplied together and the in- 
verse d.f.t. taken to produce the signal in Figure 24.31. The signal h[n] has a duration 
of 32 s due to the effects of linear convolution. When the signal is played through 
speakers it has a distinctive echo or reverb effect that is not present in g[]. There are 
visible differences in the signal if it is examined carefully. 
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fn] Scale: = 


gin] 
Figure 24.30 
Two audio signals: a 2 s impulse response of an environment to be simulated, f[n], and a 30 s 


music track to be processed, g[n]. Both signals are sampled at a rate of 44 100 samples per 
second. 


Figure 24.31 
The output signal h[n] = f @ g including reverb. 


EXERCISES 24.15.3 


obtain the linear convolution of f[n] = 9, 0, 1 and 
g[n] = 5, 4, 5, 2, 1. Further, verify the convolution 
theorem for these sequences. 


1. The circular convolution of f[m] = 1, —1, 1,3 and 
g[n] = 7, 2, 0, 1 was calculated in Question 1 in 
Exercises 24.15.2. Verify the convolution theorem for 


these sequences. 3. Prove the circular convolution theorem. 


2. Use circular convolution and padding with zeros to 


Solutions 


1 F[k] =4, 4j, 0, —4j. G[k] = 10,7 -j,4,74+j | WB 45, 36, 50, 22, 14,2, 1 


24.15.4 Linear cross-correlation 


The linear cross-correlation of two real sequences f[n] and g[n] is another sequence, 
c[n] say, which we denote by f * g, which is defined as follows: 


Linear cross-correlation of f[n] and g[n]: 


CO 


cn]=fxg= >> flmigin—n] — forn=...—3,-2,-1,0,1,2,3... 


m=—OoO 
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Note the similarity between this definition and that of linear convolution defined in 
Section 24.15.1. In the formula for cross-correlation the sequence g is not folded. If 
the two sequences are finite and of length N, that is f[n] and g[n] are non-zero only 
when 0 < n < N — 1, there are 2N — | terms in the cross-correlation sequence and the 
formula can be written as follows: 


Linear cross-correlation of two finite sequences f[n] and g[n]: 
N-1 


cin] =fxg= > flmlgin—n] — forn=0,1,2,...,N-1 


m=n 
and 
N+n-1 


cin] = fxg= >> flmlgim—n) fori 01) = 2... 95 —(V — 1) 


m=0 


Example 24.31 Suppose f[n] = 7,2, —3 and g[n] = 1,9, —1. Assume both sequences f and g start at 
n=0. 
(a) Find the linear cross-correlation c[n] = f « g using the formulae above. 


(b) Develop a graphical interpretation of this process. 


Solution (a) Both f and g are finite sequences of length N = 3. Their cross-correlation is a 
sequence c[n], forn = —2, —1, 0, 1, 2, of length 5. 
Using the formulae above with n = —2 gives 


0 
c{—2] = 3 flmlgin + 2] 


m=0 


= flOle(21 
= (7)(-1) 
=-7 


When n = —1 we have 


1 
c{=-1] = 3 flmigin + 1) 


m=0 


= f[0lg{1] + fUlsi2] 
= (7)(9) + 2)(-1) 
= 61 


The remaining terms are calculated in a similar fashion. You should calculate one 
or two terms yourself to verify that the full sequence is 


c[n] = —7, 61, 28, —25, —3 n= -—2,—1,.0,1,2 


(b) The graphical interpretation is developed along the same lines as for linear convo- 
lution in Example 24.26. Figure 24.32 shows the sequence f[m], form = 0, 1, 2, 
denoted by the symbols o. Also shown is the sequence g[m] denoted by e. 
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glm |] 


n=0 
sum of products: 


(7)) + (2)9) + (-3)C-1l) =7 + 18 +3 = 28 


(a) 


n= 1 
sum of products: 


(2)(1) + (-3)9) = 2-27 = -25 


glm - 1] 


(b) 
n=2 
gl[m — 2] sum of products: 
(-3)) = -3 
(c) 
n=l 
gim + 1] sum of products: 


(7)9) + (2)(-1) = 63 —2= 61 


(d) 7 
n= - 
gl + 2] sum of products: 
(7)(-1) = -7 


(e) 


Figure 24.32 
The effect of translating g[m]. 


The graph of g[m] can be translated n units to the left or right by changing the 
argument of g from g[m] to g[m — n]. If n is positive the graph moves to the right. 
Study the figure to observe this. Correlation is the sum of products of f[m] and 
gl[m — n]. For each value of n, the graph of f[m] is superimposed. We are only 
interested in values of n for which the graphs overlap — otherwise each product is 
zero. For each value of n the superimposed graphs make it easy to see which values 
must be multiplied together and added. Placing these results in order confirms the 
result obtained in part (a), that is the correlation is —7, 61, 28, —25, —3. 


When a sequence is cross-correlated with itself the process is known as autocorrelation. 
Autocorrelation is used to search for possible periodicities in signals, because if a signal 
is periodic with period N its autocorrelation sequence will show peaks at intervals of N. 


EXERCISES 24.15.4 
1 Find the linear cross-correlation of the sequences 2 Show that the linear cross-correlation of f[n] and g[n] 
S{n] = 4,5, 9 and g[n] = 3, 1, 1. can be written in the alternative form 


f= Ogg flim + negli). 


3 There are variants of the definition of correlation. 


1 
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correlation of the sequences in Question | is 


Show that if f * gis redefined to be 27, 24, 26,9, 4. 
oe, oo flim — n]g[m] then the corresponding 4 Find the linear autocorrelation of the sequence 
fin] = 3, 2, 1. 
Solutions 
4,9, 26, 24, 27 | BM 3,8,14,8,3 
24.15.5 Circular cross-correlation 
The circular cross-correlation of two periodic sequences of period N is defined in a 
similar manner to their circular convolution. It is a sequence c[n] of length N. 
The circular cross-correlation of two periodic sequences, f[n] and g[n], each of 
period N, is defined as 
N-1 
c[n] =f @®g=)_ flmigim — n1 forn =0,1,2,...,N—1 
m=0 
When a sequence is cross-correlated with itself the process is known as autocorrelation. 
Example 24.32 (a) Find the circular autocorrelation of the sequence f[n] = 3, 2, 1 using the formula. 


Solution 


(b) Develop a graphical method for performing this calculation. 


(a) Here N = 3. From the definition 


2 
cin] =f@f= >> flmiflm—n] — forn=0,1,2 


m=0 


2 
c[0] = )> fm) flm| 


m=0 
= (3)(3) + (2)(2) + (1)(1) 


2 
cf] = )0 flim) fim — 1] 


m=0 
= flO) f[-1] + FLL F10] + f21f11) 
= (3)(1) + (2)(3) + (1) (2) 
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24.15.6 


Example 24.33 


Solution 


n=0 n= 
1 


@ HY ( 


Sums of products: 


(3) + (Q2)+ Md) = 4 B)C) + (YB) + ODZ=HN BZ) + (A) + WG) = 1 


Figure 24.33 
Graphical calculation of the autocorrelation of f[n] = 3, 2, 1. 


n= 2 
3 


2 
c[2] = )) fll flim — 2] 
m=0 
= (3)(2) + (2)(1) + (1) (3) 


Hence c[n] = 14, 11, 11. 

The graphical method involves listing the sequence f[m], m = 0, 1, 2, around an 
inner circle. Around an outer circle we list it again. This method is identical to that 
used in Example 24.27 for circular convolution, but because now there is no fold- 


ing, the sequence on the outer circle is not reversed. The calculation can be seen in 
Figure 24.33. 


(b 


ee 


(Circular) correlation theorem 


For real sequences f[n] and g[n] the correlation theorem states: 


D{f ® g} = FIKIGIK] 
where G[k] denotes the complex conjugate of G[k]. 


This provides a technique for calculating a correlation using the d.f.t. 


Find the circular correlation f @® g when f[n] = 8, —9, 3, 2 and g[n] = 11,4, —1, —5. 


Either directly from the definition of the d.f.t., or by using a computer package, we can 
show that 


F[k) =4,5+11j,18,5-11j  G[k] =9, 12 —9j, 11,12+9j 


The conjugate of G[k] is G[k] = 9, 12 + 9j, 11, 12 — 9j. Then 


F[k]G[k] = 36, —39 + 177j, 198, —39 — 177] 


Example 24.34 


Solution 
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Either directly, or using a computer package, taking the inverse d.f.t. yields the sequence 
39, —129, 78, 48 


which is the required circular correlation. You may like to verify this by directly calcu- 
lating f ® g. 


It follows from the theorem that the circular autocorrelation sequence has the following 
property: 


D{f @® f} = FIAIFIK] 
Use the d.f.t. to find the circular autocorrelation of the sequence f[n] = 8, 3, —1, 2. 


It is straightforward but tedious to show that the d.f.t. of f[n] is 
F[k] = 12,9 —j,2,9+j 

Then, the conjugate of F'[k] is F[k] = 12,9 +j,2,9 —j, and 
F[k|F[k] = 144, 82, 4, 82 


Finally, taking the inverse d.f.t. gives the sequence 78, 35, —4, 35 which you can verify 
is the circular autocorrelation of f[n] = 8,3, —1, 2. 


Engineering application 24.5 


Use of correlation in radar 


Both sonar and radar operate by transmitting a signal which bounces off a distant 
target and returns to a receiver. The main difference is that sonar uses sound waves 
whereas radar uses radio waves. The time taken for the signal to return to the receiver 
having reached the target can be used to deduce, d, the distance between the trans- 
mitter and the target. In both radar and sonar, the received signal is typically very 
much smaller than the transmitted signal and can contain a lot of unwanted noise. 
High-gain electronic amplifiers may be required to make the signal large enough to 
be processed and these introduce even more noise. In addition the receiver experi- 
ences interference from other sources which has to be separated from the returning 
signal. Correlation techniques are very useful for retrieving a signal from the noise 
and the interference. 

We will consider a radar system. The signals transmitted are usually modulated 
using a high-frequency carrier signal (see Engineering application 24.1). For our pur- 
poses the presence of a carrier signal is not important. 

However, the high-frequency component of the signal can be removed 
electronically prior to the signals being sampled by a process called 
demodulation. 
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The received and demodulated signal is represented by f[n] and the transmitted 
signal prior to modulation is represented by g[n]. Consider the case where g[n] is a 
square pulse comprising 10 samples. Let the time between each sample be T. Both 
signals are shown plotted in Figure 24.34. Note that this is an idealized case in which 
there is no noise on the received signal. 


20 40 60 80 100 120 140 


Figure 24.34 
Plot of transmitted and received digital signals for a radar system. 


The returned signal is delayed due to the time taken for it to travel to a distant 
target and back. Examining Figure 24.34 we note that the returned signal is delayed 
by 65 samples when compared with the transmitted signal. This equates to a time 
delay of 657. We know the speed that the transmitted pulse travels is c, the speed of 
light. Using 


distance = time x velocity 


gives total distance travelled = 657 x c. But the total distance travelled is twice the 


65Tc 
distance between the transmitter and the target, that is 2d. Hence d = a 


We now confirm the number of samples delay by calculating the circular cross- 
correlation of f[n] and g[n]. We wish to calculate c[n] = f @ g and plot this on a 
graph for analysis. This can be done either directly by calculating the circular cross- 
correlation or by using the circular correlation theorem. Owing to the large number 
of calculations involved they are not presented here. Normally such a process would 
be carried out using a computer. 

The result is a sequence of samples which are plotted in Figure 24.35. The largest 
value of c[n] occurs at 65, which corresponds to the delay in samples between the 
two signals. It agrees with our initial observation where the difference was quite 
straightforward to see by inspecting the two signals. 
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c[n] 
2.0 


ils) 


1.0 


0.5 


20 40 60 80 100 120 140 


Figure 24.35 
Cross-correlation function c[m] for the signals f[n] and g[n]. 


We now consider a more realistic case in which there is random noise on the re- 
turned signal and carry out the same calculation that we did previously. Figure 24.36 
shows the two signals. Although the return signal is still present it is no longer pos- 
sible to accurately determine where it actually starts and finishes by just looking at 
the graph. 


1.0 


0.8 


0.6 BS gin] 


0.4 f[n] 


0.2 iil 


ln elite a 


20 40 60 80 100 120 140 


n 


Figure 24.36 
Plot of transmitted and received digital signals for a radar system in which noise is present on 
the returned signal. 
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Carrying out the circular cross-correlation on these signals gives the result shown 


in Figure 24.37. The highest peak still occurs at 65 time intervals, allowing the dis- 
tance of the remote object to be accurately calculated. 


c[n] 
2.0 
115) AIHIIT 
1.0 IW ; 
0.5 
i | if mt | rf : | 
20 40 60 80 100 120 
Figure 24.37 


Cross-correlation function c[m] for the signals f[n] and g[n] for the case where f[n] contains 
additive noise. 


The cross-correlation function allows the distance of the remote object to be calcu- 
lated even when there is a large quantity of noise on the returned signal. It is therefore 


a very useful signal processing operation for radar systems. Similar considerations 
apply when working with sonar signals. 


EXERCISES 24.15.6 


1 Find the circular cross-correlation of f[n] = 2,3, —1 
and g[n] = 8, 7, 1. Verify the correlation theorem. 


3 There are variants of the definition of correlation 
Show that if f ® g is redefined to be 
ah : — fm — njg[m] then the corresponding 
2 From the definition, prove the correlation theorem. correlation theorem states D{f ® g} = F[AIGIK]. 
Solutions 


1 36, 19,9 
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REVIEW EXERCISES 24 


Find the Fourier transforms of 


1-? |t}<1 
may = ib otherwise 
_ jsint |t]h<7 
Ores fe otherwise 


1 O<t<tT 
0 otherwise 


(c) ft) = | 
e t¢>0 


@so=| at 


—e t <0 ails 


Find the Fourier integral representations of 


_ J 3t [th <2 
AP of EL | 0 otherwise 
0 t<0 
) f@)=16 O<t<2 
0 t>2 


(a) If F(w) = F{f(t)}, show that 
Ff! (t)} = joF (0). 

(b) If F(w) = F{f(t)}, show that 
Fif™ (t)}} = (jo)"F (w). These results enable us 
to calculate the Fourier transforms of derivatives 
of functions. 


If F(w) = F{f(t)}, show that 


1 
Ff (at)} = =*(8)- 


If F(@) is the Fourier transform of f(t) show that 


(a) FO) =f, fae. 


[e.e} 


1 
(b) fO)= mall F(w) do. 


(c) Show that if f(t) is an even function, 
F(w) = 2 f>° f(t) coset de. 


The convolution theorem given in Section 24.8.1 
represents convolution in the time domain. 
Convolution can also be performed in the frequency 
domain, in which case the equivalent convolution 
theorem is 


1 
FF OO} = 5 LF) * G@)] 


Prove the convolution theorem in this form. 


7 


10 


(a) Given that the Fourier transform of f(t) = e ltl 
2 
is ene use the t-w duality principle to find 
a? + w 
1 
+7" 
(b) From a table of transforms write down the 
Fourier transform of cos bt. 


the Fourier transform of a 
a 


(c) Use the convolution theorem obtained in 
Question 6 to find the Fourier transform of 


cos bt ee, 
—>—,, forb> 0. 
a? + 2 


(d) Using the result in part (c) evaluate the integral 
cos bt 


oe) 
—+—~ dt 
_. 


Find the d.f.t. of the sequence f[n] = {3, 3, 0, 3}. 
Verify Rayleigh’s theorem for this sequence. 


Find the linear convolution of the two finite sequences 
f{n] = 3, -1, -7 and g[n] = 4,0, 5. 


The signum function is defined to be 


1 ¢t>0 
sen(t)= 4-1 £ <0 
0 r=0 


This function can be represented as the exponential 
function e~“', if t > 0, and as —e“, if t < 0, in the 
limit as € > 0. 


(a) Show that 
o.e) 
F 2 
/ sen(t) eI dt = — 
—oo jo 
(b) Use the t-w duality principle to show that 
[oe 1 5 
—e dt = -j 
- ae e jsgn(@) 


(c) Use the second result in part (b), the convolution 
theorem and the integral properties of the delta 
function to show that 


1 
— *cos(at) = sin(at) 
Tt 


Show that f[n] @ gin] = f[n] ® g[—n] and hence 
deduce that a correlation can be expressed in terms of 
a convolution. 
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A(sin@ — 1 £% 6 
ll (a) —_ (b) — —[sin 2m + (cos 2w — 1)j] el dw 
(7) 27 Jog O 
2j sin w7t TM —ateo| 
Oa Boo 
‘ : T 
sin Wt (cos wt — 1) J 7 ,—alo+b| —a|w—b| 
() 2 (©) srle +e ] 
@ o a 
eee @) 
a? + @? ze 


By ria =9,3,-3,3 
MM 12, -4, -26.5, -0.5, -3.5 


1 [e.e} 


6 . 
| (a) — —5 (2j cos 2 — j sin 2w) el" dw 
270 Joo W 


t = | 
“7 x PAF 4 

a AY i 

iO » ay a 
j =. | 
ate: 
“hol eae SPN ss 
: eh a 
; | 
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INTRODUCTION 


In engineering there are many functions which depend upon more than one variable. For 
example, the voltage on a transmission line depends upon position along the transmission 
line as well as time. The height of liquid in a tank depends upon the flow rates into and 
out of the tank. We shall examine some more examples in this chapter. 

We have already discussed differentiation of functions of one variable. However, we 
also need to be able to differentiate functions of two or more variables. This is achieved 
by allowing one variable to change at a time, holding the others fixed. Differentiation 
under these conditions is called partial differentiation. 


FUNCTIONS OF MORE THAN ONE VARIABLE 


In Chapter 10 we saw how to differentiate a function y(x) with respect to x. Many stan- 
dard derivatives were listed and some techniques explained. Since y is a function of x 
we call y the dependent variable and x the independent variable. The function y depends 
upon the one variable x. Consider the following example. 

The area, A, of a circle depends only upon the radius, r, and is given by 


AG) =909" 
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Fs 
depend upon more than one variable. For example, the volume, V, of a cylinder depends 
upon the radius, r, and the height, h, and is given by 


ened AA ' 
The rate of change of area w.r.t. the radius is Poa 27r. In practice functions often 


V=nrh 


where V is the dependent variable; r and h are independent variables. V is a function of 
the two independent variables, r and h. We write V = V (7, h). 


Engineering application 25.1 


Electrical potential inside a cathode ray tube 


In recent years the cathode ray tube has declined in popularity as a screen display 
device. It is now rarely used in consumer electronics as flat screen televisions have 
become more popular. However, it is still sometimes used in the specialist electronics 
sector and instruments such as cathode ray oscilloscopes are still commonly in use 
in many laboratories. 

Within a cathode ray tube the electrical potential, V, will vary with spatial position 
and time. Given Cartesian coordinates x, y and z, we can write 


V=VQG,)y,Z,t) 


to show this dependence. Note that V is a function of four independent variables. 


Engineering application 25.2 


Power dissipated in a variable resistor 


The power, P, dissipated in a variable resistor depends upon the instantaneous voltage 
across the resistor, v, and the resistance, r. It is given by 


v2 


P= 
r 
Hence we may write P = P(v, r) to show this dependence. The power is a function 


of two independent variables. 


As another example of a function of more than one variable consider a three-dimensional 
surface as shown in Figure 25.1. The height, z, of the surface above the x—y plane de- 
pends upon the x and y coordinates, that is z = z(x, y). If we are given values of x and 
y, then z(x, y) can be evaluated. This value of z is the height of the surface above the 
point (x, y). We write, for example, z(3, —1) for the value of z evaluated when x = 3 
and y = —1. The dependent variable, z, is a function of the independent variables 
x and y. 

Some important features are shown in Figure 25.1. The value of z at a maximum 
point is greater than the values of z at nearby points. Point A is such a point. As you 
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Figure 25.1 
The height of the surface 
above the x—y plane is z. 


move away from A the value of z decreases. A minimum point is similarly defined. 
At a minimum point, the value of z is smaller than the z value at nearby points. This is 
illustrated by point B. Point C illustrates a saddle point. At a saddle point, z increases in 
one direction, axis D on the figure, but decreases in the direction of axis E. These axes 
are at right angles to each other. The term ‘saddle’ is descriptive as a horse saddle has 
a similar shape. Maximum points, minimum points and saddle points are considered in 
greater depth in Section 25.7. 


PARTIAL DERIVATIVES 


Consider 
Z= z(x,y) 


that is, z is a function of the independent variables x and y. We can differentiate z either 
w.r.t. x, or w.r.t. y. We need symbols to distinguish between these two cases. When find- 
ing the derivative w.r.t. x, the other independent variable, y, is held constant and only 
x changes. Similarly when differentiating w.r.t. y, the variable x is held constant. We 


OZ ‘ we 2 : 
write re to denote differentiation of z w.r.t. x for a constant y. It is called the first partial 
x 


aor — , <a : az 
derivative of z w.r.t. x. Similarly, the first partial derivative of z w.r.t. y is denoted re 
y 


) 
Referring to the surface z(x, y), * gives the rate of change of z moving only in the x 
x 


direction, and hence y is held fixed. 


If z = z(x, y), then the first partial derivatives of z are 
a 0 
oe 
Ox dy 


; ee Oz ; 
If we wish to evaluate a partial derivative, say —, at a particular point (x), yo), we 
aia ; Ox 
indicate this by 
az 


Ox tex 


Oz 
~ (Xp, Yo) or 
ox “ 


just as we did for functions of one variable. 
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0 a 
Example 25.1 Given z(x, y) = x*y + sinx + xcosy find mn and 
x y 


: az ; : . . 
Solution To find = we differentiate z w.r.t. x, treating y as a constant. Note that since y is a 
x 
constant then so is cos y. 


dz 
— = 2xy+cosx+ cosy 
Ox 


0 
In finding - x and hence x” and sin x are held fixed, thus 
y 


Oz > : 
— =x —xsiny 
dy 


Example 25.2 Given z(x, y) = 3e* — 2e& +- xy? 
(a) find z(1, 1) 


3 a 
(b) find and = when x =y = 1. 
ox dy 


Solution (a) z(1,1) =3e! —2e! + 1 = 3.718 
) 
(b) % — 36% 4 2x3 
ox 
) 
ne 4309 
dy 
When x = y = 1, then 


j 

#8 296 a= 155 
Ox 

8 ne 43 = 2.437 
dy 


At the point (1, 1, 3.718) on the surface defined by z(x, y), the height of the surface 
above the x—y plane is increasing in the x direction, and decreasing in the y direction. 
Note that we could also write 


8) _io1ss and S| = 2.437 


(1) 9Y |. 
or 


dz dz 
—(1, 1) = 10.155 and —(1, 1) = —2.437 
ox dy 


Both notations are in common use. 


Example 25.3 


Solution 


Example 25.4 


Solution 
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Engineering application 25.3 


Eddy current losses 


Eddy currents are circulating currents that arise in iron cores of electrical equipment 
as a result of an a.c. magnetic field. They lead to energy losses given by 


P= kf’ Borax 
where 
P, = eddy current losses (W per unit mass), 
By 
f = frequency of the magnetic field wave (Hz), 


ax = Maximum value of the magnetic field wave (T), 


k, = aconstant that depends upon factors such as the lamination thickness of 


the iron core. 


oP. oP. 
Calculate —£, and Z_, 
of OB max 
Solution 
dP. oP. 
© = 2k fB d £ = 2k f’B 
of il max an OB max ale max 


oV oV 
If V(x, y) = sin(xy), find — and —. 
Ox dy 


oV d 
To find Ox we treat y as a constant. Recalling that aon kx) = kcoskx we find 
x 


aV oV aV 
— = ycos(xy). To find — we treat x as a constant. Thus — = xcos(xy). 
Ox dy dy 


Find the first partial derivatives of z where 


(a) (x, y) = yre* 
(b) z(x,y) = x’ sin(xy) 


Oz ; fie 
(a) To find — we must treat y as a constant. However, the differentiation of the factor 
x 


xe* will require use of the product rule. We find 


dz Q, y 
7 be ) 
= y((1)e* + xe") 


=ye(x+1) 
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a 
To find 7" the variable x is held constant and so 
y 


Oz 

ay = 

(b) Observe the product term in the variable x which means we shall need to use the 
product rule. We find 


xe 


0 
~ = 2x sin(xy) + x (y cos(xy)) = 2x sin(xy) + xy cos(xy) 
x 


0 
To find 7 we treat x as a constant and find 
y 


0 
>. = x’(xcos (xy)) = * cos(xy) 
y 


EXERCISES 25.3 
1 Find the first partial derivatives of () y=telt+x (dd) y= 3x2e% 4+ Pe 
=: — p2xt+3 
(a) z= 2x4+3y (b) z=x-y+x (e) y=e™ (f) y= et 
(C2 = e+ y? +3 (d) z=xy-1 5 Find the first partial derivatives of z where 
2 2 
(e)z= xy +xy (a) z=Ve+y (b) z= 
(f) z= x + 2x*y — Axy? + 2y7 2x + 3y 
2 Find the first partial derivatives of (c) z=sin 7 (d) z=e” 
x y 
ae ONES A) () <=In@x—3y) () z= In@y) 
y? 1 y 
(c) z= 44+2 (dq) z= — (g) z= xInQ@y) (h) z= xIn{ — 
x xy x 
a? A/D 
(ec) z= — - (f) z= xy —3(x+y) 6 Evaluate the first partial derivatives of f at x = 1, 
y x 
y=2. 
. . ue c+ 
3 Find the first partial derivatives of (a) f= 3x2y—2xy (by) f= y 


(a) z= 2sinx+3cosy (b) z= xsiny 


2 (c) f =2sin(3x +4 2y) (d) f=2e% 


sinx (d) z=xysiny a 
() f=yinxt+In@y) @) f=J/3x+y 


(c) z=xtany—y 


(e) z=sinx+y) (f) z= 4cos(4x — 6y) 
siny 7 Given 
(g) z= i. 


,h) = 27°h— Sth 
4 Find the first partial derivatives of y. Note that y is a FO") a na 


function of x and t. evaluate the first partial derivatives of f when r = 2, 
(a) y=te’ (b) y=xe? h=1. 
Solutions 
a) az d) y, 
Boa ~=2,2=3 Os 
ox y 


(b) 14+2x,—1 (ec) 2xy +y?, x7 + 2xy 


(c) 2x, 2y (f) 3x2 + 4xy — 4y2, 2x2 — 8xy + 6? 


(b) 


(d) 


(b) 


(d) 


(g) 


(b) 
(c) 
(d) 
(e) 
(f) 


0z 

— =2cosx, — = —3siny 
Ox 

sin y, xcos y 


tan y — y cos x, x sec? y — 2ysinx 
ysiny, xsiny + xy cosy 
cos(x + y), cos(x + y) 
— 16 sin(4x — 6y), 24 sin(4x — 6y) 
siny cosy 
x27 x 
0 0 
sd = te’, as — e* 
Ox ot 
Qxe*, —x2 et 
l,e’(1 +1) 
6xe™ — He, 6x7 e7 + 377 
te“, xe™ 
9: e2x+3t 3 e2x+3t 


(b) 


(c) 


(d) 
(e) 


(f) 


(g) 


(h) 


6 (a) 


(b) 
(c) 
(d) 
(e) 
(f) 

af 
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Oz x Oz y 
ax [x2 4 y2’ dy [242 
3y 3x 


(2x + 3y)?’ (2x + 3y)2 


1 x x x 
~ cos{ ~ }, —-5 cos{ — 
y y y Ba 


Bye, 3x02 
2 3 
2x—3y’ 2x—3y 


x 
1+ In(xy), = 
y 


(2) =, es 

z y 

P 

Ff 4.2) =8, 0,2) =1 
ox dy 

0.5, —0.25 

4.5234, 3.0156 

29,5562, 14.7781 

3, 0.5 

0.6708, 0.2236 


af 


7 aos 1) = 7.6464, — (2, 1) = 6.5858 
r 


oh 


HIGHER ORDER DERIVATIVES 


Just as functions of one variable have second and higher derivatives, so do functions of 


several variables. Consider 


Z= z(x,y) 


829 


0 0 
The first partial derivatives of z are . and = The second partial derivatives are found 


Xx 


dy 


by differentiating the first derivatives. We can differentiate first partial derivatives either 
w.t.t. x or w.r.t. y to obtain various second partial derivatives: 


0 a (a a? 
differentiating S wart. x produces — cg Pees 
Ox Ox 


a a (9a a? 
differentiating = w.r.t. y produces — Pe es ee 
Ox dy \ ox dyox 


Ox ax? 
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OZ d (dz a7z 
differentiating — w.r.t. x produces —{ — | = 
dy ax \ dy oxdy 
0 0 (90 a? 
differentiating = w.r.t. y produces — pes 
dy dy\dy} ay? 
2 a2z 
For most common functions, the mixed derivatives —— and —— are equal. 
dyox oxdy 


Example 25.5 Given 
zZ(x, y) = 3xy? — 2xy + sinx 


find all second partial derivatives of z. 


: 0 a 

Solution << = 3y? — 2y + cosx ce Oxy’ — 2x 
ox dy 
az a (az oa az a (az 1 
ee —_ pees >= — 0 J S=_ — = iX 
ax? dx \ Ax dy? ay \ ay ? 
a? a (a a? a (9 

z — 2 ue = Qy* 2 a = id = Oy" = 2. 
dxdy dx\dy dydx ody\ ax 
2 2 
Note that a = a 
axdy  dydax 


Example 25.6 Given 
H(t) =3° +r +e" 


’H 0H 
verify that —— = —. 
Oxot  dtdx 


F 0H ; 0H 
Solution — =6x+te” — =2t+xe" 
ox ot 


YH a/dH\ a 

= = 6 t xt = xt t xt 
Otox 5 (se) ae Oe ewe wan 
7H a (dH a 

= = 2) xt = xt x 
axdt = (=) a ee 


Third, fourth and higher derivatives are found in a similar way to finding second deriva- 
tives. The third derivatives are found by differentiating the second derivatives and 
so on. 


25.4 Higher order derivatives 
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Example 25.7 Find all third derivatives of f(r, s) = sin(2r) — 3r's?. 


: 0 a] 
Solution af 2cos(2r) — 12r°s? af —6r's 

or os 

a? a? 

= —4sin(2r) — 36r’s” ed = —6r* 

a2 

FL _aars 
drds 
er ae arf 
The third derivatives are and and these are found by differen- 


i ars” areas’ ards? ass” 
tiating the second derivatives. 


ay a (af 
ans ar\ ar 


af a (af 
aras  ar\ aras 
af a (af 
dras? ar \ ds2 


arf a (0 
as as\ as?) 


8 cos(2r) — 72rs* 


72r’s 


Note that the mixed derivatives can be calculated in a variety of ways. 


a2 f 
+) 


af a (af 
aras (+4 
af a (af 
ras? (3 


EXERCISES 25.4 


1. Calculate all second derivatives of v where 
vih,r) =rvh 
2 Find the second partial derivatives of f given 


@) f=axvyty (b) f=2xty?— 39° 


Cae @ 2 
(e) $a () f=4/xy 
We 3 
3 Find all second partial derivatives of 
(a) z=xe” (b) z= 2sin(xy) 
(c) z=xcos(2x + 3y) (d) z=ysin(4xy) 
(e) z=e*siny (f) z=e*-Y 


(g) g=e" 


af 
dros 


Find all second partial derivatives of 
(a) z= (3x—2y) (6) z= 2x F5y 
(c) zg=sing?+y2) (d) z=In@x+5y) 


1) SS 33y 


Find all third partial derivatives of z where 
oa 
LUX, SS a 
ame | 
Evaluate all second partial derivatives of f at x = 2, 
y=. 
(a) f=ye’  (b) f =sin(2x—y) 


© f= (2) 
x 
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Solutions 
1 ev Ss ee 
an? 4h3/2” dhdr — /h? Or? 
a? a a? 
2 (a) fe 2y, f 2X; f = 6) 
x2 axdy dy? 
(b) 24x2y3 — 18xy°, 
24x37 — 45x2y4, 
12x4y — 60x3y3 
(ce) —x Fy? 4x“ 1/2y, 84/% 
2 2x 207+1) 
@ -,-5,>7— 
yy y 
(e) 18xy7!/2, —3x?2y3/2, qey 2 
(f) —x73/2yl/2, yp V/2y- 1/2, _ yl /2y-3/2 
a2 q2 Q2 
3 (a) 2 = ; oe = 26”, Ot = Axe” 
Ox? dxdy ay? 


(b) —2y? sin(xy), 


2 cos(xy) — 2xy sin(xy), 
—2x2 sin(xy) 


(c) —4sin(2x + 3y) — 4xcos(2x + 3y), 


—3 sin(2x + 3y) — 6xcos(2x + 3y), 
—9x cos(2x + 3y) 


(d) —16y° sin(4xy), 


8ycos(4xy) — 16xy? sin(4xy), 
8x cos(4xy) — 16xy sin(4xy) 


(e) e*siny, e* cos y, —e* siny 
(f) 9 ety, 3 et), ebx-y 
(g) ye, e(1 +2y), 7 


a2z 18 
B (a) = 34208x—2y)'8, 
Ox 
Q2 
= = —2280(3x — 2y)!8, 
dxdy 


a2 
°% = 1520(3x — 2y)!8 


ay2 
(b) —(2x + 5y)77/?, 


5 
— 5x Sy oP, 


25 
=a (2x + 5y)—3/2 


(c) 2cos(x? + y?) — 4x? sin(x? + y?), 
—A4xy sin(x? + y), 
2cos(x? + y*) — 4y* sin(x? + y*) 


(d) 


(x + Sy)?’ 
10 
(2x + Sy)?’ 
25 
(2x + 5y)? 
18 12 8 
) - =. ; 
(3x — 2y) (3x — 2y)? (x — 2y) 
az a3z 2 
= 0, = : 
ax3 ax2dy (y + 1)? 
a3z om 4x arz 7 6x2 
axdy2 (y+1)3’ ay (vy +14 
af 
(a) paz 1) = 7.3891, 
a2 
ato, 1) = 7.3891, 
axdy 
a2 
a 1) =0 
ay 
(b) —0.5645, 0.2822, —0.1411 
(c) 0.25, 0, —1 


(eee 25 5 PARTIAL DIFFERENTIAL EQUATIONS 


Partial differential equations (p.d.e.s) occur in many areas of engineering. If a variable 
depends upon two or more independent variables, then it is likely this dependence can be 
described by a p.d.e. The independent variables are often time, t, and space coordinates 


X,Y, Z- 


One example is the wave equation. The displacement, u, of the wave depends upon 
time and position. Under certain assumptions, the displacement of a wave travelling in 


one direction satisfies 
2 2 
o-u ,0°U 


ae ax 


Example 25.8 


Solution 


Example 25.9 


Solution 
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where c is the speed of the wave. This p.d.e. is called the one-dimensional wave equation. 
Under prescribed conditions the subsequent displacement of the wave can be calculated 
as a function of position and time. 


Verify that 
u(x,t) = sin(x + 2r) 


is a solution of the one-dimensional wave equation 


a7u _ a-u 
are ax? 
The first partial derivatives are calculated. 
ou Ou 
— =cos(x + 2t) — = 2cos(x+ 2f) 
Ox ot 
, ul O7u 
The second derivatives, —~ and ——, are now found. 
ax? ar? 
a-u : a7u . 
a2 = — sin(x + 2rt) a = —4sin(x + 2) 
Now 
a7u ; ; a7u 
ap = —4sin(x + 2t) = 4[— sin(x + 2t)] = ae) 


Hence u(x, t) = sin(x + 2r) is a solution of the given wave equation. 


Another equally important p.d.e. is Laplace’s equation. This equation is used exten- 
sively in electrostatics. Under certain conditions the electrostatic potential in a region is 
described by a function $(x, y) which satisfies Laplace’s equation in two dimensions. 


ao ao 

ae Ay 
ax? Oy? 

This equation is so important that a whole area of applied mathematics, called potential 

theory, is devoted to the study of its solution. 


Verify that 
1 
satisfies the three-dimensional Laplace’s equation 
a? a a? 
ms ss =i = a - 


(X,Y, 2) = 


0 


0 
We begin by calculating the first partial derivative, a We are given 
x 


ie (2 +y 4 2)-12 
and so 


a0 


1 
re = — 5 Lag +4 2) 7/7 (2x) a —x(x" 4a? + 27)73/? 
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a? 
We use the product rule to find — 
ax? 
ke ie) 2 
ao = ltt AI a(—$ 2 +9 + Aye 


= OF ev Ayr FH ay $27) 4390] 


By a similar analysis we have 


ao 2 = 
ay @+y~t7yPP-@+y +2) +3y'] 
vo 21 \2 4 (2\-5/2 Bp Oe ed 2 
rr as ey Pe) ae ey ee ae | 
So 
a a? a? 
aan OO ey 4 A322 $y $2) $32 43? +32) 
ax dy? sz? 
=0 
1 
Hence ¢ = ———————— is a solution of the three-dimensional Laplace’s equation. 


Nee oP al ee 


The transmission equation is another important p.d.e. The potential, u, in a transmission 
cable with leakage satisfies a p.d.e. of the form 
07u _ 07u B ou C 
se ae oe 
where A, B and C are constants relating to the physical properties of the cable. 
The analytical and numerical solution of p.d.e.s is an important topic in engineering. 
Coverage is beyond the scope of this book. 


EXERCISES 25.5 


BM Verity that is a solution of the equation 


a7u mn ou 2 ou 7 
x— —=Tu 
*Y axay ox yay 


u(x, y) = rag + xy 


is a solution of the p.d.e. 
4 Verify that 


ou du 
— 2. Sy * 
a u(x, y) = ay + — 
y 
2 Verify that is a solution of 
2 2: 
@ = sin(xy) Oru o-u ie 


—~ + 2x——_ = 
ay? axdy 
satisfies the equation 


5 Verify that 
ro Io 2,2 / 
ae ge Te (x, y) = xsiny +e" cosy 
satisfies 
3 Verify that 2 2 
op _ 


3 3 sat = —xsiny 
u(x, y) =x yt+xy ax2 ay2 ’ 


6 Given 


g=yrty 


(a) Show 
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(b) Verify that ¢ is a solution of 


6 | 6 


pd AID 
axe Pa ial +y) 


rp , 
qe ea) a 


25.6.1 


TAYLOR POLYNOMIALS AND TAYLOR SERIES 
IN TWO VARIABLES 


In Chapter 18 we introduced Taylor polynomials and Taylor series for functions of a 
single variable. We now extend this to include functions of two variables. Recall the 
main idea behind Taylor polynomials and series for a function of one variable. Knowing 
the values of a function, f(x), and its derivatives at x = a we can write down the Taylor 
polynomial generated by f about x = a. This polynomial approximates to the function 
jf. The values of the Taylor polynomial and the function are usually in close agreement 
for values of x near to x = a. To put it another way, knowing the value of f and its 
derivatives at x = a allows us to estimate the value of f near to x = a. 

The same idea holds when f is a function of two variables, x and y. If we know the 
value of f and its partial derivatives at a point x = a, y = b, then the Taylor polynomials 
allow us to estimate f at points near to (a, b). 


First-order Taylor polynomial in two variables 


Suppose f is a function of two independent variables, x and y, and that the values of 


af 


0 
ve on and “ are known at the point x = a, y = b, that is we know 
y 


0 0 
f(a, b) ZL Gy, b) at (a, b) 
Ox dy 
The first-order Taylor polynomial, p, (x, y), generated by f about (a, b) is given by 
af di 


a 
p,@, y) = f(a, b) + @— a): Dy —b) 0,0) 
Be oy 


We note the following properties of a first-order Taylor polynomial: 


(1) The values of the Taylor polynomial and the function are identical at the point (a, b). 
(2 


wm 


The values of the first partial derivatives of the Taylor polynomial and the function 
are identical at the point (a, b). 


(3 


wm 


The highest derivative needed to calculate the Taylor polynomial is the first deriva- 
tive. 


(4 


wm 


The first-order Taylor polynomial contains only linear terms; that is, there are no 
powers of x or y higher than 1. 


The first-order Taylor polynomial represents a plane which is tangent to the surface 
FS (, y) at (a, b). 
We can use p, (x, y) to estimate the value of f near to (a, b). 
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Example 25.10 A function, f, is such that 


fG6,1)=2 53,1) =-1 8F 31) =4 
x dy 


(a) State the first-order Taylor polynomial generated by f about (3, 1). 
(b) Estimate the values of f(3.5, 1.2) and f(3.2, 0.7). 
(c) Verify that the Taylor polynomial and the function have identical values at (3, 1). 


(d) Verify that the first partial derivatives of the Taylor polynomial and the function are 
identical at (3, 1). 


Solution (a) In this example a = 3 and b = 1. Hence 


i) a 
Pi%y) = f3,D+a- oe, Dey = ee 1) 
x dy 


=2+@-3)-)D+0-14 
=4y-x+1 
The first-order Taylor polynomial generated by f about (3, 1) is 
P\@y) =4y-x+1 
(b) We use p, (x, y) to estimate f(3.5, 1.2) and f(3.2, 0.7). 
p, (3.5, 1.2) = 4.2) —3.5+1= 2.3 
p,@.2, 0.7) = 4(0.7) — 3.2+1=0.6 


Hence 2.3 is an estimate of (3.5, 1.2) and 0.6 is an estimate of f(3.2, 0.7). 
(c) We are given f(3, 1) = 2. Also 
p,3,1) =40)-34+1=2 
and so f(3, 1) = p,@, 1). 


a 7] 
(d) We are given a0, 1) = —1 and 0, 1) = 4. Now 
x y 


Pix, y) =4y—-x4+1 


and so 
Op, =| OP —4 
ox dy 

Hence 
) r) ) a 
eps ent “Gia e nes 
ox Ox dy dy 


Example 25.11 A function, f(x, y), is defined by 
fay=rt+xy-y 
(a) State the first-order Taylor polynomial generated by f about (1, 2). 


(b) Verify that the Taylor polynomial in (a) and the function f have identical values at 
C, 2). 
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(c) Verify that the first partial derivatives of the Taylor polynomial and f have identical 
values at (1, 2). 


(d 


wm 


Estimate f(1.1, 1.9) using the Taylor polynomial. Compare this with the true value. 


Solution (a) Weare given f = x* + xy — y’ and so 
af af 


—~=2 Sx Sy" 
ox ey dy oe 
Evaluating these at the point (1, 2) gives 
0 0 
fC, 2) =—-5 Leer wel 2) = 
Ox 
Putting a = | and b = 2 in the formula for p, (x, y) we are able to write down the 
Taylor polynomial: 
0 0 
Pi (x,y) = fl, 2) + @&— nea, 2)+O- ned, 2) 


= 5+ (x—1)4+4 (y—2)(-1) 
= 4x— lly +13 


The first-order Taylor polynomial is p, (x, y) = 4x — Ily + 13. 


(b) We can evaluate the Taylor polynomial and the function at (1, 2). 
p, (1,2) =4—22+13=—-5 fd,2)=- 
Hence p,(1,2) = f(1, 2); that is, the Taylor polynomial and the function have 
identical values at (1, 2). 
(c) The first partial derivatives of p, (x, y) are found. 
7] C) ] 0 
ZED ed Fl iy Faayea4 Fara 
Ox dy Ox dy 
Hence the first partial derivatives of p, and f are identical at (1, 2). 
(d) p, (1.1, 1.9) = 41.1) — 1101.9) + 13 = —3.5 


f (1.1, 1.9) = (1.1)? + (1.1)(1.9) — (1.9)3 = —3.559 


The values of the Taylor polynomial and the function are in close agreement near to 
the point (1, 2). 


25.6.2 Second-order Taylor polynomial in two variables 


Given a function, f, and its first and second partial derivatives at (a, b) we can write 
down the second-order Taylor polynomial, p, (x, y). 


0 0 
Me p= eb Le aha haem Dona b) 


at 


+3 (@- ays ae b2G ao Dien ae b)+(y— Die a ») 
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Example 25.12 


Solution 


We note the following properties of the second-order Taylor polynomial: 


(1) The values of the Taylor polynomial and the function are identical at (a, b). 


(2) The values of the first partial derivatives of the Taylor polynomial and the function 
are identical at (a, b). 

(3) The values of the second partial derivatives of the Taylor polynomial and the func- 
tion are identical at (a, b). 


(4) The second-order Taylor polynomial contains quadratic terms, that is terms involv- 
ing x’, y” and xy. 


A function, f, and its first and second partial derivatives are evaluated at (2, —1). The 
values are 
0 a? a? a 
f=3 a = 3 fy Pag 2s 
ax dy Ox? axdy dy? 
(a) State the second-order Taylor polynomial generated by f about (2, —1). 
(b) Estimate f(1.8, —0.9). 


(c) Verify that the Taylor polynomial and f have identical values at (2, —1). 


(d) Verify that the first partial derivatives of the Taylor polynomial and f are identical 
at (2, —1). 

(e) Verify that the second partial derivatives of the Taylor polynomial and f are identical 
at (2, —1). 


(a) We put a = 2 and b = —1 in the formula for p, (x, y): 


a a 
re n=f+e-DA +042 
Xx dy 


1 Pf as ef 
+3(c 2) 59 t2e- DOF DES +O4D ) 


where f and its derivatives are evaluated at (2, —1). So 


P2%y) = 3+ @-2)44+ WF 1)(-3) + s{e— 2) + 2(@- 2) 1)2 


pet type Ee ade ty 
Sy x. Foes + ena 

ae ee ae 
(b) The value of p, (1.8, —0.9) is an estimate of f(1.8, —0.9): 

(1.8)? (—0.9)? 21 
p (1.8, —0.9) = ; 5 +2(1.8)(—0.9)+4(1.8) — 8(-0.9)— > 
= 1.875 
2? (-1) 21 

(c) p,(2, -l)= 5 5 + 212)(=1) + 42) = 8(—1) = ca. 


f(2,-1) =3 
Hence p,(2, —1) = f(2, —1); that is, the Taylor polynomial and the function have 
identical values at (2, —1). 


(d) The first partial derivatives of p, are 


C) C) 
OPE 2 pl Gy tid Po _y+2x-8 
Ox dy 
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sO 
; 3 
ae ey ns) ee es ee) ee eee ee ee 
Ox dy 
Hence 
F J 3 aj 
Pre y= Fay 220, -1) = Fe, -1y 
Ox Ox dy dy 


that is, the first partial derivatives of the Taylor polynomial and the function have 
identical values at (2, —1). 


(e) The second partial derivatives of p, are found: 


a" p, Esc a’ p, 29 a” py = 


Ox? axdy dy? 


These values are identical to the second partial derivatives of f at (2, —1). 


Example 25.13 A function f is given by 
fay=Hrtxry+y 


(a) State the second-order Taylor polynomial generated by f about (1, 1). 
(b) Use the polynomial to estimate f(1.2, 0.9). Compare this value with the true value. 


(c) Verify that the second partial derivatives of the function and the Taylor polynomial 
are identical at (1, 1). 


Solution (a) Here a = 1 and b= 1. Weare given that f = x? + x’y + y* and so 


a] 0 a? 

BF _ a2 + oxy OE ae OT heady 
ax dy Ox? 

a? a? 

OF =2x ana = 12y* 

dxdy dy? 


Evaluation of f and its derivatives at (1, 1) yields 


Wee Was Via a. Se 


=3 = = —= = = 12 
f ax dy ax? axdy dy? 
The second-order Taylor polynomial is p,(x, y): 
0 0 
pity) = f+ @- A+ o- NZ 
ax oy 
1 af af af 
if 2-1 1 1)? 
+ AG ioe Pe D0-1) FO= 0 =) 


=345@- +501) + 58-1? +4—DO- N+ 29-0) 
= 4x? + 6y? + 2xy — 5x —9y +5 
(b) The value of p,(1.2, 0.9) is an estimate of f(1.2, 0.9). 
p,(1.2, 0.9) = 4(1.2)? + 6(0.9)? + 2(1.2)(0.9) — 5(1.2) — 9(0.9) +. 5 = 3.68 
The actual value is 


f(1.2, 0.9) = (1.2)? + (1.2)°(0.9) + (0.9)* = 3.6801 
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(c) The partial derivatives of p,(x, y) are found. 


0 
— = 8x+2y—5 oe p49 
x dy 


a 
ap, = ap, _ 0° Ps B48 
Ox? Oxdy dy? 


The second partial derivatives of p, are identical to the second partial derivatives of 
f evaluated at (1, 1). 


25.6.3 Taylor series in two variables 


The third-order Taylor polynomial involves all the third-order partial derivatives of f, 
the fourth-order Taylor polynomial involves all the fourth-order partial derivatives of f 
and so on. Taylor polynomials approximate more and more closely to the generating 
function as more and more terms are included. As more and more terms are included, 
we obtain an infinite series known as a Taylor series in two variables. The general form 
of this series is beyond the scope of this book. 


EXERCISES 25.6 
1 Usea first-order Taylor polynomial to estimate of Q4)=9 a’ f ce ees 
f(2.1, 3.2) given dy ax2 
of 2 2 
as Seas 9S 34) =3 oF 3,4) =0.5 
x axdy ay? 
af 
— (2,3) =3 
dy 


5 Use a second-order Taylor polynomial to estimate 
2 Use a first-order Taylor polynomial to estimate Be leven 
g(—1.1, 0.2) given 


g(—3,3) =1 98 _3, 3) =-1 


a 
A-in=e ei pes ox 
ox ag ag 
a —(-3,3)=4 —(-3,3)=3 
28 (21, 0y==t ay‘ ) axa & ) 
dy 
; . a°g ag 
3 Use a first-order Taylor polynomial to estimate axa (—3, 3) = —2 aye (—3,3) = 2 
h(—1.2, —0.7) given ¥ y 
h(—1.3, —0.6) = 4 6 Use a second-order Taylor polynomial to estimate 
ah A(0.1, 0.1) gi 
dos —1 Oe Daly GE 
ox j 
a 
ah h(0, 0) = 4 — (0,0) = —-1 
—(-13,-0.6) =1 0:0) ay 0 
dy 
dh ah 
Use a second-order Taylor polynomial to estimate ay (0,0) = —3 ae (0,0) = 2 
f(3.1, 4.2) given 
of a (0,0) = 2 ah 0)=-1 
[3,4 =1 ra am axdy 7 dy2 
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7 A function, f(x, y), is defined by 


LS) 


ry WwW 


(a) 


(b) 


(c) 


(d) 


(e) 


f@y=rytxy? 
Calculate the first-order Taylor polynomial 
generated by f about (1, 1). 
Calculate the second-order Taylor polynomial 
generated by f about (1, 1). 
Estimate f(1.2, 1.2) using the first-order Taylor 
polynomial. 
Estimate f(1.2, 1.2) using the second-order 
Taylor polynomial. 


Compare your answers in (c) and (d) with the 
true value of f(1.2, 1.2). 


A function g(x, y) is defined by 


(a) 
(b) 
(c) 


(d) 
(e) 


; x 
g(a, y) = xsiny + = 
y 

Calculate the first-order Taylor polynomial 


generated by g about (0, 1). 


Calculate the second-order Taylor polynomial 
generated by g about (0, 1). 


Estimate g(0.2, 0.9) using the polynomial in (a). 


Estimate g(0.2, 0.9) using the polynomial in (b). 


Compare your estimates with the exact value of 
9(0.2, 0.9). 


Solutions 


44 
5.6 
3.8 


1.465 


1.305 


3.625 


(a) 


4x +4y —6 


25.7 
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9 A function h(x, y) is defined by 


10 


10 


(a) 


(b) 


(c) 


(d) 


(e) 


h(x, y) =ey+xe 
Calculate the first-order Taylor polynomial 
generated by h about (0, 0). 


Calculate the second-order Taylor polynomial 
generated by h about (0, 0). 


Estimate h(0.2, 0.15) using the polynomial 
from (a). 
Estimate h(0.2, 0.15) using the polynomial 
from (b). 


Compare your answers in (c) and (d) with the 
exact value of h(0.2, 0.15). 


A function, f(x, y, z), is defined by 


(a) 


(b) 


(c) 


(b) 
(c) 
(a) 
(c) 
(e) 
(a) 
(d) 
(a) 
(c) 


f(x,y, 2) =? tayet ye? 
Write down the first-order Taylor polynomial 
generated by f about (0, 1, 2). 


Use the polynomial from (a) to estimate 
FO.1, 1.2, 1.9). 


Compare your answer in (b) with the exact value 
of f(0.1, 1.2, 1.9). 


3x2 + 3y? + Oxy — 8x — 8y +6 


3.6  (d) 4.08  (e) 4.1472 
1.8415x  (b)_ 2.3012x — 0.4597xy 
0.3683 (d) 0.3775 

0.3789 

y (b) x2 4+xy+y (ec) 0.15 
0.22 (e) 0.2297 
x+t4y+4z-8 (b) 46 

4.57 


MAXIMUM AND MINIMUM POINTS OF A FUNCTION 
OF TWO VARIABLES 


We saw in Chapter 12 that to find the turning points of y(x) we solve 


6) 
dx 
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2 
The sign of the second derivative, or is then used to distinguish between a maximum 


point and a minimum point. The analysis is very similar for a function of two variables. 
When considering a function of two variables, f(x, y), we often seek maximum points, 
minimum points and saddle points. Collectively these are known as stationary points. 
Figure 25.1 illustrates a maximum point at A, a minimum point at B and a saddle point 
at C. When leaving a maximum point the value of the function decreases; when leav- 
ing a minimum point the value of the function increases. When leaving a saddle point, 
the function increases in one direction, axis D on the figure, and decreases in the other 
direction, axis E on the figure. 
To locate stationary points we equate both first partial derivatives to zero, that is we 
solve 
af_y fo 
Ox oy 


Stationary points are located by solving 
0 0 
af =0 af =0 
Ox dy 


Example 25.14 Locate the stationary points of 


(a) f(x,y) = 2x7 —xy-Ty+y’ 
(b) f(x,y) =x — 6x+ 4xyt+-y’ 


Solution (a) The first partial derivatives are found: 


0 0 
f =4x-y f =—-x-—7+2y 
ox dy 
: ne OT af 
The stationary points are located by solving a 0 and — = 0 simultaneously, 
x , 
that is 
4x-—y=0 
—x+2y—-7=0 


Solving these equations yields x = 1, y = 4. Hence the function f(x, y) has one 
stationary point and it is located at (1, 4). 


(b 


we 


The first partial derivatives are found: 

0 0 

oF 55 54 Ay OF = art 2y 
ox oy 


The first partial derivatives are equated to zero: 


2x + 4y-6=0 
4x+ 2y=0 
Solving the equations simultaneously yields x = —1, y = 2. Thus the function has 


one stationary point located at (—1, 2). 
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Example 25.15 Locate the stationary points of 


Solution 


oo y? 
{GN se ta Fo 


The first partial derivatives are found: 


a a 
eee ee ees, 
Ox dy 


These derivatives are equated to zero: 


+6x+y=0 
x+y+6=0 


(25.1) 
(25.2) 


Equations (25.1) and (25.2) are solved simultaneously. From (25.2) y = —x — 6, and 
substituting this into Equation (25.1) yields x* + 5x — 6 = 0. Solving this quadratic 
equation gives x = 1, —6. When x = 1, y = —7, and when x = —6, y = 0. The function 
has stationary values at (1, —7) and (—6, 0). 


Equating the first partial derivatives to zero locates the stationary points, but does not 
identify them as maximum points, minimum points or saddle points. To distinguish be- 
tween these various points a test involving second partial derivatives must be made. 
Note that this is similar to locating and identifying turning points of a function of one 
variable. 

To identify the stationary points we consider the expression 


af af af ): 
Ox? dy? dxdy 


If the expression is negative at a stationary point, then that point is a saddle point. 
2 


If the expression is positive and in addition is positive at a stationary point, then 


x2 


that point is a minimum point. 
2 


If the expression is positive and in addition a is negative at a stationary point, then 
x 


that point is a maximum point. 
If the expression is zero then further tests are required. These are beyond the scope 
of the book. 


In summary: 


vf a Op \e 
mat - (3 f) < (i) Saddle point 
BE ON xoy 
a2 a? a2 2 a2 
s = = (74) >0 and at >0 Minimum point 
rf af \e oe 
— = (74) >0 and o <0 Maximum point 
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Example 25.16 Locate and identify the stationary values of 


f(x,y) =x +xyt+y 


Solution The first partial derivatives are found: 


) r) 
OY, = 5g OF ded 
x dy 


The first partial derivatives are equated to zero: 


2x+y=0 
xt+1l=0 


This yields x = —1, y = 2. Thus there is one stationary point, positioned at (—1, 2). 
The second partial derivatives are found: 


92 2 2 
Pe ey eG 


Ox axdy dy? 
Now 

a2 a2 a 2 

Ox? dy? axdy 


Since the expression is negative, we conclude that (—1, 2) is a saddle point. 


Example 25.17 Locate and identify the stationary points of 
f@y)=ay-xv -y 
Solution The first partial derivatives are found: 


a a 
a aaj sig OD ag Oy 
Ox oy 


0 0 
Solving of = 0, ws = Oyields x = 0, y = 0. 
ox dy 


The second partial derivatives are found: 


oF 2 ca =I arf _ 


Ox? axdy dy? 


The second derivative test to identify the stationary point is used. Now 


vf af vf 
Ox? dy? Oxdy 


2 
) = (-2)(-2)- Gy? =3 


af or (a 


2 
Since a2 < Oand dx? dy? +) > 0 then (0, 0) is a maximum point. 
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Example 25.18 Locate and identify the stationary points of 


3 y 
=—- —+2 
LOS Bae tae 
Solution The first partial derivatives are found: 
0 0 
f =-] f =y+2 
Ox dy 
The first partial derivatives are equated to zero: 
x-1=0 
yt+2=0 
These equations have two solutions: x = 1, y= —2 andx = —l, y= —2. 
In order to identify the nature of each stationary point, the second derivatives are 
found: 
a? a? a? 
Ox? Oxdy ay? 


Each stationary point is examined in turn. 


At (1, —2) 
rfarf (fy : 
Bx? dy? Dxdy = 2x(1) — 0° = 2x = 2 sincex = 1 
Q2 
af =2x=2 
ax? 
Since both expressions are positive the point (1, —2) is a minimum point. 
At (-—1, —2) 
Here 


af af af 

ax? day? axdy 
Since the expression is negative then (—1, —2) is a saddle point. 
Figure 25.2 illustrates a plot of the surface defined by f(x, y). 


2 
) = 2x = —2sincex = —1 


Figure 25.2 
The Ancsioy ‘ 5 
f&W= yet + 2y 
has a minimum at (1, —2) 
and a saddle point at 

(—1, —2). 
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EXERCISES 25.7 


Determine the position and nature of the stationary 
points of the following functions: 


(a) f(x,y) =x+y4xy 

(b+) fay) = +y? — 2y 

(c) fy) =x +xy-y 

@ fy =P +y —xy 

(e) f(x,y) =x? + 2y* — 3xy +x 

Determine the position and nature of the stationary 
points of the following functions: 

(a) z(x,y) = x24 y — 3xy + 2x 

(b) cx,y) =P +xy4+y 


3 
(©) cay) = 5 ae = 


Solutions 


(a) (—1, —1), saddle point 
(b) (0, 1), minimum 

(c) (1, —2), saddle point 
(d) (0,0), minimum 

(e) (4,3), saddle point 


(a) (3. $), saddle point 
(b) (0,0), saddle point; (i. -th); minimum 
(c) (0, 1), saddle point; (0, —1), 


maximum 


1 1 1 
(d) za&y)=-+--— 
xX yxy 
(e) c(t, y) = 4x°y — Oxy 
Locate and identify the stationary points of the 
following: 


(a) f(x,y) =xy-x? +y? 


(b) f(y) = state 


(c) f(x,y) = x4 + Loxy + 4 
(d) f(x,y) =y—y? — ex 


() fay) = . — xy 


(d) (1, 1), saddle point 

(e) (0,0), (1.5, 0), saddle points 
(a) (0,0), saddle point 

(b) (1, —1), saddle point 


(c) (0,0), saddle point; (2, —2), minimum; (—2, 2), 
minimum 


(d) (—1, 0.5), maximum 
(e) (—4, 0.4493), saddle point 
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Find all first partial derivatives of the following 
functions: 


(a) z(x,y) = Ox? + 2y2 

(b) z(x,y) = 3x78 

(c) z(x,y) = 4x3 — 5y° 

) za, y) =xy+2°y? 

(e) z(x,y) = sin(2xy) 

® 2@,y)=26" 

Evaluate the first partial derivatives of f atx = —1, 

y=2. 

(@) fG,y) = 3x3 —y +x°y? 
4x 

(b) f@,y) = ra 


(c) f(x,y) = sinx+2cosy 
@) fay) = Gy? +x 


(©) f,y) = 2+? 
x—y 


(f) fy) =2e%e” 

Find all second partial derivatives of z where 
(a) 2(x, y) = 3x4 — oy + xy? 

(b) cay) =? +y?)? 

(©) z(x,y) = 2-9 


2 
(d) z(x,y) = Lay 


(©) zy) =2./x+y 
(f) z(x,y) = (sin. x) (cos y) 


Find all first and second partial derivatives of 
f(x, y) = (ax + by)" 

where a, b and n are constants. 

Find the first partial derivatives of 
f(xy) = (ax? + by’ + exy)” 

where a, b, c and n are constants. 

Verify that 
z(x,y) = 3x+2y4+1 


is a solution of 


az OZ 
ax dy 
Verify that 


z(x,y) = 2 —x+y 


is a solution of 
dz Oz 


op te FD 
dx” By @+y) 


Verify that 
fay =r +y — dy 


is a solution of 
Cd) a 
af, af _ 
ax dy 


0 


Verify that 
z(x, y) = sinx + cosy 


is a solution of 


a7z 4 az a, 
a2 ape 
Solutions 
a 
ey fai dy 
Ox dy 


(b) 9x2, 18x39 


(d) y+ 2xy*,x + 2x*y 
(e) 2ycos(2xy), 2x cos(2xy) 


(f) 6ye3”, 6x e3? 


(a) 9F (1,2) =, 8F (1,9) =-8 
ox dy 
(b) 2,1 (c) 0.5403, —1.8186 


11 


12 


13 


14 
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Verify that 


z(x,y) = xye" 

is a solution of 
a2z i a2z az _ 
Ox2 dy? y axdy ~ 


x 


ye 
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(a) Write down the second-order Taylor polynomial 


generated by f(x, y) about x = 2, y = 3 given 


f(x,y) = xy — xy? 


(b) Estimate f(2.1, 2.9) using your polynomial from 


(a) and compare this with the exact answer. 
Write down the second-order Taylor polynomial 
generated by z(x, y) about x = 1, y= 1 given 


x+y 
x 


z(x,y) = 


Calculate the second-order Taylor polynomial 
generated by 


fay) =r t+y? 


about x = l,y = 0. 


Locate and identify all the stationary points of the 
following functions: 


(a) fay) =x? +y? —3y 

(b) f(x,y) = 4xy — xy 

(c) f(x,y) =x? + 2y? — 12x 

@ f@y)=xy-y-x 

Locate and identify the stationary points of 


fay==-2+y 


(d) 25, —12 (e) Mee 
> ieee dems 
oO” 9 
(f) 5.4366, 5.4366 
a2z 2 3 
(a) a = 36x + 6xy A 
a2 
<= 9x2y2, 
axdy 
a2z 2 3 


(b) 12x? + 4y?, 8xy, 4x? + 12y? 
(c) 32e4*-3Y, —24 et*—3Y, 18 ef—-3y 
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4 4 4 of 2 2 = 
d . : 5 — = (2ax+cy)n(ax* + by + cxy)"™," 
© G+ @+y) Oty) ax » 
3 a 
(e —3@ +y) be oe = (2by+ cx)n(ax + by + cxy)"! 
y 


—5(xty)3/, 
—5(x+y)3/? 


(f) —(sinx) (cosy), 


11. (a) 27x? — 36y? — 72xy + 108x + 276y — 432 
(b) p, (2.1, 2.9) = —26.97, 
f (2.1, 2.9) = —26.985 


—(cosx)(siny), 
—(sin.x)(cos y) eh Dy +? 
2 
a . y 
4 oF = an(ax+byy" t 13 5) +x 

af i 14 (a) (0, 1) minimum; (0, —1) saddle point 

_ n— 
nO (b) (0, 0), (4, 0) saddle points 
af eo incon yo-2 (c) (2,0) minimum; (—2, 0) saddle point 
ax2 Qt ox y > (d) (0, 0) saddle point; (& b) maximum 
af abn(n — 1) (ax + by)" 1 1 1 1 

= abn(n — x + by ; . 

oxdy 15 : : . ; saddle points 
a V2 2/2 a/2” 2x/2 


aa b’n(n — 1)(ax + by)" 
y 


t = | 
“7 x PAF 4 

a AY i 

on } 
iO » ay a 
j =. | 
ate: 
“hol eae SPN ss 
: eh a 
; | 
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INTRODUCTION 


This chapter draws together several threads from previous chapters. It builds upon dif- 
ferential and integral calculus, functions of several variables and the study of vectors. 
These topics together form a branch of engineering mathematics known as vector calcu- 
lus. Vector calculus is used to model a vast range of engineering phenomena including 
electrostatic charges, electromagnetic fields, air flow around aircraft, cars and other solid 
objects, fluid flow around ships and heat flow in nuclear reactors. The chapter starts by 
explaining what is meant by the operators grad, div and curl. These are used to carry out 
various differentiation operations on the fields. 


PARTIAL DIFFERENTIATION OF VECTORS 


Consider the vector field v = v,i+ vj + vk, where each component v,, v, and v, isa 
function of x, y and z. We can partially differentiate the vector w.r.t. x as follows: 


av OUee my Ove 
ax ax ax Ox 


This is a new vector with a magnitude and direction different from those of v. 
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Partial differentiation w.r.t. y and z is defined in a similar way, as are higher deriva- 
tives. For example, 


av dv,, Wu, dv, 


= *j <k 
ie oe oe ae 
Si . ie dv av dav a°v a°v 
Example 26.1 If v = 3x’yi + 2xyzj — 3x*y’k, find —, —, —. Further, find ag oe 
Ox dy Oz ax Ox0zZ 
Solution We find 
av 
— = 6xyi + 2yzj — 12x°y’k 
ox 
av 
— = 3x7 i+ 2xzj — 6x*yk 
ay 
av 
— = 2xyj 
Oz a 
a°v 
= 6yi — 36x’y’k 
a°v 
= yj 
axdz 
EXERCISES 26.2 
1 Given v = 2xi + 3yzj + 5xz*k find 4 lv =3xydt+ 0? —y4+2)j+ @w+y*)k find 
av 5 ov av dv dv dv dv a2v 7 av 
(a) Ox (b) ay (c) Bx’ By’ Oz" Baz” By” ri 
2 Iff = 2i— xyzj + 3x7ck find 5 Ifv=sin(xyz)i+ ze” j — 2xyk find 
af af af oy oy ay 
ees b) — = —-,-,-—. 
(a) ax (b) By (c) az ax’ dy’ Oz 
af g2f a2f 6 Ifv=xi+x’yj — 3x°k, and ¢ = xyz, find 
de a — F 
(@) Ox? (e) dy 6 az2 ov, Oe. oe 2 Deduce that 
ax ox Ox 
3 Given E = (x2 + y)it (1 —2jt (x + 22k find 9 av ag 
a =e 
dE dE dE Ox ox ox 
(a) Ox (b) Dy (c) Or 
~ y & 7 Ifv =In(xy)i+ 2xycos gj — x+yzk, find 
a2E 02E 02E dv dv dv d°v d2v av 
Oizy Oza OD SD al We? agg? and =>. 
ax dy dz dx dy dz Ox* dy Oz 
Solutions 
1 (a) 2i+57k (b) 3zj (c) —xyj + 327k (d) 6zk 
(c) 3yj + 10xzk (e) 0 (f) 0 


2 (a) —yz2j + 6xzk (b) —xzj 
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3 (a) 2xi+k (b) i 6 ova rydat yg — 3x4 yck 
c) —j+2k d) 2i 
o 7 SOY) _ oayel + 3x2y2xf — 122 yck 
(e) 0 (f) 0 ie vg ye 
dv op 
4 — =3yai+2xj+k = 
ax ye J ax YZ 
av : . a 
— = 3xzi — 2yj + 2yk Pe i+ 2xyj — 9x7k 
dy ax 
dv 
ove 4. F 9 . 
az ote mvt + 2y cos gj — 4x3 yzk 
> Ox Xx 
avv 2j 
— = 9 F 
dx Fe 4 Oxcos gj —x4ck 
a°v : dy y 
sa = —2j + 2k 
ay dv ee 
— = —2xysinzj — xyk 
av 5 Oz 
a = 4d 
az2 92 
es 12x? yzk 
dv ax2 x2 
ov : xyes 
5 ax Oe cos(xyz)i + yze?j — 2yk 92y i 
a jy yk 
SY = xzcos (xyz)i + xze™j — 2xk ay » 
2 a°v 
0 ee ee . 
> = xy cos(xyz)i + ej azz 2xy cos ZJ 
z 


| 26.3 | THE GRADIENT OF A SCALAR FIELD 


Given a scalar function of x, y, z 


b =o, y, 2) 
Ip dah 


we can differentiate it partially w.r.t. each of its independent variables to find — 


ag Ox’ oy 
and ae If we do this, the vector 
Zz 
dp, Ib, , Oh 
—i+ —j+ —k 
a aye 
turns out to be particularly important. We call this vector the gradient of ¢ and denote 
it by 


Vo or grad @ 


An alternative form of writing V@ is as three components 


(28 ap *#) 
ax’ ay’ az 
dp, Of, dd 


d¢g=V¢ = —i+ —j+ —k 
ore? e Ro Gy 


The process of forming a gradient applies only to a scalar field and the result is always 
a vector field. 
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It is often useful to write V@ in the form 


(3 0 a 
dx’ dy’ dz 


where the quantity in brackets is called a vector operator and is regarded as operating 
on the scalar @. Thus the vector operator, V, is given by 


00 0 
ax’ dy’ dz 


Example 26.2 If¢@=¢@(x,)y,z) = 4x*ysinz, find Vo. 


Solution (x,y,z) = 47 ysinz 
so that by partial differentiation we obtain 


a 


= 12x’ysinz 
Ox 
0 
a = 4x sinz 
dy 
0 
ae = 4x7 ycosz 
Oz 
Therefore 


Vo = 12x’ysin zi + 4x sin zj + 4x°y cos zk 


We often need to evaluate V@ at a particular point, say, for example, at (4, 2, 3). We 
write V¢| (4,3) to denote the value of V¢@ at the point (4, 2, 3). 


Example 26.3 If ¢ = xy + xy’ + 3y find 
(a) Vo 
(b) Volo.) 
(c) |V¢| at C1, 1, 1) 


Solution (a) If ¢é =x*y + xy’ + 3y then 
ag 
ax 
ap 
ay 
ag 
az 

so that 


=3ry+y 
=x +2xy4+3 


=0 


Vo = Bx’y+y)it (x + 2xy + 3)j + Ok 


Example 26.4 


Solution 
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(b) At (0, 0,0), V@ = Oi + 3j + Ok = 3j. 
(c) At(1,1,1), V@= Bx ?x14+1i+(34+2x 1x 143)j+0k = 41+ 6j + 0k 
so that |V@| at (1, 1, 1) is equal to V4? + 62 = V52. 


So far we have been given @ and have calculated Vé. Sometimes we will be given Vd 
and will need to find @. Consider Example 26.4. 


If F = V¢ find @ when F = (3x2 + y?)i+ (2xy + 5)j. 


Note that in this example F has only two components. Consequently V@ will have two 


components, that is F = Vd = a + ae j. Therefore 
x y 
a a 
Ge + y+ Qny + i= i+ Fj 


Equating the i components we have 


ae 


= 37 +y (26.1) 


Equating the j components we have 


a 
ae Qxy +5 (26.2) 
dy 


Integrating Equation (26.1) w.r.t. x and treating y as a constant we find 
=x try + fO) (26.3) 


where f(y) is an arbitrary function of y which plays the same role as the constant of 
integration does when there is only one independent variable. Note in particular that 


a a 
re f(y) = 0. Check by partial differentiation that - = 374+’. 
Xx xX 


Integrating Equation (26.2) w.r.t. y and treating x as a constant we find 
b = xy + 5y + g(x) (26.4) 


where g(x) is an arbitrary function of x. Note that * g(x) = 0. Check by partial differ- 
entiation that x = 2xy+5. Comparing both forms for @ given in Equations (26.3) and 
(26.4) we see that by choosing g(x) = x* and f(y) = Sy we have 

p=x tx’ +5y 


Check that F is indeed equal to Vd. Also check that by adding any constant to @ the 
same property holds, that is F is still equal to V@. 
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26.3.1 


Physical interpretation of V@ 


Suppose we think of the scalar field #(x, y, z) as describing the temperature throughout 
a region. This temperature will vary from point to point. At a particular point it can be 
shown that V@ is a vector pointing in the direction in which the rate of temperature in- 
crease is greatest. | V@| is the magnitude of the rate of increase in that direction. Similarly, 
the rate of temperature decrease is greatest in the direction of —V@. Analogous inter- 
pretations are possible for other scalar fields such as pressure and electrostatic potential. 


Engineering application 26.1 


Electrostatic potential 


Engineers working on the design of equipment such as cathode ray tubes and elec- 
trical valves, which are also commonly known by their generic term vacuum tubes, 
need to calculate the electrostatic potential that results from an accumulation of static 
charges at various points in a region of space. Complicated examples require the use 
of a computer. Let us consider a simple example. 
The electrostatic potential, V, in a region is given by 
gone ee 
G2 + y2 + 22)3/2 
Suppose a unit charge is located in the region at the point with coordinates (2, 1, 0). 
Find the direction at this point, in which the rate of decrease in potential is greatest. 


Solution 
The rate of decrease is greatest in the direction of —VV. 
We first calculate the first partial derivatives of V. Writing 


Veyetyt7y 


we find 
ONG = ae 2 2 2)—5/2 ers 2 2 2, —5/2 
ae ae \GiS Mer sr ir sez) (22) = Sean CO ssp e)) 
av & 3 e DL Dh sP Damo ee N= 31/5) 
a yor Py tz) Cy) Gy 4 z) 
= 3 +42) 74 e¢y4 23? 
oy = = x2 2 2-5/2 sien 2 2 Dy 5D 
pe aaa Mer ser see) (Cx) = =Byaer srr see) 
These partial derivatives can each be evaluated at the point (2, 1, 0). That is, 
aV aV aV 
— = —0.107 — = 0.036 — —\() 
9X |(2,1,0) Y 1(2,1,0) 92 |(2,1,0) 
and so 


VV 121,09) = —0.107i + 0.036j + 0k 
Finally, 
—VV | 00.1.0) = 0.107i — 0.036j — 0k 
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At the point (2, 1, 0) this vector points in the direction of greatest rate of decrease in 
potential. This is also the direction of the electrostatic force experienced by the unit 


charge at that point. 


EXERCISES 26.3 


If p = x2 — y* — 3xyz, (a) find V@, (b) evaluate V@ at 
the point (0, 0, 0). 


If 6 = xyz, find (a) V@, (b) Vo at (1, 2, 1), (c) |Vo| 
at (1, 2, 1). 


If v = V¢@, find @ when v = (2x — 4y*)i — 8xyj. 


Given @ = xyz find (a) V¢, (b) —V¢@, (c) Ve 
evaluated at (3, 0, —1). 


Find VV when 
@Var+y~4+2 


(b) V = zsin~! (") 
x 


(c) V= ertyt, 

An electrostatic potential is given by V = xe7V>. 
Find VV and deduce the direction in which the 
decrease of potential is greatest at the point with 


coordinates (1, 1, 1). 


Solutions 


(a) (2x — 3yz)i — (2y + 3xz)j — 3xyk 
(b) 0 

(a) 2xyzdi + x73 5 + 3x2 y27k 

(b) 4i+ j+ 6k 

(c) J53 

b=x — 4x? +c 


(a) yai + xzj + xyk 
(b) —yzi — xzj — xyk (c) —3j 
(a) 2xi+ 2yj + 2zk 

~zy , 
x2/1 — y2 /x2 
i 


jtsin! Nk 
x 1 — y?/x2 x 


(b) 


7 


10 


In the theory of fluid mechanics the scalar field ¢ is 
known as the velocity potential of a fluid flow. The 
fluid velocity vector, v, at a point can be found from 
the equation v = V¢@. For a particular type of flow 

od = Ux, where U is a constant. Show that the 
corresponding fluid motion is entirely parallel to the x 
axis, and at any point the fluid speed is U. Find a 
velocity potential for a similar flow which is entirely 
parallel to the y axis. 


Is —V@ the same as V(—@)? Explain your answer. 


Given ¢ = 3x7y + xz, 

(a) determine Vd 

(b) determine V(7?) 

(c) determine 7V@ 

(d) Is V(7@) the same as 7Vg? 


For any scalar field ¢ and any constant k, is V(k@) the 
same as kV? 


@ 2° —4j4+ 


~Vy 
—/% — 7 * 5, 0.368% + 0.184, 
e Vi j, —0.368i + 0.184j 
2 
@ =Uy 
yes 


(a) (6xy + z)i+ 3x2j + xk 


(b) (42xy + 7z)i + 21x7j + 7xk 
(c) (42xy + 7z)i + 21x7j + 7xk 
(d) yes 

yes 
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Example 26.5 


Solution 


THE DIVERGENCE OF A VECTOR FIELD 


Given a vector field v = v(x, y, z) let us consider what happens when we differentiate 
its individual components. If 


v=v,i+v,j + 0,k 
we can take each component in turn and differentiate it partially w.r.t. x, y and z, respec- 
tively; that is, we can evaluate 

dv, du, dv, 


ax dy Oz 


If we add the calculated quantities the result turns out to be a very useful scalar quantity 
known as the divergence of v, that is 


dv, 4 dv, ie dv, 
ax dy Oz 

This is usually abbreviated to div v. Alternatively, the notation V - v is often used. If we 
use the vector operator notation introduced in the previous section we have 


ad 0 a 
V-v=(—,—,—)-v 
ax dy Oz 


(9 9d 9 (i, 8) 
= ax" ay’ az xr Fyo Tz 


Interpreting the - as a scalar product we find 


divergence of v = 


. dv, 4, dv, i dv, 
ey = - — —_ 
ax dy Oz a 
as before, although this is not a scalar product in the usual sense because (>. By’ =) 
x dy Oz 


is a vector operator. We note that the process of finding the divergence is always per- 
formed on a vector field and the result is always a scalar field: 


dv dv dv 
diy = VW os —2 ot 2 
iv V Vv ax ar a = az 
If v= x°2i + 2y°2°j + xyz’k find div v. 


Partially differentiating the first component of v w.r.t. x we find 


v,. D 
—_- = 2x 
Ox a 
Similarly, 
dv, dv 
— = by? and — = 2xyz 
dy Oz 


Adding these results we find 
div v = V-v = 2xz 4 6y’2" + 2xyz 


26.4.1 


Example 26.6 


Solution 
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Physical interpretation of V - v 


If the vector field v represents a fluid velocity field, then, loosely speaking, the diver- 
gence of v evaluated at a point represents the rate at which fluid is flowing away from 
or towards that point. If fluid is flowing away from a point then either the fluid den- 
sity must be decreasing there or there must be some source providing a supply of new 
fluid. 

If the divergence of a flow is zero at all points then outflow from any point must be 
matched by an equal inflow to balance this. Such a vector field is said to be solenoidal. 


Show that the vector field 
v = xsinyi+ ysinxj — z(sinx + siny)k 


is solenoidal. 


We have 

v, = xsiny so that ~ = siny 
Also, 

. Uv, . 

v, = ysinx so that — = sinx 

Finally, 
: dv. 
v, = —z(sinx + siny) so that aa = —(sinx + siny) 
Zz 

Therefore, 


V-v=siny+sinx — (sinx + siny) =0 


and hence v is solenoidal. 


Engineering application 26.2 


Electric flux and Gauss’s law 


We saw in Engineering application 7.5 that electric charges produce an electric field, 
E, around them which can be visualized by drawing lines of force. Suppose we sur- 
round a region containing charges with a surface S. If a small portion of this surface, 
5S, is chosen we can draw the field lines which pass through this portion as shown in 
Figure 26.1. 

The flux of E through 6S is a measure of the number of lines of force passing 
through 6S. Gauss’s law states that the total flux out of any closed surface S is pro- 
portional to the total charge enclosed. It is possible to show that this law can be 
expressed mathematically as 


ess 
€0 
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where p is the charge density and €, is a constant called the permittivity of free 
space. Note that in a charge-free region, p = 0, and so V - E = 0. This means there 


is no net flux of E. 


Figure 26.1 

The flux of E through 6S is a 
measure of the number of lines 
of force passing through 4S. 


EXERCISES 26.4 


1. A vector field v is given by 


v= 3x°yi + 2yr gj +x2k 


Find 

(a) v,, Uys V, 

(b) ou uy dv, 
ox dy az 

(c) Vev 


2 A vector field F is defined by 


F=(tt+y)it 6? —Dj+ 042k 
(a) Find V-F. 
(b) Calculate V - F at the point (3, 2, —1). 
3 IfA = 3yzi+ 2xyj + xyzk find V- A. 


4 Find the divergence of each of the following vector 
fields: 


(a) v=xri+yj+2k 
(b) v =e i+ 2zsin(xy)j + zk 
(c) v=xyi — 2yzj +k 
(d) v=x7y*i— yj — xyck 
5 IfE=xi+ 2j —yek find 
(a) E-i 
(b) Ej 


10 


11 


(c) E-k 
(d) V-E 
A vector field is given by 
F = (3x? —z)it+ (x+y)j 
+ (x + 3yz)k 
Find V - F at the point (1, 2, 3). 


Given the scalar field @ = x2 + y? — 222, find V@ and 
show that V - (V@) = 0. 


For any vector field F is V - (—F) the same as 
—(V-F)? 


The vector field F is given by 
F=x7i+ Qxy? Zi +x2k 

(a) Find V-F. 

(b) State 4F. 

(c) Find V- (4F). 

(d) Is 4(V -F) the same as V - (4F)? 


For any vector field F and any scalar constant k is 
V + (kF) the same as kV - F? 


Give an example of a vector field 
. . dv, dv, 
v=v0,i+v,j + v,k such that — 40, — 40, 
: @ ox dy 


dv, 
— #0, but V-v=0. 
dz 


a] (a) 3x2y, 2y>z, xz3 
(b) 6xy, 6y?z, 3x27 


26.5 The curlofavector field 859 


| ie 


WD) Vo = 2x4 + 2yj — 4ck 


(c) 6xy + 6y2z + 3x2" 
8 yes 


BB ow 14+2y42 
By v-aq=2x+4y 
BB 2x4 2742 


(b) ye + 2xzcos(xy) +3 


(c)y — 2z 
(d) 2xy? — 2y — xy 
Bax w2 


(b) 3 


BY 2 + 4yyzt 3x27 
(b) 4x27i + 8xy2zj + 4xz3k 
(c) 422 + 16xyz + 12xz? 
(d) yes 


10 yes 


(c) —yz (d) l-y | v = 2xyi+ y*j — 4yzk for example 


THE CURL OF A VECTOR FIELD 


A third differential operator is known as curl. It is defined rather like a vector product. 


; : : : rn?) 
This determinant is evaluated in the usual way except that we must regard —, — and 


0 a 
a, as operators, not multipliers. Thus, for example, 
z 


Bx By a 
x dy means SS 
v, Yy ox dy 
Explicitly we have 


) dv a dv, a 
curly = (2 —**)i4 em j+(—- Ys \ik 
dy Oz Oz Ox Ox dy 


ox Oy 
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Example 26.7 If v = x’yzi — 2xyj + yzk find V x v. 


i jk 
Solution Vxv= a = = 
ox dy Oz 


wy, —2xy yz 


_ AE _ al _ ae _ Seen " a _ ee 
~ | ay Oz ax Oz ; Ox dy 


=at+xyj— Qy+x72)k 


Note that the curl operation is only performed on a vector field and the result is another 
vector field. 

A detailed discussion of the physical interpretation of the curl of a vector field is be- 
yond the scope of this book. However, if the vector field v under consideration represents 
a fluid flow then it may be shown that curl v is a vector which measures the extent to 
which individual particles of the fluid are spinning or rotating. For this reason, a vector 
field whose curl is zero for all values of x, y and z is said to be irrotational. 


Example 26.8 Show that the vector field 
F = ye*i+xe”j + 0k 


is irrotational. 


i j k 

Solution V x F= oe 
ax dy <O0z 

ye” xe’ 0 


= 0i+ 0j + (Gye? + &”) — xe” + €”))k 
=0 for all x, y and z 


The field is therefore irrotational. 


EXERCISES 26.5 


WM ad the curl of the vector Geld two-dimensional vector field. Show that F is 
P aad 


; : a 
v= xi — 3xyj + 4ck. irrotational if — = —. 
dy ox 
2 Ifv = 3xi — 2y?zj + 3xyck find V x v. 
4 Find the divergence and curl of the vector field 


3 Suppose F = P(x, y)i+ Q(, y)jisa E = cos xi+ sin xj. 
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5 Find the curl of each of the following vector fields: (a) Find 'V x ¥. 


(a) E = x yi + Txyzj a 3x7k (b) State 3F. 
(b) v = y*xi + 4xzj + y2xk (c) Find V x (3F). 
(c) F = sinxi+ cos xj + 3xyzk (d) Is 3(V x F) the same as V x (3F)? 


6 A vector field F is given b 
= 7 F isa vector field and k is a constant. Is V x (kF) the 


F=xyit+2yjt+ot+2)k same as k(V x F)? 
Solutions 
1 —3yk (c) 3xzi — 3yzj — sinxk 


6 (a) —j—2x°k 


Q\e . 
2 (3xz + 2y*)i — 3yzj 3 ds 2 
(b) 3x°yi + 6y*j + 3(x 4+ 2°) Kk 


4 —sinx, cos xk (c) —3j- 37k 
5 (a) —7xyi — 6xj + (Tyz —27)k (d) yes 
(b) x(2y — 4)i — y*j + (4c — 2xy)k 7 yes 


| 26.6 COMBINING THE OPERATORS GRAD, DIV AND CURL 


We have now met three vector operators; these are summarized in Table 26.1. 

It is important to be able to combine the three operators grad, div and curl in sensible 
ways. For instance, because the gradient of a scalar is a vector we can consider evaluating 
its divergence, that is 


V:(V¢) =V:- a oO 82 
dx oy Oz 
(8 a 8\ (ad ab ao 
~ \ ax’ dy’ az dx’ dy’ dz 
a? a? 3° 
a ? p 
Ox ay? 2? 


This last expression is very important and is often abbreviated to simply 


Vo 
Table 26.1 
The three vector operators. 
Operator Acts on Result is a Definition 
a a 0 
grad scalar field vector field Vo = grad gd = a + ae aa 
ax dy Oz 
. . dv dv, dv, 
div vector field scalar field V-v= divv= — = 
a dy 0z 
ij k 
a0 0 
curl vector field vector field Vxv= curly =)2- 3 3 
dx dy OZ 
Vy Vy U, 
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pronounced ‘del-squared @’, and occurs in Laplace’s equation V*¢ = 0 and other partial 
differential equations. 


Example 26.9 If ¢ = 2x* —y’ — 2’, find V@, V - (V@) and deduce that @ satisfies Laplace’s equation. 


Solution Vo = 4xi— 2yj — 2zk 
ViVeijet—7-9 20 
that is, 


Vo =0 


Hence ¢ satisfies Laplace’s equation. 


Example 26.10 If (x, y, z) is an arbitrary differentiable scalar field, show that curl (grad 6) = Vx (V¢) 
is always zero. 


Solution Given ¢ = (x, y, z) we have, by definition, 


Then 
curl(grad @) 


V x (Vo) 
ij k 
00 0 
dy az 
dp Ib ID 
ax dy dz 


= (a(ae) ~ aals5))*- (Ze) -ae(Ze)) 
sia) 


0 (oa 0 (9a 
Now, since ? = e with similar results for the other mixed partial deriva- 
ax \ dy dy \ ox 


tives, it follows that 


ll 
@ 
ta 


V x (Vd) =0 


for any scalar field @ whatsoever. 


For an arbitrary differentiable scalar field @ 
V x (Vg) = 0 
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Engineering application 26.3 


Poisson's equation 


Recall from Engineering application 26.1 that engineers sometimes need to solve 
complex electrostatic problems when designing electrical equipment. For example, 
this need arises when calculating the electrical field strength in a high-voltage elec- 
trical distribution station to ensure there is no danger of electrical discharge across 
the air gap between components that have different voltages. Poisson’s equation is 
an equation that can be useful when carrying out this work. Consider the following. 

If E is an electric field and V an electrostatic potential, then the two fields are 
related by 


K=—VV 
From Example 26.2 we know Gauss’s law: V - E = Ls Combining these two equa- 
tions we can write ay 
Te Sue 
€0 
that is 
vy =-2 
€0 


This partial differential equation is known as Poisson’s equation and by solving it we 
could determine the electrostatic potential in a region occupied by charges. Note that 
in a charge-free region, 9 = 0 and Poisson’s equation reduces to Laplace’s equation 
vo 


EXERCISES 26.6 
A scalar field ¢ is given by (b) curl (grad ¢) 
g=3x+y- yr. Show that ¢ satisfies (c) curl (curl F) 


V2o = —2(y* +27). 


If @ = 2x2y — xz} show that V2 = 4y — 6xz. 


(d) div (curl F) 
(e) div (grad ¢) 


If v = xyi — yzj + (y + 2z)k find curl (curl (v)). 


If @ = xyz and v = 3x7i + 2y>j + xyk find V, V-v, , 
and V - (@v). Show that Qj =2Aayt+yz2 by =XZ 


8 Scalar fields @, and ¢, are given by 


2 


V - (gv) = (Vo) -v+ oV-v. 
Verify that @ = xy + y’z + 2x satisfies 


V-(Vo) =2(x+y+4+2). 


If A is an arbitrary differentiable vector field show 


(a) Find V@,. 
(b) Find V@p. 
(c) State ¢)¢>. 


that the divergence of the curl of A is always 0. (d) Find V(¢,¢). 
Express each of the following in operator notation (e) Find 6, Vd) + d)V¢). 
using *V’, “V+” and Vx”: (f) What do you conclude from (d) and (e)? 


(a) grad (div F) 
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9 Ascalar field ¢ and a vector field F are given by 


bd =x 
(a) Find V¢. 
(b) Find V-F. 


Solutions 


@j-k 


(c) Calculate 6(V -F) + F- (V@). [Hint: recall the 
dot product of two vectors.] 

F=,i+ 2j + zk (d) State fF. 

(e) Calculate V - (@F). 

(f) What do you conclude from (c) and (e)? 


(d) (6x2yz + 2xy*z?)i + (2x3z + 2yx*z)j 
+ (2x3 y + 2y?x*2)k 


4 Vo =yat+xzj + xk 


(e) same as (d) 


V-v = 6x4 6y” 
V- (dv) = 9x" yz ai 8xy3z ae xy? (f) V(¢1 92) = g, Vor + ooVo, 
9 (a) yeit+ xz2j + 2xyzk 
7 (a) V(V-F) (b) V x (Vo) 
(b) 2x +1 
(c) Vx (VxF) (d) V-(VxF) 


(e) V+ (Vo) 


8 (a) 2yit (2x4 2y2)j+y’k 


(b) 2xzi + x2k 


(c) Ix yz + yr? 


26.7 


(c) 3x2 y2" + 2x77 + B3xyz 
(d) xt yz7i + 2xyz2j + xyz3k 
(e) same as (c) 


(f) V-@F) = 0(V-F)+F-(V¢) 


VECTOR CALCULUS AND ELECTROMAGNETISM 


Vector calculus provides a useful mechanism for expressing the fundamental laws of 
electromagnetism in a concise manner. These laws can be summarized by means of four 
equations, known as Maxwell’s equations. Much of electromagnetism is concerned with 
solving Maxwell’s equations for different boundary conditions. 


Equation 1 
divD=V-D=o0 


where D = electric flux density, and » = charge density. This equation is a general 
form of Gauss’s theorem which states that the total electric flux flowing out of a closed 
surface is proportional to the electric charge enclosed by that surface. 


Equation 2 
dvB=V-B=0 


where B is the magnetic flux density. 

This equation arises from the observation that all magnetic poles occur in pairs and 
therefore magnetic field lines are continuous; that is, there are no isolated magnetic 
poles. In contrast, electric field lines originate on positive charges and terminate on neg- 
ative charges and so a net positive charge in a region leads to an outflow of electric 
flux. 
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Equation 3 


OB 
curl E= V x E= —-— 
ot 


where E is the electric field strength. This equation is a statement of Faraday’s law. A 
time-varying magnetic field produces a space-varying electric field. 


Equation 4 


dD 
SS ee ae 
where H is the magnetic field strength and J is the free current density. This equation 


states that a time-varying electric field gives rise to a space-varying magnetic field. 


The derivation of these equations is beyond the scope of this text but can be found in 
many books on electromagnetism. The power of these equations lies in their generality. 
The brevity with which the main laws of electromagnetism can be expressed is a tribute 
to the utility of vector calculus. 


REVIEW EXERCISES 26 


1 


Find V@¢ if 7 Ascalar field ¢ is a function of x, z and t only. Vectors 

i o= 3H E and H are defined by 
(b) 6 =x yz t-xy?z + xyz" E +( 3s Hx) n=—-79; 

(c) 6 =x? +x y22 +2 y2z é\ dz Ox ot 
If ¢ =1/ [ya y2 + 22, show that where € is a constant. 

226 a a (a) Show that V-E = 0. 
ae a ie (b) Show that V-H = 0. 
. oH F 
Functions satisfying Laplace’s equation are called Given that V x E = —u—, where y is a constant, 
harmonic functions. Show that the following show that @ satisfies the partial differential equation 
functions are harmonic: a6 a6 aye ao 
(a) z= oa 6xy2 +y4 ax2 | a2 B ar? 

b) z= 4x3 y — 4x73 a a2 
ae am 8 Ifv = (2xy + 3x5)i + ej + ayzk find 5 and — 
If A = xi+ yj + zk and a a 
B = cos xi — sin xj, find 9 Anelectrostatic potential is given by V = 5xyz. Find 
(a) AxB (a) the associated electric field E, 

(b) V-(A xB) (b) |E| at the point (1, 1, 1). 
(c) VxA 10 Anelectrostatic field is given by 
(d) VxB E = 3(x+ y)i+ 2xyj. Find the direction of this field at 
Verify that (a) the point (2, 2) 
V-(A x B) =B-(V x A)—A-(V xB). (b) the point (3, 4). 
For arbitrary differentiable scalar fields @ and yy show 11 Find the curl of A = yi + 2xyj + 3zk. 
that Vio) = VV$+oV¥. 
ify a 4 i+ yj-+ ck, find Vv 12 Find the curl of the vector field 

=xyanda = x1+ yj + zk, fin > 2 . 2)4 

F = (2 — 4y2)j. 

V xa, V x (wa). Show that Gt Gy ay 
V x (Wa) =WV xat+ (Vw) xa. 13 If¢ =5e*+’ cosz find V¢ at the point (0, 0, 0). 


866 


Chapter 26 Vector calculus 


14 The vector fields A and B are given by 


A= yi + 2xzj + xk 
B = yd+ x*yj + 3k 
(a) Calculate the vector product A x B. 
(b) Calculate V-(A x B). 
(c) Calculate B- (V x A) —A-(V x B). 
(d) What do you conclude from (b) and (c)? 


(e) Can you prove this result is true for any two 
vector fields? 


15 Given 
F = (3x, 2y, 4z) G = (1, -1,3) 
b= xyz 
Solutions 


10 


(a) 3yzi + 3xzj + 3xyk 
(b) (2xyz + y?z + yz")i 
+ (x22 + Qxyz + x27)j 
+ Oy + xy? + 2xyz)k 
() (727 + Qxyz” + 2xy*z)i 
+ (Qxyz? + x22? + 2xyz)j 
+ (Qxy?z + 2x" yz + xy )k 


(a) zsinxi+ zcosxj + (—ycosx — xsinx)k 
(b) zcosx (c) 0 (d) —cosxk 
Vw = 2xyit+ Pj 

Vxa=0 

V x (Wa) = x? ai — 2xyzgj + (2xy* — 3) 


(4xy + 15x*)i + ye” j + yzk, 


(4y + 60x3)i + y2 ej 

(a) E=—VV = —S5yzi — 5xzj — 5xyk 
(b) V75 

(a) 121+ 8j 


(b) 211+ 24j 


11 
12 
13 
14 


15 


find 
(a) 
(b) 
(c) 
(d) 
(e) 
(f) 
(g) 
(h) 
(i) 


V-F 
Vo 

VxF 

V- (OF) 

V x (¢G) 
V(F-G) 

V x (F x G) 
V-(F+G) 

V x (3F +4G) 


(2y — 1)k 


(y+ Ik 


Ssi+ 


(a) 


(b) 
(c) 
(d) 
(a) 
(b) 
(c) 
(d) 
(e) 


(f) 
(g) 
(h) 
(i) 


10j 


(6xz — x3 y)i + (xyz — 3x7y)j 
+(x4y? — 2xyz*)k 

—3x2(y + 1) — xz(4y — 1) + 6z 
—3x?(y + 1) — xz(4y — 1) + 6z 
V-(A x B) =B-(V x A)—A-(V xB) 
9 

Qxyzi + x2¢f + x2yk 

0 

21x? yz 

x7 (y + 3z)i+ (X?y — 6xyz)j 

_ (x2z + 2xyz)k 

3i — 2j + 12k 

—6i+7j — 15k 

9 

0 
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INTRODUCTION 


In this chapter a number of new sorts of integral are introduced. These are intimately 
connected with the developments of the previous chapter on differential vector calcu- 
lus. The chapter starts by explaining the physical significance of line integrals and how 
these are evaluated. This leads naturally into the topics of conservative vector fields and 
potential functions. These are important in the study of electrostatics. Double and triple 
integrals are then introduced; these generalize the earlier work on integration to inte- 
grands which contain two and three independent variables. 

Finally some simple volume and surface integrals are introduced, together with the di- 
vergence theorem and Stokes’ theorem. These enable Maxwell’s equations to be 
expressed in integral form. 


LINE INTEGRALS 


Consider an object of mass m placed in a gravitational field. Because the force of grav- 
ity is a vector the gravitational field is an example of a vector field. The gravitational 
force on the mass is known as its weight and is given by mg where g is a constant 
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A 
s 
meM 
és 
N 
Figure 27.1 
B An object of mass m falls from A to B. 


vector called the acceleration due to gravity. Suppose we release the mass and allow 
it to fall from point A in Figure 27.1. The vertical displacement measured downwards 
from A is s. 

Work is being done by the gravitational force in order to make the mass accelerate. We 
wish to calculate the work done by the field in moving the mass from A to B. Suppose 
we consider the amount of work done as the mass moves from point M to point N, a 
distance 5s. Elementary physics tells us that the work done is equal to the product of the 
magnitude of the force and the distance moved in the direction of the force. In this case 
the magnitude of the force is mg, and so the small amount of work done, dW, in moving 
from M to N, is 


dW =meg ds 
; 5W . 
from which we have ma = mg. As 5s — 0 we obtain 
KY 


ow dW 
— mg 


im — = — 

bs>0 OS ds 
To find the total work done as the mass falls from A to B we must add up, or integrate, 
the contributions over the whole interval of interest, that is 


B 
total work done = W = i mg ds 
A 


This is an elementary example of a line integral, so called because we are integrating 
along the line from A to B. In this case it is straightforward to evaluate. Since both g and 
m are constants the integral becomes 


B 
w= mg [ ds 
A 


which equals mgx (distance from A to B). 


Engineering application 27.1 


The work done by the gravitational field 


An object of mass m falls vertically from A to B. If A is the point where s = 0 and B 
is the point where s = 10 find the total work done by gravity as the mass falls from 
Ato B. 
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Solution 
The work done by gravity is found by evaluating the line integral 
s=10 
W= me | dy— melsly- = 10mg 


=0 
Note that this is also the potential energy lost by m in falling a distance of 10 units. 


In the previous example the path along which we integrated was a straight line, but this 
need not always be the case. Consider the following example. 


Engineering application 27.2 


The work done by an electric field 


Figure 27.2 shows a unit charge moving along a curve C from point A to point B in 
an electric field E. 

At the particular point of interest, M, we have resolved the electric field vector 
into two components. Resolving a vector into perpendicular components has been 
described in Example 7.3. One component is tangential to C, namely E,, and one is 
normal to C, namely £,. As a unit charge moves from M to N, a distance 6s along the 
curve, the work done by the electric field is E,5s. The component E,, does no work 
since there is no motion perpendicular to the curve C. To find the total work done we 


must add up all contributions, resulting in the integral 
B 


total work done = i E, ds 
A 


This is a second example of a line integral, the line being the curve, C, joining A and 
B. It is usual to denote this by 


total work done = if E, ds 
€ 


where the symbol i tells us to integrate along the curve C. 
G 


Figure 27.2 
As a charge moves from M to N, 
the field E does work E,6s. 
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If the coordinates of the end points of the curve C are known, say (x,, y,) and (x5, y>), 

(2,99) 

x19) 

route from A to B. 
We now explain how to integrate a function along a curve. Consider a vector field, F, 


through which runs a curve, C, as shown in Figure 27.3. 


we often write iP to show this, but care must then be taken to define the intended 


F(x,y) 


Figure 27.3 
The integral F- ds is equal to the work 
Cc 


done by F as the particle moves along C. 


Suppose we restrict ourselves to two-dimensional situations. In the general case the 
vector field will vary as x and y vary, that is F = F(x, y). Consider the small element 
of C joining points M and N. Let @ be the angle between the tangent to the curve at M 


— 
and the direction of the field there. We shall denote the vector joining M and N (i.e. MN) 
by ds. Consider the quantity 


F-6ds 


where - represents the scalar product. When F represents a gravitational force field, 
F - 5s represents the small amount of work done by the field in moving a particle of unit 
mass from M to N. The appropriate integral along the whole curve represents the total 
work done. From the definition of the scalar product we note that 


F - 5s = |F||5s| cos 6 
Writing the modulus of F as simply F’, and the modulus of ds, as 6s, we have 
F-d5s = Fdscosé 
= (F cos@)ds 
= Fos 


where F; is the component of F tangential to C. This result is of the same form as the 
expressions for work done obtained previously. We are therefore interested in integrals 
of the form 


/ F(x, y) - ds 
Cc 


ds a 
dxi 


Figure 27.4 


The vector ds can be 
written as dxi + dyj. 


27.3 
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Since F is a vector function of x and y it will have Cartesian components P(x, y) and 


, Q(x, y) and so we can write F in the form 
J 


F(x, y) = P(x, yjit+ OG, y)j 


Similarly, referring to Figure 27.4, we can write the vector ds as ds = dxi + dyj. Hence 


[Po a= [revit om nb (ad + ap 


Taking the scalar product gives 


[Ponar+ 06.90 


and this is the line integral in Cartesian form. 


If F(x, y) = P(x, y)i + Q(x, y)j then 


[Fo ds [eee y)i + Q(x, y)j) + (dai + dyj) 


= [ Pos av+ O04 »0¢y 


In order to proceed further we now need to explore how such integrals are evaluated 
in practice. This is the topic of the next section. 


EVALUATION OF LINE INTEGRALS IN TWO DIMENSIONS 


Example 27.1 Evaluate 


[Fs 
Cc 


where F = 5y’i + 2xyj and C is the straight line joining the origin and the point (1, 1). 


Solution We compare the integrand with the standard form and recognize that in this case 


P(x, y) = 5y’ and Q(x, y) = 2xy. The integral becomes 


i Sy’ dx + 2xy dy 
Cc 


The curve of interest, in this case a straight line, is shown in Figure 27.5. Along this 
curve it is clear that y = x at all points. We use this fact to simplify the integral by 
writing everything in terms of x. We could equally well choose to write everything in 


d 
terms of y. If y = x then = 1, that is dy = dx. As we move along the curve C, x 


ranges from 0 to 1, and the integral reduces to 


x=1 73 1 7 
[seart2ear= [rear= | wear =| 7] =. 
Cc Cc x=0 3 |, 3 
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Example 27.2 


Solution 


Example 27.3 


Solution 


y 

1 d, ) 

Cc 

0 1 x 
Figure 27.5 Figure 27.6 
The path of integration joins The path C is made up of two distinct 
(0, 0) and (1, 1) and has equation parts. Integration is performed 
y=x. separately on each part. 


Let us now see what happens when we choose to evaluate the integral of Example 27.1 
along a different path joining (0, 0) and (1, 1). 


Evaluate the integral Cc 5y* dx + 2xy dy along the curve, C, consisting of the x axis be- 
tween x = 0 and x = | and the line x = | as shown in Figure 27.6. 


We evaluate this integral in two parts because the curve C is made up of two pieces. The 
first piece C, is horizontal, and the second C, is vertical. The required integral is the sum 
of the two separate ones. Along the x axis, y = 0 and dy = 0. This means that both the 
terms Sy” dx and 2xy dy are zero, and so the integral reduces to 


x=1 
/ Odx =0 
x=0 


and so there is zero contribution to the final answer from this part of the curve C. Along 
the line x = 1, the quantity dx equals zero. Hence 5y” dx also equals zero, and 2xy dy 
equals 2y dy. Because y ranges from 0 to | the integral becomes 


y=1 
i 2ydy = [Jj = 1 
y=0 


Note that this is a different answer from that obtained in Example 27.1. 


As we have seen from Examples 27.1 and 27.2 the value of a line integral can depend 
upon the path taken. It is therefore essential to specify this path clearly. 


Evaluate the integral Ie F(x, y) - ds, where F(x, y) = (3x? + y)i+ (5x — y)j and C is the 
portion of the curve y = 2x* between A(2, 8) and B(3, 18). 


[Fo.-as= [oe art Gx— yay 
(Be GC 


The curve C has equation y = 2x” for 2 < x < 3. Along this curve we can replace y by 


2x. Note also that ;. = 4x and so we can replace dy with 4x dx. This will produce an 
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integral entirely in terms of the variable x which is then integrated between x = 2 and 
x = 3. Thus 


3) 
[oe +nart Gr—ney= f (3x7 + 2x7) dx + (5x — 2x7) (4x dx) 
Cc 2: 


3 
= 25x° — 8x dx 
2 


EXERCISES 27.3 


1 Evaluate te 3y dx + 2x dy along the straight line C (a) along the straight line joining A and B, 


between (1, 1) and (3, 3). (b) horizontally along the x axis from x = 0 tox = 3 


2 : ; and then vertically from y = 0 to y = 2. 
2 Evaluate f c 2yx dx + x* dy along the straight line 


y = 4x from (0, 0) to (3, 12). 5 If F = Qxy—y* +3)i4+ (a? — 4xy°)j evaluate 
te F- ds where C is the straight line joining A(1, 3) 


3 Evaluate Jox + 3y) dx + 2y dy along the curve and B(2, 5). 


y = x* between (0, 0) and (2, 4). 
; : Evaluate the integral 
4 IfE= («+2y)i+ @ — 3y)j, A is the point (0, 0) and 


B is the point (3, 2), evaluate 2 2 
[os + 2y) dx + (7x + y*) dy 
iG 


B 
[ E-ds from A(0, 1) to B(2, 5) along the curve C defined by 
y=2x+1. 
Solutions 
15 9 
1 20 4 pee b) = 
(a) 5 (b) 5 
2 108 5 —1149 
3 38 268 
am 
3 


EVALUATION OF LINE INTEGRALS IN THREE DIMENSIONS 


Evaluation of line integrals along curves lying in three-dimensional space is performed 
in a similar way. It is often helpful in this case to express the independent variables x, y 
and z in terms of a single parameter. Consider the following example. 
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x=4 y=P z=5+t 
and is located within a vector field F = yi + x°j + (z+ x)k. 


(a) Find the coordinates of the point P on the curve where the parameter ¢ takes the 


value 1. 


(b) Find the coordinates of the point Q where the parameter ¢ takes the value 3. 


(c) By expressing the line integral /, c F'-ds entirely in terms of t find the value of the 


line integral from P to Q. Note that ds = dxi + dyj + dzk. 


Solution (a) Whent = 1,x =4, y= 1 and z= 6, and so P has coordinates (4, 1, 6). 
(b) When t = 3, x = 4, y = 27 and z = 8, and so Q has coordinates (4, 27, 8). 
(c) To express the line integral entirely in terms of f we note that if x = 4, 
d 
an 0 so that dx is also zero. If y = f° then - = 31° so that dy = 3¢?dr. 
d 
Similarly since z= 5 +1, ; = | so that dz = dt. The line integral becomes 
[F-« = [oirei + (z+ x)k) - (dxi + dyj + dzk) 
c c 
— [vase dy + (z+ x) dz 
c 
t=3 
-|/ 0 + 16(317) dt + (9 +1) dt 
t=1 
3 
-|/ 487 ++9+4¢tdt 
1 
81 3 
[fo] 
3 1 
48(3 3° 48 
- (FP + ma+5 ) (5 +94 
= 438 
EXERCISES 27.4 
1. You are required to evaluate the line integral (a) Find the coordinates of the point A, where ¢ = 0. 
fe F. ds where F is the vector field (b) Find the coordinates of the point B, where t = 1. 
= (2y + 3x)i + (yz + xj + 3xyk and (c) By expressing the line integral entirely in terms 


ds = dxi + dyj + dzk. The curve C is defined 


th C: 
parametrically by x = 12, y = 3¢ and z = 2t for e curve 


values of t between 0 and 1. 


of t, evaluate the line integral from A to B along 


Solutions 


1 (a) (0,0, 0) 
(b) C, 3, 2) 
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Example 27.5 


Solution 


27.5.1 
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(c) 17 


CONSERVATIVE FIELDS AND POTENTIAL FUNCTIONS 


We have seen that, in general, the value of a line integral depends upon the particular 
path chosen. However, in special cases, the integral of a given vector field F turns out to 
be the same on any path with the same end points; that is, it is independent of the path 
chosen. In these cases the vector field F is said to be conservative. There is a simple test 
which tells us whether or not F is conservative: 


F is a conservative vector field if 


curl F = 0 everywhere 


If we are dealing with a two-dimensional vector field F(x, y) = P(x, y)i+ Q(, y)j then 
dQ 


oP 
this test becomes: F is conservative if a = - (See Question 7 in Exercises 27.5.) 
y x 


Show that the vector field 
F = ze" sinyi + ze" cos yj + e* sin yk 


is a conservative field. 


We find curl F: 


0 0 0 
ax oy Oz 
ze* siny ze* cosy e* siny 


= (e’ cos y — e* cos y)i — (e* siny — e’ sin y)j + (ze* cos y — ze* cos y)k 
=0 


We have shown that V x F = 0 and so the field is conservative. Note from Section 26.5 
that such a field is also said to be irrotational. 


There are two important properties of conservative vector fields which we now discuss. 


The potential function 


It can be shown that any conservative vector field, F, can be expressed as the gradient of 
some scalar field ¢. That is, we can find a scalar field @ such that F = V@. The scalar 
field ¢ is said to be a potential function and F is said to be derivable from a potential. 
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Example 27.6 


Solution 


Example 27.7 


Solution 


In three dimensions, if 
F = P@, y,z)it+ Q(x, y, DJ+ RO, y, Dk 


is conservative, we can write 


F=Vo 
_ 96, 9b, Oh 
7 be Oy Be 
So 
a a a 
a og" go" 
Ox dy Oz 


You should refer back to Example 26.4 where this idea was introduced. 


The vector field v is derivable from the potential @ = 2xy + zx. Find v. 


If v is derivable from the potential ¢, then v = V@ and so 
v=V¢ = (2y+ z)it 2xj + xk 


This vector field is conservative as is easily verified by finding curl v. In fact, 


i j k 

0 are) 

Vxv= ax ay az 
2y+z2x x 


= 0i- 1 -—1)j+ @2-2)k 
=0 
Indeed, recall from Example 26.10 that curl (grad @) is identically zero for any ¢. 


When the vector field F is conservative there is an alternative method of evaluating the 
line integral SF - ds which involves the use of the potential function @. Consider the 
following example. 


The two-dimensional vector field F = i + 2j is conservative. 


0 0 
(a) Find a suitable potential function @ such that F = Vé = scl + = j. 
x y 
(b) Evaluate be F - ds along any convenient path. 


(c) Find the value of ¢ at B(3, 2), and the value of @ at A(O, 0), and show that the 
difference between these is equal to the value of the line integral obtained in part (b). 


0 
(a) We are given that ~ = 1 so that @ = x + f(y), where f is an arbitrary function 
x 
0 
of y. We are also given that ~ = 2 so that ¢ = 2y + g(x), where g is an arbitrary 
y 


function of x. It is easy to verify that ¢ = x + 2y is a suitable potential function. 


27.5.2 
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(b) To find the line integral we shall choose the straight line path, C, joining (0, 0) and 


(3, 2). This has equation y = = The line integral becomes 


(3,2) (3,2) 
/ F-ds= [ (i+ 2j) « (dai + dyj) 
( (0,0) 


0,0) 
(3,2) 
= dx + 2 dy 
(0,0) 
ve 4 2 
— dx + ~ dx since dy = =~ dronC 
(0.0) 3 3 


x=3 a 
= / ae 
x=0 3 


(c) The value of @ at B(3, 2) is 3 + 2(2) = 7. The value of @ at A(O, 0) is 0. Clearly 
the difference between these values is 7, the same as the value of the line integral 
obtained in part (b). 


The result obtained in the previous example is important: 


If F is a conservative vector field such that F = V@ then, for any points A and B, 


B 
[ F- ds = $(B) — $(A) 


The line integral around a closed loop 


A second important property of conservative fields arises when the curve C of integration 
forms a closed loop. Suppose we evaluate a line integral from A to B firstly along the 
curve C, and secondly along C, as shown in Figure 27.7. 

If the field, F, is conservative both integrals will yield the same answer, that is 


[ F-os= [ Beas 
Cc Cc. 


1 2 


But 


JF astiom Ato B =~ | Feds from B to A 


Consequently, the line integral around the closed curve A to B and back to A must equal 
zero. When the path of integration is a closed curve we use the symbol 


f 
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B id B 
1 
Cy 
C 
Ci : 
C 

A A 
Figure 27.7 0 1 Ke 
The line integral / F- ds can be Figure 27.8 
evaluated along different paths The path of integration is the curve C, 
between A and B. followed by the line C). 


to represent the integral. So, for a conservative field we have the result 


f F-as=0 
c 


for any closed curve C. 
In summary we have the following results: 


For a conservative field, F, all the following statements are equivalent: 
VxF=0 


f F-ds=0 


B 
if F - ds is independent of the path between A and B 
A 


F is derivable from a scalar potential, that is F = V@ 


Example 27.8 Evaluate 


[F-as 
Cc 


where F is the vector field y’i + 2xyj and where: 
(a) C=C, is the curve y = x° going from A(0, 0) to BCL, 1). 
(b) C =C, is the straight line going from B(1, 1) to A(O, 0). 


(c) Deduce that £ F - ds = 0 where the closed line integral is taken around the path C, 
and then C,. 


Solution The situation is depicted in Figure 27.8. 


(a) C, has equation y = x’ and so dy = 2x dx. Note that we are integrating from A to B. 
Therefore 
/ F-ds = / (yi + 2xyj) + (dxi + dyj) = i y’ dx + 2xy dy 
c C, 


1 C, 
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Substituting y = x” and dy = 2x dx we have 


[F-o= xt dx + (2x)x" (2x dx) 
C; Cc. 


1 1 


x=1 
-|/ 5x° dx = [7 ]} =1 


=0 


(b) C, has equation y = x and so dy = dx. Note that this time we are integrating from B 


to A. Therefore 


i 


2 


Feds = [ ('+209)- i+ aj) = | y’ dx + 2xy dy 
C, CG 


Substituting y = x and dy = dx we have 


=I 


2 


(c) Now 


x=0 0 
[Fos / Part 2edr= [ ae he = [| = = 
C. x 1 


fr-as= | F-ds-+ [ F-ds=1—1=0 
Cc C, 


1 


The line integral around the closed path is zero because the field is conservative. 
You should check that V x F = 0 and that a suitable potential function is ¢ = yx. 


EXERCISES 27.5 


Verify that the vector field 
F = (4x° + y)i + xj is conservative. 


For the vector field 
F = ycosxyi + xcos xyj + 2zk find V x F and verify 
that the field is conservative. 


Consider the field F = yi + xj. 


(a) Show that F is a conservative field. 
: : 0d ad 
(b) Find a function @ such that — = y and — =x. 
Ox dy 
(c) Find a suitable potential function ¢ for F. 
(d) If A is the point (2, 1) and B is the point 
(5, 8) evaluate is F-ds. 
(e) Find ¢ at B and ¢ at A and show that the value of 
the line integral calculated in part (d) is equal to 
the difference between the values of ¢ at B and A. 


Show that the vector field, 

F = (2xy — yt + 3)i + (x? — 4xy7)j, of Question 5 in 
Exercises 27.3, is conservative and find a suitable 
potential function ¢ from which F can be derived. 
Show that the difference between ¢ evaluated at 

B(2, 1) and at AC, 0) is equal to the value of the line 
. B 

integral [, F-ds. 


5 Show that 


is independent of the path of integration, and evaluate 
the integral. 


The function ¢@ = 4xy is a potential function for 
which F = V@. 
(a) Find F. 


(b) Evaluate f F - ds along the curve y = 2? from 
A(0, 0) to B(2, 8). 


(c) Evaluate ¢ at B and at A and show that the 
difference between these values is equal to the 
value of the line integral obtained in part (b). 


If F = P(x, y)i+ Q(x, y)j + Ok, show that 


P 
VxF= 30 _ a k. Deduce that if F is 
ox dy 
dQ __ oP 


conservative then — = —. 
ay 
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Solutions 
3 (b) 6=xy+ec (c) xy+e (d) 38 oe 2 
4 pa=xvy—yix4+3x4e,5 6 (a) F=4yi+4xj = (b) 64 


DOUBLE AND TRIPLE INTEGRALS 


27.6.1 Double integrals 


Expressions such as 


f f(x, y) dxdy 
y 


yay, Yx=x, 


are known as double integrals. What is meant by the above is 


[O (Lorene) 
y=y, xex 


1 


where firstly the inner integral is performed by integrating f with respect to x, treating y 
as if it were a constant. The limits of integration are inserted as usual and then the whole 
expression is integrated with respect to y between the limits y, and y,. 

It is important that you can distinguish between double integrals and line integrals. 
Physically they mean quite different things, and they are evaluated in different ways. 
Whilst both types of integral contain terms involving x, y, dx and dy, in a double integral 
dx and dy always occur as a product, that is as dx dy. In a line integral they occur sep- 
arately. When evaluating a line integral we integrate along a curve. When evaluating a 


double integral we integrate over a two-dimensional region. 
Double integrals may also be written in the form 


[[ tonavay 
R 


where R is called the region of integration. The region R must be described mathemat- 


ically. In the integral 


Y=) PX=X> 
/ fusing 
y 


yay, Yx=x, 


R is the rectangular region defined by x, < x < xX, y; < y < y,. Non-rectangular 


regions are also common. 


Example 27.9 Sketch the region R defined by 1 <x <4and0<y<2. 


Solution The region is shown in Figure 27.9. We see that x lies between 1 and 4, and y lies between 


0 and 2. Consequently the region is rectangular. 


(c) 64 
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0 1 2 3 4 x 0 1 2 3 4 x 
Figure 27.9 Figure 27.10 
The rectangular region over which the The integral could have been found by first 
integration in Example 27.10 is integrating over a vertical strip such as that 
performed. shown. 


Example 27.10 Evaluate 


Solution 


y=2 x=4 
/ i x + 2y dx dy 
y=0 Jx=1 


over the region, R, given in Example 27.9 and defined by 1 <x << 4and0 <y <2. 


The inner integral is found first by integrating with respect to x, keeping y fixed, that is 
constant. 


x=4 x2 x=4 1 15 
x+2ydx = | — + 2xy = 8+ 8y— — —2y= — + 6y 
x=1 2 2 2 


x=1 


Then the outer integral is found by integrating the result with respect to y: 


oA 15 13 ar 
—+6ydy=|]—y+3y] =15412=27 
y=0 2. 2 0 


Now consider again Example 27.10. By performing the inner integral first, 


x=4 
: x + 2y dx 
x=! 


we are effectively singling out a horizontal strip, such as that shown in Figure 27.9, and 
integrating along it from x = 1 to x = 4. When we then integrate the result from y = 0 
to y = 2 we are effectively adding up contributions from all such horizontal strips. 

The integral over the same region could also have been performed by integrating first 
over vertical strips for 0 < y < 2 (Figure 27.10) yielding 


y=2 
/ x+ 2ydy = [xy + Pay = 2x +4 
y=0 ° 
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Example 27.11 


Solution 


The result is then integrated from x = 1 to x = 4 effectively adding up contributions 
from each vertical strip: 


x=4 
/ 2x +4dx = [x? + 4x]f = 32-5 = 27 


x=1 


as expected. We note that 


y=2 px=4 x=4 py=2 
/ di xt 2yardy =f i. x + 2ydydx (27.1) 
y=0 Jx=1 x=1 y=0 


In particular, note the order of dx and dy in Equation (27.1). The simple interchange of 
limits is only possible because in this example the region R is a rectangle. 


When evaluating a double integral over a rectangular region R, the integration may 
be carried out in any order, that is 


[[ tonaray= ff po.narar 
R R 


The double integral in Example 27.10 could have the following interpretation. Con- 
sider the surface z = x + 2y, forO < y < 2,1 < x < 4, as shown in Figure 27.11. In 
this example the surface is a plane but the theory applies to more general surfaces. The 
variable z represents the height of the surface above the point (x, y) in the x—y plane. 
For example, consider a point in the x—y plane, say (2, 3), that is x = 2, y = 3. Then at 
(2,3),z=x+2y = 2 +4 2(3) = 8. Thus the point on the surface directly above (2, 3) 
is 8 units from (2, 3). 

Let us now consider dx dy. We call dx dy an element of area. It is a rectangle with 
dimensions dx and dy and area dx dy. Hence z dx dy represents the volume of a rectangu- 
lar column of base dx dy. As we integrate with respect to x for 1 < x < 4 and integrate 
with respect to y for 0 < y < 2 we are effectively summing all such volumes. Thus 


y=2 x=4 
/ i x + 2y dx dy 
y=0 Jx=1 


represents the volume bounded by the surface z = x + 2y and the region R in the x—y 
plane. 


Sketch the region R over which we would evaluate the integral 


y=1 x=2—2y 
/ / f(x, y) dx dy 
y=0 Jx=0 


First consider the outer integral. The restriction on y means that interest can be confined 
to the horizontal strip 0 < y < 1. Then examine the inner integral. The lower limit on 
x means that we need only consider values of x greater than or equal to 0. The upper x 
limit depends upon the value of y. If y = 0 this upper limit is x = 2 —2y = 2. Ify= 1 
the upper limit is x = 2 — 2y = 0. At any other intermediate value of y we can calculate 
the corresponding upper x limit. This upper limit will lie on the straight line x = 2 — 2y. 
With this information the region of integration can be sketched. The region is shown in 
Figure 27.12 and is seen to be triangular. 
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N 


coor NYU DN 


Figure 27.11 

The surface z=x+2y,0<y<2,1 <x < 4; the Figure 27.12 

quantity z dx dy is the volume of the rectangular column The region over which the integration in Example 27.12 
with base area dx dy. is performed. 


Example 27.12 Evaluate 


y=1 x=2-2y 
iP / 4x + 5 dx dy 
y=0 Jx=0 


over the region described in Example 27.11. 


Solution We first perform the inner integral 


x=2—-2y 
/ 4x + 5 dx 
x=0 
integrating with respect to x. This gives 
[ar Se, = 2 = 2y)’ 450 = 2) =0 
= 8y’ — 26y + 18 


Next we perform the outer integral 


y=1 5 2 1 
gy? 26 
/ By? — 26y + 18dy =| 2 —— + 18y 
y=0 3 2 0 
8 26 
ee un 
a al 
= 7.667 


The region of integration is shown in Figure 27.12. The first integral, with respect to x, 
corresponds to integrating along the horizontal strip from x = 0 tox = 2 — 2y. Then 
as y varies from 0 to | in the second integral, the horizontal strips will cover the entire 
region. 


884 Chapter 27 Line integrals and multiple integrals 


Example 27.13 


Solution 


If we wish to change the order in which the integration is carried out, care must be 
taken with the limits of a non-rectangular region. Consider again Figure 27.12. The line 


De 
x = 2 — 2y can be written as y = =a We can describe the region R by restricting 


attention to the vertical strip 0 < x < 2, and then letting y vary from 0 up to 7 
that is 


x=2 y=(2—x)/2 
/ 4x + 5 dy dx 
x=0 dy 


=0 


You should check by evaluating this integral that the same result is obtained as in 
Example 27.12. 


Evaluate the double integral of f(x, y) = x? + 3xy over the region R indicated in Fig- 
ure 27.13 by 


(a) integrating first with respect to x, and then with respect to y, 


(b) integrating first with respect to y, and then with respect to x. 


(a) If we integrate first with respect to x we must select an arbitrary horizontal strip as 
shown in Figure 27.13 and integrate in the x direction. On OB, y = x so that the 
lower limit of the x integration is x = y. On AB, x = | so the upper limit is x = 1. 
Therefore 


ll 
fo 
Wl Re 

+ 
Ne 
Se 
fm, 
wl, 

+ 
Nites 

[oS) 
Se 


As y varies from 0 to | the horizontal strips will cover the entire region. Hence the 
limits of integration of the y integral are 0 and 1. So 


ry 3y 1 3, Ity* 
/ s+ 3-Sra-[y+p7- 


if 


= 2 6 3 4 Ah, 
eee 
~ 3° 4° 24 
15 
~ 24 
5 
~ 8 


that is, 


y=1 x=1 5 
i / x + 3xydxdy = = 
y=0 Yx=y 8 


(b) 
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y 
B 
1 
vax 

0 y=0 1 A x 
Figure 27.13 Figure 27.14 
The integral with respect to x has We integrate along a vertical strip 
limits x = y and x = 1. from y = Otoy =x. 


If we choose to integrate with respect to y first we must select an arbitrary vertical 
strip as shown in Figure 27.14. At the lower end of the strip y = 0. At the upper end 
y=x. 

To integrate along the strip we evaluate 


y=x 3 2747+ 3 3 5 3 
/ + suyay= [y+ | ofa" ™ 
y=0 2 0 


We add contributions of all such vertical strips by integrating with respect to x from 
x=O0tox=1: 


1 5x3 


aa) ] —5 
a: Pile. 6 


We see that the prudent selection of the order of integration can yield substantial 
savings in the effort required. 


27.6.2 Triple integrals 


The techniques we have used for evaluating double integrals can be generalized natu- 
rally to triple integrals. Whereas double integrals are evaluated over two-dimensional 
regions, triple integrals are evaluated over volumes. 


Example 27.14 Evaluate 


x=1 y=1 z= 
al i i x+y+zdzdydx 
x=0 Jy=0 Jz=0 


Solution What is meant by this expression is 


x=1 y=1 z=l 
r= | (/ (/ xty-+ 2de) ay) a 
x=0 y=0 z=0 
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where, as before, the inner integral is performed first, integrating with respect to z, with 
x and y being treated as constants. So 


x=1 y=1 2 1 
r= | / [ato + 5 | dy} dx 
x=0 y=0 2 0 
x=1 y=1 1 
/ ( / xX+ oy + oY ‘y) dx 
x=0 y=0 


ll 
i 
Ss i 

Se 

So 

+ 
we 

+ 
N| 

S 
_ a | 
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Consideration of the limits of integration shows that the integral is evaluated over a unit 
cube. 


27.6.3 Green’s theorem 


There is an important relationship between line and double integrals expressed in 
Green’s theorem in the plane: 


If the functions P(x, y) and Q(x, y) are finite and continuous in a region of the 
x-y plane, R, and on its boundary, the closed curve C, provided the relevant par- 
tial derivatives exist and are continuous in and on C, then 


frsesown|[(2-2)e 


where the direction of integration along C is such that the region R is always to 
the left. 


The conditions given in the theorem are present for mathematical completeness. Most 
of the functions that engineers deal with satisfy these conditions, and so we will not 
consider these further. The important thing to note is that this relationship states that a 
line integral around a closed curve can be expressed in terms of a double integral over 
the region, R, enclosed by C. 


Example 27.15 (a) Evaluate £.xydx + x° dy around the sides of the square with vertices D(0, 0), 
E(1, 0), F(I, 1) and GO, 1). 


(b) Convert the line integral to a double integral and verify Green’s theorem. 
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Solution (a) The path of integration is shown in Figure 27.15. To apply Green’s theorem the 
path of integration should be followed in such a way that the region of integration 
is always to its left. We must therefore travel anticlockwise around C. 


» 
1 G F On DE, y=0, dy=OandO0 <x <1. 
On EF,x=1, dx=OandO0<y<l 
On FG, y = 1, dy = 0 and x decreases from | to 0. 
" On GD, x = 0, dx = 0 and y decreases from | to 0. 
0 1 x The integral around the curve C can then be written as 
Figure 27.15 


E F G D 
The path C is chosen so § = i + / + / + / 
that the region R is c D E F G 
always to its left. Theres 


1 0 
f war+srdy 04 f idy+ f xdx +0 
iC 0 1 


2750 
= b+] > 
1 
ep (ee 

Pi 
a 
~ 2 


(b) Applying Green’s theorem with P(x, y) = xy and Q(x, y) = x* we can convert the 
line integral into a double integral. Note that —— = 2x and ae x. Clearly the 
y 


region of integration is the square R. We find 


f xyar+ dy = | (2x — x) dx dy 
Cc R 


1 pl 
-|/ / x dx dy 
0 Jo 


We see that the same result as that in part (a) is obtained and so Green’s theorem 
has been verified. 
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1 


4 
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EXERCISES 27.6 


Evaluate 


1 3 4 2 

(a) / i} 2 dxdz (b) / i x dy dz 
0 JO 0 JO 
2 p3 1 3y 

© | / xy + ldxdy @ | i = dxdy 
ee -1d2 % 


R is the shaded region shown in Figure 27.16. 
Evaluate f'f,x + ./ydxdy 
(a) performing the integration with respect to x first, 


(b) performing the integration with respect to y first. 


Figure 27.16 
The region of integration for Question 2. 


Evaluate 'f,(5x? + 2y”) dx dy where R is the interior 


of the triangular region bounded by 
ACI, 1), B(2, 0) and C(2, 2). 


(a) Sketch the region of integration of the double 
integral 


3 pV 
/ / ydxdy 
1 1 


Solutions 


(a) 1 (b) 8x (c) 10.5 (d) 0 


20 
(a) > 


20 
g MG 


(b) 1.35435 


256 
3 


4 
3 
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(b) Evaluate the integral by first reversing the order 
of integration. 


Evaluate 


1 5 
/ / (2 +9?) dedy 
—-lJ1 


Evaluate 


II (2 +?) dedy 
R 


over the triangular region R with vertices at 
(0, 0), (2, 0) and (1, 1). 


Evaluate 


ik + 2y)7!/? dxdy 
R 


over the region R given by x — 2y < 1 and 
ey. 


Evaluate fc (sinx + cos y) dx + 4e* dy where C is the 
boundary of the triangle formed by the points (1, 0), 
(3, 0) and (3, 2). By converting this line integral into a 
double integral, verify Green’s theorem. 


Evaluate the line integral 


f ixty) de + duydy 
C 


where C is the boundary of the triangle formed by the 
points (0, 0), (2, 0) and (0, 5). Express the line integral 
in terms of an appropriate double integral and 
evaluate this. Verify Green’s theorem. 


. The region R is shown in Figure S.26. 


< WI 


x=y24+]1 


re NM 


0 12 3 4°55 x 
Figure S.26 
92.306 
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SOME SIMPLE VOLUME AND SURFACE INTEGRALS 


Volume and surface integrals arise frequently in electromagnetism and fluid mechan- 
ics. We will illustrate these concepts through some simple examples. For a thorough 
treatment of more general cases you will need to refer to a more advanced text. 


Engineering application 27.3 


The mass of a solid object 


Consider a solid object such as that shown in Figure 27.17. The density, o, of this 
object may vary from point to point. So, at any point P with coordinates (x, y, z), the 
density is a function of position, that is p = p(x, y, z). Since density is a scalar, this 
is an example of a scalar field, like those discussed in Section 7.4. 
Suppose we select a very small piece of this object having volume dV and located 
at P(x, y, z). Recall from elementary physics that 
mass 


density = 
volume 


Then the mass of this small piece, 5m, is given by 
bm = p dV 


If we wish to calculate the total mass, M, of the object we must sum all such contri- 
butions from the entire volume. This is found by integrating throughout the volume. 
We write this as 


total mass, M = / pdv 
Vv 


This is an example of a volume integral, so called because the integration is per- 
formed throughout the volume. It will usually take the form of a triple integral such 
as those discussed in Section 27.6.2. Technically, there are three integral signs, but 
for brevity these have been replaced by the single iy where it is to be understood that 
the integral is to be performed over a volume. In any specific problem care must be 
taken to ensure that the entire volume is included when the integration is performed. 

For example, consider the case of a solid cube with sides of length 1 unit. Let 
one corner be positioned at the origin and let the edges coincide with the positive 
x, y and z axes. Suppose the density of the cube varies from point to point, and is 
given by p(x, y, z) =x-+y-+z. Then the integral which gives the mass of the cube is 
i (x + y + z) dV, where the volume V is the region occupied by the cube. This 
integral has been evaluated in Example 27.16 and found to be 3, representing the 
mass of the cube. 


Figure 27.17 

The total mass of a body is found by 
summing, or integrating, throughout 
the whole volume. 
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Engineering application 27.4 


The electric charge enclosed in a region 


Suppose electric charge is distributed throughout a volume, V, and at any point has 
charge density p(x, y, z). Charge density is a scalar field. 

Suppose we select a very small portion having volume dV and located at P(x, y, z) 
as shown in Figure 27.18. The charge in this portion, 5Q, is given by 


80 = pbV 


If we wish to calculate the total charge enclosed within the volume, we must sum all 
such contributions from the entire volume. This is found by integrating throughout 
the volume. We write this as 


total charge, Q = / pdav 
Vv 
This is another example of a volume integral. 


volume dV 


volume V 


Figure 27.18 

The total charge enclosed is found by 
summing, or integrating, throughout 
the whole volume. 


Engineering application 27.5 


Fluid flow across a surface 


Figure 27.19(a) represents the motion of a body of fluid throughout a region. At any 
point P(x, y, z) fluid will be moving with a certain speed in a certain direction. So, 
each small fluid element has a particular velocity v, which varies with position, that 
is V = V(i, y, z). This is an example of a vector field, as discussed in Section 7.4. 


surface S 


Py, LE 
(b) 


(a) 


Figure 27.19 
The vector field v represents fluid velocity and S is an imaginary surface through which the 
fluid flows. 


Vi—aviCyae) 


small portion of surface 6S 
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In Figure 27.19(b) we have placed an imaginary surface, S, within the flow and 
it is reasonable to ask ‘what is the volume of fluid which crosses this surface in 
any given time?’ Suppose we select a very small portion of the surface having area 
5S. Note that 5S is a scalar. We can also define an associated vector 5S, which has 
magnitude 5S and whose direction is normal to this portion of the surface. The 
component of fluid velocity in the direction of 5S is given by the scalar product 
v- 6S where 6S is a unit vector in the direction of 5S. The volume of fluid cross- 
ing this portion each second is v- 6S. This is also known as the flux of v. If we 
wish to calculate the total flux through the surface S we must sum all such contribu- 
tions over the entire surface. This is found by integrating over the surface. We write 
this as 


volume flow per second = total flux = / v-ds 
s 


This is an example of a surface integral, so called because we must integrate over 
a surface. It will usually take the form of a double integral such as those discussed 
in Section 27.6.1. Technically, there are two integral signs, but for brevity these have 
been replaced by the single /, s where it is to be understood that the integral is to be 
performed over the surface S. In any specific problem care must be taken to ensure 
that the entire surface is included when the integration is performed. The double 
integrals evaluated in Section 27.6 are special cases of surface integrals, in which the 
surface is a plane region (the x—y plane). 

For example, suppose the velocity field is given by v = (x+ 2y)k. This represents 
a flow in the z direction whose magnitude varies with x and y. Suppose S is the 
plane surface defined by | < x < 4,0 < y < 2, z = O. This surface is shown in 
Figure 27.20. Note that the normal to this surface is parallel to the z axis and so we 
can write dS = dx dyk. Then 


v-dS = («+ 2y)k-dxdyk 
= («+ 2y) dxdy 


dS = dxdyk 


SS + 2y)k 


Figure 27.20 

The vector field v represents 
fluid flowing across the 
surface S. The integral over 
the surface of v - dS gives the 
volume flow per second 
across S. 


nN 
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The surface integral which gives the flux of v through S is 


y=2 eh 
/ / (x + 2y) dx dy 
y=0 ll 


Note how the limits of integration are chosen so that contributions from the whole 
surface are included. This integral has already been evaluated in Example 27.10 and 
found to be 27. This represents the volume flow per second through S. 


Engineering application 27.6 


Electric current density 


Electric current is the flow of electric charges. Figure 27.21 represents an electric 
current flowing across a surface S. 

If p(x, y, z) is the charge density at a point, that is the charge per unit volume, and 
v(x, y, Z) is the velocity of the charges, then the quantity J defined as 


= Wy 


is called the current density and has units of amperes per square metre. Suppose 
we select a very small portion of the surface having area 5S. Note that 5S is a scalar. 
Suppose we define a vector, 6S, which has magnitude 6S and which is normal to this 
portion of surface. The component of current density in the direction of 5S is given 
by the scalar product J - 5S. The current crossing this portion is J - 6S. If we wish to 
calculate the total current, J, through the surface S we must sum all such contribu- 
tions over the entire surface. This is found by integrating over the surface. We write 
this as 


current, [J = i J-dS 
Ss 


This is an example of a surface integral. 


surface S 


J=J( y,z) 


Figure 27.21 
The electric current through a surface is 
obtained using a surface integral. 


6S 


small portion of surface 6S 


Note that this surface integral involves integrating a current density with units of 
amperes per square metre over an area with units of square metres. Therefore the 
result is a quantity with units of amperes and so the units balance on both sides of 
the equation. 
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Engineering application 27.7 


Electric flux and Gauss’s law 


Electric charges produce an electric field, E, which can be visualized by drawing 
lines of force. Suppose we surround a region containing charges with a surface S. 
The flux of E through S is a measure of the number of lines of force passing through 
it. The flux is given by 


flux = [ B-as 
§ 


Gauss’s law states that the total flux out of any closed surface is proportional to the 
total charge enclosed. So consider a closed surface in free space, enclosing a volume 
V. The total charge enclosed in V is given by the volume integral 


total charge = if pdV 
Vv 


Gauss’s law then states: 


1 
fas = — | pav 
S €& Jv 


where €, is the permittivity of free space. The notation ¢ indicates that the surface is 
a closed surface. Note how a surface integral can be expressed as a volume integral. 
This relationship is generalized in the divergence theorem in the following section. 


Example 27.16 A vector field is given by v = i+ 4y°j + $2’k. 


Solution 


(a) Find div v. 
(b) Evaluate ie div vdV where V is the unitcubeO <x <1,0<y<10<z<l. 


(a) divv = 2x+y+z. 
(b) We seek ii (2x + y+ z)dV where V is the given unit cube. The small element of 


volume dV is equal to dzdydx. With appropriate limits of integration the integral 
becomes 


x=1 y=1 z=l 
/ / / (2x + y + z) dzdy dx 
x=0 y=0 z=0 


This is a triple integral of the kind evaluated in Section 27.6.2: 


1 


x=1 py=1 pz=l Bro y=1 2 
/ / i (2x+y+z) dz dy dx =i | [2xc+ 92+ 4 dy dx 
x=0 Jy=0 Jz=0 x=0 Jy=0 2 0 
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x=1 
= 2x + Idx 
x=0 


= [x +x] 
= 2 


Example 27.17 Evaluate §. v- dS where v = x°i + Sy°j + $z’k and where S is the surface of the unit 
cubeO <x <1,0<y<1,0<z< 1. The vector dS should be drawn on each of the 
six faces in an outward sense. 


Solution The cube is shown in Figure 27.22. We evaluate the surface integral over each of the six 
faces separately and then add the results. 
On surface A,x = 1 andO < y< 1,0 <z< 1. dS isa vector normal to this surface, 
drawn in an outward sense, and so we can write it as dy dzi. Then 


vedS = (vit jy°j+ 32°) + dydzi 
=x dydz 
= dydz since on A, x = | 


The required surface integral over A is then 


Z= y=l z=1 
ldydz = / [y]p dz 
=1 


On surface B,x = Oand0 < y < 1,0 <z< 1. dS isa vector normal to this surface and 
so we can write it as —dy dzi. Then v - dS becomes —x” dy dz = 0 since x = 0. Over this 
surface, the integral is zero. 

You should verify in a similar manner that over each of C and E the integral is 5, 
whilst integrals over D and F are zero. The total surface integral is then 2. 


on E, dS = dxdyk 


B de A 


on C, dS = dxdzj 


dy 


F d A 
A Figure 27.22 
* The integral is evaluated over 


on A, dS = dydzi all six surfaces. 


EXERCISES 27.7 


1 Given F = —yi — xk 


27.8 The divergence theorem and Stokes’ theorem 895 


evaluate $5 F- dS where S is the 3 Evaluate the volume integral Iv 6x dV where V is the 


surface of aunitcubeO <x <1,0<y<l, parallelepipedO <z<2,0<x<10<y<3. 
O<z<l. 
aS 4 Evaluate Iv 1+ zdV where V is a unit cube 
2 Given F = 6xy*zk evaluate JsF-dS where S is the 0<x<LO<y<l10<z<l. 


plane surface z= 2,0 <x < 2,0 < y < 2. Take the 
direction of the vector element of area to be k. 


Solutions 
1 0 
2 128 


27.8.1 


Example 27.18 


18 


ry Ww 
ee) 


THE DIVERGENCE THEOREM AND STOKES’ THEOREM 


There are a number of theorems in vector calculus which allow line, surface and volume 
integrals to be expressed in alternative forms. One of these, Green’s theorem, has been 
described in Section 27.6.3. This allows a line integral to be written in terms of a double 
integral. Now we give details of the divergence theorem and Stokes’ theorem. 


The divergence theorem 


The divergence theorem relates the integral over a volume, V, to an integral over the 
closed surface, S, which surrounds that volume, as illustrated in Figure 27.23. 

When calculating such surface integrals vectors drawn normal to the surface should 
always be drawn in an outward sense, that is away from the enclosed volume. Recall 
that when a surface is closed the symbol for a surface integral is $5. 


The divergence theorem: 


fv-as = [ aivvev 
s Vv 


Verify the divergence theorem for the vector field v = x7i + 4y°j + 427k over the unit 


Solution 


cubeO<x<1lO0O<y<l10O<z<l. 


Firstly we need to evaluate $5 v-dS where S is the surface of the cube. This integral has 
been calculated in Example 27.17 and shown to be 2. 

Secondly we need to calculate /, y div vdV over the volume of the cube. This has been 
done in Example 27.16 and again the result is 2. 


We have verified the divergence theorem that 45 v-dS= ip div vdV. 
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dS 


surface S 


open surface S 


outwardly drawn normal 


dS bounding curve C 
Figure 27.23 Figure 27.24 
The divergence theorem relates a volume integral An open surface S with bounding 
to a surface integral. curve C. 


27.8.2 Stokes’ theorem 


Stokes’ theorem relates the integral over an open surface, S, to a line integral around a 
closed curve, C, which bounds that surface such as that shown in Figure 27.24. 

When a surface is open we adopt the following convention when drawing vectors 
normal to the surface. When the direction of the normal vector has been specified, use 
the right-hand screw rule to obtain a sense of turning around the normal vector, as shown 
in Figure 27.24. Imagine now moving the circle which surrounds the normal vector along 
the surface until it just meets the curve C. Transfer its sense of turning to the curve C. 
When calculating the line integral around the curve C, it should be traversed in the same 
sense. 

Specifically consider the two open surfaces shown in Figure 27.25. In both cases we 
are considering cubes. In the first case the cube has no top face. In the second case the 
cube has no bottom face. Drawing vector dx dzj note that the sense of turning required by 
the right-hand screw rule is that shown. The curve C must be traversed in the directions 
shown. 

Recall that when a curve is closed the symbol for a line integral is fc 


dS = dx dyk 


J # 


dS = —-dx dyk 


Figure 27.25 
The right-hand screw rule gives the direction in which C must be traversed. 
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Stokes’ theorem: 


fvedr= [ cuntv-as 
€ Ss 


Here dr is an element of length along the curve C. Recall that dr = dxi + dyj + dzk. 
We have used the symbol dr rather than ds as we did in Section 27.2 to avoid confusion 
with the element of area dS. 


Example 27.19 Acube of side 2 units is constructed with five solid faces and one open face. It is located 
in the region defined byO <x < 2,0 < y < 2,0 < z < 2 and its open face is its top 
face, bounded by the curve C, lying in the plane z = 2, as shown in Figure 27.26. 
Throughout this region a vector field is given by 


v= («+yit O+20j+ @4+2k 
(a) Evaluate f.v- dr. 


(b) Evaluate curl v. 


(c) Evaluate f, , curl v dS, and verify Stokes’ theorem. 


Solution (a) The open face is highlighted in Figure 27.26. It is bounded by the curve C around 
which the line integral $c v- dr must be performed in the sense shown. In this plane 
z = 2 and dz = 0, and hence 


v-dr = (x+y) dx + (y + 2) dy 


We perform the line integral around C in four stages. 
On I, x = 0, dx = 0 and hence v- dr = (y + 2) dy. Noting that y varies from 0 to 
2, the contribution to the line integral is 


9 


2 y 2 
[ v+2e=[5 +29] =6 
0 2 0 


dS = —dxdyk 


Figure 27.26 
A cubical box with open top bounded by curve C. 
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On II, y = 2, dy = 0 and hence v- dr = (x + 2) dx. Noting that x varies from 0 
to 2, the contribution to the line integral is 


2 2 2 
0 2 0 


On II, x = 2, dx = 0 and hence v- dr = (y + 2) dy. Here y varies from 2 to 0. 
This contribution to the line integral is 


0 y 0 
[ v+2e=[5 +2] =-6 
2 2 2 


On IV, y = 0, dy = 0 and hence v- dr = xdx. Here x varies from 2 to 0. The 
contribution to the line integral is 


0 327° 
/ xdx=}]—| =-2 
2 2 2 
Putting all these results together we find 


fv-dr=6+6-6-2=4 
Cc 


i j k 
‘ 1 a a a ae oe 
(b) = dx dy dz | aa 


X+yyt7zx+Z 


(c) Now we calculate the surface integral which must be performed over the five solid 
surfaces separately. Refer to Figure 27.26. On surface A, the front face lying in the 
plane x = 2, dS = dydzi. Hence curl v- dS = —dydz. Then 


z=2 y=2 
[cuty -dS = / - —dy dz 
A z=0 Jvy=0 
z=2 
= / [—ylp dz 
z=0 
=> [ —2dz 
z=0 


= [—2z) 
=-4 


On B, the back face lying in the plane x = 0, dS = —dydzi. It follows that 
curl v- dS = dy dz. The required integral over B is 


z=2  py=2 
' / dydz = 4 
z=0 y=0 


On C, the right-hand face, dS = dx dzj. Hence curlv- dS = —dxdz. The required 
integral over C is 


z=2 x2 
i / —dxdz = —4 
z=0 x=0 
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Similarly, on D, the left-hand face, dS = —dxdzj. Hence curlv- dS = dxdz. The 


required integral over D is 


Z= x=2 
/ dxdz = 4 
2= x=0 


On surface F,, the base, z = 0 and dS = —dxdyk. Hence curlv- dS = dxdy. The 


required integral over F is 


y=2 px=2 
if dxdy =4 
y 


=0 x=0 


Recall that the top surface E is open, and so we have completed the surface integrals. 
Finally, putting these results together, 


S 


Note from part (a) that this equals 


EXERCISES 27.8 


1 Verify Stokes’ theorem for the field v = xyi + yzj 
where S is the surface of the cube 0 < x < 1, 
0<y<1,0<z< 1 and the face z = 0 is open. 


2 Ifv = 3xi+ xyzj — 5zk verify Stokes’ theorem 
where S is the surface of the cube 0 < x < 1, 
0<y<1,0<z< 1 and the face z = 0 is open. 


5 
3 Suppose v = x7i — 3xy?z7j + (7x + zk and S is the 
surface of a cube of side 2 units lying in the region 
0<x<2,0<y<2,0 <z < 2 with an open top in 
the plane z = 2. Verify Stokes’ theorem for this field. 
4 Consider a cube givenbyO <x <1,0<y< 1, 
Solutions 
When calculating the relevant line and surface integrals 3 
the sign of the result depends upon the orientation of 
the curve C. 4 
1 fv-dr=+3 5 
2 fv-dr=0 
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fouty-ds=-444-44444=4 


f v-dr and so we have verified Stokes’ theorem. 


1 <z< 2, above the surface z = 1. Suppose the 
surface z = | is the only open face. Let S be the 
surface of this cube. Verify Stokes’ theorem for the 
field v = yi+ (x — 2xz)j — xyk. Take C as the square 
with corners (0, 0, 1), (1,0, 1), (1, 1, 1), (0, 1, 1) in 
the plane z = 1. 


Consider that part of the positive octant, that is where 
x, y and z are all positive, bounded by the planes 
x=0,y=0,z=Oandx+y-+ 2z = 2. Assume that 
the tetrahedron so formed has three solid faces, and 
one open face on the plane x + y + 2z = 2. Verify 
Stokes’ theorem for the field v = x2i — 2xzk. Take C 
as the triangle (2, 0, 0), (0, 2, 0) and (0, 0, 1). 


fv-dr = +128 


sed 


IN INTEGRAL FORM 


In Section 26.7 Maxwell’s equations were stated as examples of the application of vector 
calculus. In fact, alternative integral forms of these equations are often more useful and 
these are given here for completeness. 


where D = electric flux density, and p is electric charge density. Note that the r.h.s. is the 
total charge enclosed by the volume V. This equation states that the total flux crossing a 
closed surface S' which encloses a volume V is equal to the total charge enclosed by the 
surface. This is also the integral form of Gauss’s law. (See Engineering application 27.7 


where B = magnetic flux density. This law states that the net magnetic flux crossing 
any closed surface is zero. So whilst charges can be thought of as sources or sinks of 
electric flux, there are no equivalent sources or sinks of magnetic flux. 


Note that in the theory of electrostatics, differentiating partially with respect to time 
always yields zero, and so this equation reduces to § E-dr = 0. This is a condition 


(el 
discussed in Section 27.5 and confirms that an electrostatic field is a conservative field. 


This is the integral form of Ampére’s circuital law. The closed curve C bounds an open 


oD 
surface S. A current with density ae + J flows through the surface S. 
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Equation 1 
gp -dS = i pdaVv 
s Vv 
which is obtained by letting D = €,E.) 
Equation 2 
$B -dS = 0 
s 
Equation 3 
f B+ dr = -5 [B-as 
C at Js 
Equation 4 
7] 
f Hear = +, | D-as+ [ J-as 
C at Js s 
EXERCISES 27.9 
1. Starting with Maxwell’s equation V - D = p, by 


integrating both sides over an arbitrary volume V and 
using the divergence theorem obtain Equation 1 
above. 


Starting with Maxwell’s equation V - B = 0, by 
integrating both sides over an arbitrary volume V and 
using the divergence theorem obtain Equation 2 
above. 


Starting with the equation V x E = 0 for static 
electric fields, use Stokes’ theorem to show that 
§ E-dr=0. 


aD 
4 Starting from V x H = oF + J and using Stokes’ 


theorem obtain Equation 4. 


0 
In magnetostatics (; = 7 , Ampére’s law (or the 


magnetic circuit law) states: 


f Hear=1 
Cc 


where J is the current enclosed by the closed path C. 


Obtain this law from Equation 4 and using 


fgJ-dS = 1. 
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REVIEW EXERCISES 27 


1 Find Ic (2x — y) dx + xy dy along the straight line 
joining (—1, —1) and (1, 1). 

2 Find f.(2x + y) dx + y? dy 
(a) along the curve y = x between (1, 1) and (2, 8), 

and 

(b) along the straight line joining these points. 

3 IfF = 3xyi + 2e* yj find fF - ds where C is the 
straight line y = 2x + 1 between (1, 3) and (4, 9). 

4 Show that the field F = (2x + l)yit+ (? +x + Djis 
conservative and find a suitable potential function @. 

5 Find fc (xi + 4xyj) «ds where C is a closed path in 
the form of a triangle with vertices at 
(0, 0), (3, 0) and (3, 5). 

6 Find (?=) fe; Gx — Sy) dedy. 

i =2 px= 

M) Find f=) {oy *” Qy + 5x) dedy. 

8 Find 
PD Roy Ep OF +9? +2) dedy de. 

9 Evaluate f'f,(4y +x?) dx dy where R is the interior of 
the square with vertices at (0, 0), 
(4, 4), (0, 4) and (4, 0). 

10 Evaluate fc e’ dx + e* dy where C is the boundary 
of the triangle formed by the lines y = x, y= 5 
and x = 0. By converting this line integral into a 
double integral verify Green’s theorem in the 
plane. 

11. The region R is bounded by the y axis and the lines 
y =xand y = 3 — 2x. 
(a) Sketch the region R. 
(b) Find the volume between the surface z = xy + 1 

and the region R. 

12. The region R is shown in Figure 27.27. 

The vector field F is given by 
F= yi + 3xyj 
Solutions 

2 

_E 

2 (a) 1030.5 (b) 1031.25 


(a) Evaluate fc F - ds where C is the curve enclosing 
the region R. 


(b) Verify Green’s theorem in the plane. 


(c) The surface, z(x, y), is given by z= 1+ x+ xy. 
Calculate the volume under the surface and above 
the region R. 


The region R is bounded by the x axis and the curve 
y=8+2x—2?. 


(a) Sketch R. 
(b) Evaluate /'/.3x + 2y dx dy 
Evaluate 
0 4 
(a) [oy Jz 3xydedy 
3 2 
(b) fs fo 4+x—ydydx 
(c) fy Sy? ty? dedy 
ss 3 
() fa fo ldyde 


Sketch the regions of integration of the double 
integrals in Question 14. 


If a, b, c and d are constants show that 


d b 
/ / f(x)g(y) de dy 


is identical to 


f sou] [fs0] 


Figure 27.27 
The region R for Question 12. 


1666.37 


g=xrytyx+y+e 
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INTRODUCTION 


Probability theory is applicable to several areas of engineering. One example is relia- 
bility engineering which is concerned with analysing the likelihood that an engineering 
system will fail. For most systems calculating the exact time of failure is not feasible 
but it is often possible to obtain a good estimate of whether or not a system will fail in 
a certain time interval. This is useful information to have for any engineering system, 
but it is vital if the failure of the system results in the possibility of injury or loss of life. 
Examples include the failure of high-voltage switchgear so that the casing becomes live, 
or electrical equipment producing a spark while being used underground in a mine. 

Probability theory is also used extensively in production engineering, particularly in 
the field of quality control. No manufacturing process produces components of exactly 
the same quality each time. There is always some variation in quality and probability 
theory allows this variation to be quantified. This enables some predictability to be in- 
troduced into the activity of manufacturing and gives an engineer the confidence to say 
components of a certain quality can be supplied to a customer. 

A final example is in the field of communication engineering. Communication chan- 
nels are subject to noise which is random in nature and so is most successfully modelled 
using probability theory. We will examine some of these concepts in more detail in this 
chapter. 
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Figure 28.1 


A: a component meets 
the specification. B = A: 
a component fails to 
meet the specification. 


INTRODUCING PROBABILITY 


Consider a machine which manufactures electronic components. These must meet a cer- 
tain specification. The quality control department regularly samples the components. 
Suppose, on average, 92 out of 100 components meet the specification. Imagine that a 
component is selected at random and let A be the outcome that a component meets the 
specification; let B be the outcome that a component does not meet the specification. 


92 
Then we say the probability of A occurring is Too = 0.92 and the probability of B 


8 
occurring is Too = 0.08. The probability is thus a measure of the likelihood of the 


occurrence of a particular outcome. We write 


P(A) = probability of A occurring = 0.92 
P(B) = probability of B occurring = 0.08 


We note that the sum of the probabilities of all possible outcomes is 1. The process of 
selecting a component is called a trial. The possible outcomes are also called events. 
In this example there are only two possible events, A and B. We can depict this situa- 
tion using a Venn diagram as shown in Figure 28.1. Recall from Section 5.2 that Venn 
diagrams are used to depict sets. In this diagram we are depicting the events A and B as 
sets. The set of all possible outcomes is called the sample space and is represented by 
the universal set E. The set A represents the event that a component meets the specifi- 
cation. The set B represents the event that a component fails to meet the specification. 
In this instance, when a trial takes place there are only two possible outcomes, either a 
component meets the specification or it does not; that is, either event A occurs or event 
B occurs. An alternative notation is to write 


B=A 
where A is said to be the complement of A. We could also write 
A=B 


When a bar appears over a set then we say, for example, ‘not A’, or ‘not B’. 

We now seek to define probability in a more formal way. Let F be an event. We wish 
to obtain the probability that E will occur, that is P(E), when a trial takes place. In order 
to do so we repeat the trial a large number of times, n. We count the number of times 
that event E occurs, denoted by m. We then conclude that 


m 
P(E) = — (28.1) 


The larger the number of trials that take place, the more confident we are of our estimate 
of the probability of E occurring. For example, consider the trial of tossing a coin. If 
we wish to calculate the probability of a head occurring then measuring the results of 
1000 tosses of the coin is likely to yield a more accurate estimate than measuring the 
results of 10 tosses of the coin. Various consequences flow from Equation (28.1). The 
number of times E occurs must be non-negative and less than or equal to the number of 
trials, that is 0 < m <n. So, 


0< P(E) <1 


28.2.1 
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If m = 0, corresponding to event E never occurring in n trials, then 


P(E) =~ =0 


We conclude that E is an impossible event. 
If m = n, corresponding to event E always occuring in n trials, then 


n 
PE)=—=1 


We deduce that F is a certain event. 

If P(E) > 0.5 then we conclude that E is more likely to occur than not. 

The approach to defining probability that we have adopted so far is essentially exper- 
imental. We carry out a series of trials and measure the probability of an event occurring. 
Sometimes it is possible to deduce the probability of an event purely from theoretical 
considerations. Consider again the trial of tossing a coin. If the coin is fair then defining 
H to be the event that the coin lands with the head facing up we can easily deduce that 


P(H) = 0.5 


because we intuitively recognize that a head has the same likelihood of occurring as a 
tail. Similarly, if we roll a fair die — die is the singular of dice — and E is the event that 


a 6 is obtained, then P(E) = e 
Suppose there are two possible outcomes, £, and E,, of a trial. Then 
P(E,) + P(E,) = 1 
If a trial has n possible outcomes, E,, E,,..., £,,, then 
P(E,) + PEA) +--+ PE) =1 


The sum of the probabilities of all possible outcomes is 1, representing the total 
probability. 


Compound event 


Consider the situation of rolling a fair die. It is possible to define a variety of events 
or outcomes associated with this trial. We choose to define two events, EF, and E,, as 
follows: 


E, : a1, 2,3 or 4 is obtained 


E, : an even number is obtained 
So, 


E, = {1, 2, 3, 4} 
E, = {2, 4, 6} 


Now the universal set is 


f= I1,2,3,4,5,6) 


which embraces all possible outcomes. 
Suppose we now define the event EF, as 


E, : E, occurs and E, occurs 
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E| E, 
> 
Figure 28.2 


E, 1 E, corresponds to 
the compound event E, 
occurs and E, occurs. 


The event EF, is known as a compound event and only occurs when both E, and E, occur 
at the same time. Using set notation we can write 
E,=E, OE, 
= {1, 2,3,4} 1/9 {2, 4, 6} 
= {2,4} 
So E, occurs when a 2 or a 4 is rolled. Hence E, ME, can occur in two out of six equally 
likely ways. Therefore, 
P(E;) = P(E ee 
a0 phat = — = 3 
Figure 28.2 shows a Venn diagram for this example. 
We could also define the compound event E, as 
E, : E, occurs or E, occurs 


The event £, occurs when either E, occurs or E, occurs, or both. We can write 


E, = E, UE, 
= {1, 2,3, 4} U {2, 4, 6} 
= {1, 2,3, 4, 6} 


So, 


5 
P(E,) = P(E, UE) = = 


If events A and B both occur then this compound event is denoted AM B. 
If either event A or event B occurs then this compound event is denoted A U B. 


Engineering application 28.1 


Power supply to a computer 


Computers used in the control of life-critical systems often have two separate power 
supplies. If one power supply fails then the other takes over. Let E, be the event that 
power supply | fails, and let E, be the event that power supply 2 fails. For the power 
supply to the computer to fail completely the compound event E, and E, must occur, 
that is E, M E, occurs. 

A system such as this is said to have dual modular redundancy. One of the 
difficulties with such a system is in determining which system is operating correctly 
and which unit is switched to being the active one. In some situations it is necessary 
to have three systems operating in parallel and then using a voting system to decide 
on the correct output. This way the decision is taken based on the majority of the 
three systems, that is, when at least two out of the three agree. 
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Engineering application 28.2 


Quality control on a factory production line 


Quality control is an important aspect of factory production. So much so that some 
engineers spend their whole careers in this field and are known as quality control en- 
gineers. The fundamental goal is to ensure that manufactured goods are produced to 
a specified standard in spite of variations in the finished product that are an inevitable 
part of the manufacturing process. Consider the following problem. 

Machines A and B make components, which are then placed on a conveyor belt. 
Of those made by machine A, 93% are acceptable. Of those made by machine B, 95% 
are acceptable. Machine A makes 60% of the components and machine B makes the 
rest. Find the probability that a component selected at random from the conveyor 
belt is 


(a) made by machine A 

(b) made by machine A and acceptable 
(c) made by machine B and acceptable 
(d) made by machine B and unacceptable 


Solution 


(a) We are given that 60% of the components are made by machine A. Converting 
this percentage to a decimal number gives 


P(component is made by machine A) = 0.6 


(b) We know that 60% of the components are made by machine A and 93% of these 
are acceptable. Converting these percentages to decimal numbers we have 


3} 


P(component is made by machine A and is acceptable) = —~ x — 
100 ~=100 


= 000) x O88 = O.o58 


(c) We know that 40% of the components are made by machine B and 95% of these 
are acceptable. So, 


95 


P(component is made by machine B and is acceptable) = —~ x — 
100 ~=100 


= 0.40 x 0.95 = 0.38 


(d) We know that 40% of the components are made by machine B and 5% of these 
are unacceptable. So, 


40 5 
P(component is made by machine B and is unacceptable) = —~ x — 


= 0.40 x 0.05 = 0.02 100 100 


An alternative way to solve this problem is by constructing a tree diagram. To 
do so, consider the case when 1000 components are picked from the conveyor 
belt. We know that machine A makes 60% of the components and so 600 of these 
components, on average, will be made by machine A and the other 400 must 
therefore be made by machine B. This is shown in Figure 28.3. 
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1000 
ry, 400 n, 
i, 2 ace 
ee Ready 
400 20 
Figure 28.3 Figure 28.4 
Partial tree diagram for Engineering Full tree diagram for Engineering 
application 28.2. application 28.2. 


Of the 600 components made by machine A, we know that 93% of these are ac- 
ceptable, that is 93% of 600 = 558. So 600 — 558 = 42 are unacceptable. 

Of the 400 components made by machine B, we know that 95% of these are ac- 
ceptable, that is 95% of 400 = 380. So 400 — 380 = 20 are unacceptable. 

We can now complete a full tree diagram for this problem, which is shown in 
Figure 28.4. 

Using the tree diagram it is straightforward to calculate the required probabilities. 


(a) The probability a component is made by machine A is found by noting that 600 
of the 1000 components are made by machine A, that is 

600 

1000 — 

(b) The probability a component is made by machine A and is acceptable is found by 
noting that 558 of the 1000 components are made by machine A and acceptable, 
that is 


P(component is made by machine A) = 


P(component is made by machine A and acceptable) = 000 ~ 0.558 
: : 380 
(c) P(component is made by machine B and acceptable) = 000 = 0.38 
(d) P(component is made by machine B and unacceptable) = 1000 02 
EXERCISES 28.2 

1_ A fair die is rolled. The events E,,..., E; are defined (c) P(E, NEs) 
as follows: (d) P(E, NE3) 
E,: an even number is obtained (ce) P(E3 N Es) 


E,: an odd number is obtained 


E;: a score of less than 2 is obtained 


2 A trial can have three outcomes, E,, E, and E3. Ey 
and E, are equally likely to occur. EF; is three times 


E4: a3 is obtained more likely to occur than Z,. Find P(E,), P(E,) and 
E;: a score of more than 3 is obtained P(E3). 
Find 3 Acomponent is made by machines A and B. Machine 
(a) P(E,), P(E), P(E3), P(E), P(Es) A makes 70% of the components and machine B 


(b) P(E, NE) 


makes the rest. For both machines, the proportion of 
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acceptable components is 90%. Find the probability 
that a component selected at random is 


(a) unacceptable 
(b) acceptable and is made by machine A 


(c) unacceptable and is made by machine B 


Machines A, B and C make components. Machine A 


5 Circuit boards are made by machines A, B, C and D. 


Machine A makes 15% of the components, machine 
B makes 30%, machine C makes 35% and machine D 
makes the remainder. The probability that a board is 
acceptable is 0.93 when made by machine A, 0.96 
when made by machine B, 0.95 when made by 
machine C and 0.93 when made by machine D. A 


makes 20% of the components, machine B makes board is picked at random. Calculate the probability 
30% of the components and machine C makes the that it is 


rest. The probability that a component is faulty is 0.07 
when made by machine A, 0.06 when made by 
machine B and 0.05 when made by machine C. A 
component is picked at random. Calculate the 
probability that the component is 


(a) made by machine D 
(b) made by machine A and is acceptable 


(c) made by machine B and is unacceptable 


(a) made by machine C (d) made by machine C and is acceptable 


(b) made by machine A and is faulty (e) made by machine D and is unacceptable 
(c) made by machine B and is not faulty ; 

(ay: nade by machine © aid ioaulty (f) unacceptable and made by machine C 
(g) Ina batch of 1000 boards, how many would be 


expected to be acceptable and made by 


(e) made by machine A and is not faulty 


(f) faulty and is not made by machine B 


machine D? 
Solutions 
Oj) ee | Hee h(a) 05 (b) 0.014 (c) 0.282 
ore (d) 0.025 (e) 0.186 —(f).:0.039 
iG ~ 5 (a) 0.2 ~~ (b) 0.1395.) 0.012 
2) 0.2,0.2,06 (d) 0.3325 (e) 0.014 ~—() 0.0175 
3 (a) 0.1 (b) 0.63 (©) 0.03 ee 


MEL MutuaLty Exclusive EVENTS: THE ADDITION 
LAW OF PROBABILITY 


Consider a machine which manufactures car components. Suppose each component falls 
into one of four categories: 


Table 28.1 

The probability of a car top quality 

component falling into standard 

one of four categories. substandard 

Category Probability reject 

top quality ons After many samples have been taken and tested, it is found that under certain specific 
standard 0.65 


siaborandanl 0.12 conditions the probability that a component falls into a category is as shown in Ta- 

reject 0.05 dle 28.1. The four categories cover all possibilities and so the probabilities must sum 
to 1. If 100 samples are taken, then on average 18 will be top quality, 65 of standard 
quality, 12 substandard and 5 will be rejected. 
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Example 28.1 


Solution 


Figure 28.5 


E, and E j are mutually 
exclusive events and so 
are depicted as disjoint 
sets. 


Using the data in Table 28.1 calculate the probability that a component selected at ran- 
dom is either standard or top quality. 


On average 18 out of 100 components are top quality and 65 out of 100 are standard 
quality. So 83 out of 100 are either top quality or standard quality. Hence the probability 
that a component is either top quality or standard quality is 0.83. The solution may be 
expressed more formally as follows. Let A be the event that a component is top quality. 
Let B be the event that a component is standard quality. 


P(A) = 0.18 P(B) = 0.65 
Then, 

P(A UB) = 0.18 + 0.65 = 0.83 
Note that in this example 


P(A UB) = P(A) + P(B) 


In Example 28.1 the events A and B could not possibly occur together. A component is 
either top quality or standard quality but cannot be both. We say A and B are mutually 
exclusive because the occurrence of one excludes the occurrence of the other. The result 
applies more generally. 


If the occurrence of either of events E; or E; excludes the occurrence of the other, 
then £; and E; are said to be mutually exclusive events. 


If E; and £; are mutually exclusive we denote this by 
E,NE,=9 


We use ¢ to denote the empty set, that is a set with no elements. In effect we are stating 
that the compound event £; M E; is an impossible event and so will never occur. On a 
Venn diagram E; and E; are shown as disjoint sets (see Figure 28.5). Suppose that E,, 
E,, ..., E, are n events and that in a single trial only one of these events can occur. The 
occurrence of any event, E,, excludes the occurrence of all other events. Such events are 
mutually exclusive. 


For mutually exclusive events the addition law of probability applies: 
PAVE; O/B OF coo OIE.) = IMAGE, U8, Wooo W/E.) 
= P(E,)+ P(E.) +--+ + PE,) 
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Engineering application 28.3 


Electrical component reliability 


Electrical components fail after a certain length of time in use. Their lifespan is not 
fixed but subject to variation. It is important to be able to quantify the reliability of 
these components. Consider the following problem. 

The lifespans of 5000 electrical components are measured to assess their reliabil- 
ity. The lifespan (L) is recorded and the results are shown in Table 28.2. Find the 
probability that a randomly selected component will last 


(a) more than 3 years 
(b) between 3 and 5 years 
(c) less than 4 years 


Table 28.2 

The lifespans of 5000 electrical components. 
Lifespan of component (years) Number 
Ls 500 
4<L<5 2250 
3=<L<4 1850 
LSB 400 

Solution 


We define events A, B, C and D: 


A: the component lasts more than 5 years 

B: the component lasts between 4 and 5 years 
C: the component lasts between 3 and 4 years 
D: the component lasts 3 years or less 


500 2250 1850 
PA Oleh) Oo erie) = == 0.47 
5000 5000 5000 
400 
P0108 
5000 


The events A, B, C and D are clearly mutually exclusive and so the addition law 
may be applied. 


(a) P(component lasts more than 3 years) = P(A UBUC) 
= P(A) + P(B) + PC) 
= 0.1 + 0.45 + 0.37 
= (OL 
There is a 92% chance a component will last for more than 3 years. 
(b) P(component lasts between 3 and 5 years) = P(B UC) 
=P(B)+P(C) 
= 0.45 + 0.37 
=O 
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(c) P(component lasts less than 4 years) = P(C UD) 
=P(C)+P(D) 
= 0.37 + 0.08 
= 0.45 


Engineering application 28.4 


Calculating the probability a component is acceptable on 
a factory production line with several machines 


Sometimes a factory production line may be fed by several machines that make com- 
ponents. It is important to be able to calculate the overall quality of the product that 
emerges given knowledge of the performance of the individual machines. Consider 
the following problem. 

Machines A and B make components. Machine A makes 60% of the components. 
The probability that a component is acceptable is 0.93 when made by machine A 
and 0.95 when made by machine B. A component is picked at random. Calculate the 
probability that it is 


(a) made by machine A and is acceptable 
(b) made by machine B and is acceptable 
(c) acceptable 


Solution 


We have already looked at this problem in Engineering application 28.2. Figure 28.4 
shows the tree diagram for the problem. 


558 
(a) P (component is made by machine A and is acceptable) = 7000 = 585 


380 
(b) P (component is made by machine B and is acceptable) = 1000 = OB. 


(c) Note that the events described in (a) and (b) are mutually exclusive and so the 
addition law can be applied. 
'P(component is acceptable) = P(component is made by machine A and is acceptable) 
+ P(component is made by 
machine B and is acceptable) 
= 0.558 + 0.38 
= 0.938 
We can now obtain this probability directly from the tree diagram. We see that 
558 + 380 = 938 components are acceptable and so 


938 
P(component is acceptable) = —— = 0.938 
1000 


1 


1 
2 
3 


EXERCISES 28.3 


A component is classified as one of top quality, 
standard quality or substandard, with respective 
probabilities of 0.07, 0.85 and 0.08. Find the 
probability that a component is 

(a) either top quality or standard quality 

(b) not top quality 


The lifespan (L) of each of 2000 valves is measured 
and given in Table 28.3. 


Table 28.3 
The lifespans of 2000 valves. 


Lifespan (hours) Number 


L > 1000 119 
800 < L < 1000 520 
600 < L < 800 931 
400 < L < 600 230 
L < 400 200 


Calculate the probability that the lifespan of a valve is 
(a) more than 800 hours 

(b) less than 600 hours 

(c) between 400 and 800 hours 

Chips are manufactured by machines A and B. 


Machine A makes 65% of the chips and machine B 
makes the remainder. The probability that a chip is 


Solutions 


(a) 0.92 (b) 0.93 


(a) 0.3195 (b) 0.215 (c) 0.5805 


(a) 0.0195 
(d) 0.0175 


(b) 0.6675 
(e) 0.037 


(c) 0.3695 
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faulty is 0.03 when made by machine A and 0.05 
when made by machine B. A chip is selected at 
random. Calculate the probability that it is 

(a) faulty and made by machine A 

(b) faulty or made by machine A 

(c) faulty or made by machine B 

(d) faulty and made by machine B 

(e) faulty 

Components are made by machines A, B, C and D. 
Machine A makes 25% of the components, machine 
B makes 16% of the components, machine C makes 
21% of the components and machine D makes the 
remainder. The probability that a component is 
acceptable is 0.92 when made by machine A, 0.90 
when made by machine B, 0.96 when made by 
machine C and 0.93 when made by machine D. A 
component is picked at random. Calculate the 
probability that it is 

(a) made by machine A or machine C 

(b) made by machines A, B or D 

(c) acceptable and made by machine B 

(d) acceptable and made by machine C 


(e) acceptable and made by machine B or acceptable 
and made by machine C 


(f) acceptable or made by machine A 


(g) acceptable 


(a) 0.46 (b) 0.79 (c) 0.144 
(d) 0.2016 = (e) 0.3456 ~—(f):0.949 
(g) 0.929 


COMPLEMENTARY EVENTS 


Consider an electronic circuit. Clearly, either the circuit works correctly or it does not 
work correctly. Let A be the event that the circuit works correctly, and let B be the event 
that the circuit does not work correctly. Either A or B must happen when the circuit is 
tested, and one excludes the other. The events A and B are said to be complementary. 
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The Venn diagram corresponding to this situation is identical to that in Figure 28.1 where 
the set A corresponds to the event A, and the event B is represented by A. 


Two events, A and B, are complementary if they are mutually exclusive and in a 
single trial either A or B must happen. 


Hence, if A and B are complementary then 
P(A) + P(B)=1 
and 
P(ANB)=0 
It is usual to denote complementary events as A and A. For example, 


: the component is top quality 

: the component is not top quality 
in>6 

in<o6 


: the circuit has failed to meet the specification 


AQ Bw FS 


: the circuit has met the specification 


Recall from Engineering application 28.3 that D was the event the component lasts 
3 years or less, and this had probability P(D) = 0.08. We could say 


D: the component lasts 3 years or less 


D : the component lasts more than 3 years 
D and D are complementary events and so 


P(D) + P(D) =1 
P(D) = 1—P(D) = 1—0.08 = 0.92 


Engineering application 28.5 


Testing multiple electronic components 
Three transistors are tested. The probability that none of them works is 0.03. What 


is the probability that at least one transistor works? 


Solution 
We define the events A and A: 


A : all three transistors fail 


A : not all three transistors fail 
A and A are complementary events and so 


P(A) =0.03 P(A) =1—P(A) = 0.97 
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To say that not all three transistors fail is the same as saying that one or more tran- 
sistors work. So we could say 


A : at least one transistor works 


Hence the probability that at least one transistor works is 0.97. 


EXERCISES 28.4 
1. Which pairs of events are complementary? 2 Events A, B, C and D are defined. 
A: the component is reliable A: the lifespan is 90 days or less 
B: there is only one component B: the machine is reliable 
C: there are less than two components C: all components have been tested 
D: more than two components are D: at least three components from 
reliable 


the sample are unreliable 


E: the component is unreliable 


. State the events A, B, C and D. 
F: there is more than one component 


G: most of the components are 


unreliable 


Solutions 


M AandE;CandF B: the machine is unreliable 
C: some components have not been tested 


2 A: the lifespan is more than 90 days D: two or fewer components are unreliable 


GET concerts FROM COMMUNICATION THEORY 


Communication engineers find it useful to quantify information for the purposes of anal- 
ysis. In order to do so a very restricted view of information is used. Information is seen in 
terms of knowledge of an event occurring. A highly improbable event occurring consti- 
tutes more information than an almost certain event occurring. This correlates to some 
extent with human experience as people tend to be much more interested in hearing 
about unlikely events. The information, /, associated with an event, is defined by 


I=-—logp O0<p<l 


where p = probability of an event occurring. Notice that p = 0 is excluded from the 
domain of the function as the logarithm is not defined at 0. In practice this is not a 
problem because an event with zero probability never occurs. Often logarithms to the 
base 2 are used when calculating information as in many cases information arrives in 
the form of a ‘bit stream’ consisting of a series of binary numbers. For this case J has 
units of bits and is given by 


I = — log, p 


A formula for evaluating logarithms to the base 2 is given in Chapter 2. 
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Engineering application 28.6 


Information content of a binary data stream 


Suppose that a computer generates a binary stream of data and that 1s and Os occur 
with equal probability, that is P(0) = P(1) = 0.5. Calculate the information per 


binary digit generated. 
Solution 
Here, p = 0.5 whether the binary digit is 0 or 1, so 
—] 0.5 
(STS = ee se 
log, 2 


Engineering application 28.7 


Information content of an alphabetic character stream 


Suppose that a system generates a stream of upper case alphabetic characters and that 
the probability of a character occurring is the same for all characters. Calculate 


(a) the information associated with the character G occurring 


(b) the information associated with any single character. 


Solution 


1 1 —] 1/26 
(a) P(G occurring) = —, = lbs, = 08)9(1/26) = 4.70 bits 
26 26 log) 2 


(b) All characters are equally likely to occur and so 


1\ log, (1/26 
(los, 2 SE ages 
26 log, 2 


Often a series of events may occur that do not have the same probability. For example, 
if a stream of alphabetic characters is being generated then it is likely that some charac- 
ters will occur more frequently than others and so have a higher probability associated 
with them. For this situation it becomes convenient to introduce the concept of average 
information. Given a source producing a set of events 


By, By fay ice ey EB, 


ui 


with probabilities 


P\> Pa) P39+++> Pry 


then for a long series of events the average information per event is given by 
i=n 


H=— > Pp; log, p; bits 


i=1 


H is also termed the entropy. 
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Engineering application 28.8 


Entropy of a signal consisting of three characters 


A source produces messages consisting of three characters, A, B and C. The probabil- 
ities of each of these characters occurring is P(A) = 0.2, P(B) = 0.5, P(C) = 0.3. 
Calculate the entropy of the signal. 


Solution 
H = —0.2 log, (0.2) — 0.5 log, (0.5) — 0.3 log, (0.3) = 1.49 bits 


Engineering application 28.9 


Entropy of a binary data stream 


A source generates binary digits 0, 1, with probabilities P(O) = 0.3 and P(1) = 0.7. 
Calculate the entropy of the signal. 


Solution 
H = —0.3 log, (0.3) — 0.7 log, (0.7) = 0.881 bits 


Note that in Engineering application 28.9, on average, each binary digit only carries 
0.881 bits of information. In fact the maximum average amount of information that can 
be carried by a binary digit occurs when P(0) = P(1) = 0.5, as seen in Engineering 
application 28.6. For this case H = 1. 

For any data stream, the maximum average amount of information that can be carried 
by a digit occurs when all digits have equal probability, that is H is maximized when 


Py = Po >= .P3 5°" = Pn 


H is maximized when p, = p, = p; = -:: = p,. The maximum value of H is 
denoted H,,,.. 


When the probabilities are not the same then one way of viewing the reduction in 
H is to think of the likely event being given too much of the signalling time given its 
lower information content. It is interesting to explore the two limiting cases, that is (a) 
P(O) = 0, P(1) = 1; (b) P(O) = 1, PC.) = O. In both cases it can be shown that H = 0. 
However, on examination this is reasonable because a continuous stream of 1s does not 
relay any useful information to the recipient and neither does a continuous stream of Os. 

The fact that some streams of symbols do not contain as much information as other 
streams of the same symbols leads to the concept of redundancy. This allows the effi- 
ciency with which information is being sent to be quantified and is defined as 


maximum entropy — actual entropy 


redundancy = : 
maximum entropy 


A low value of redundancy corresponds to efficient transmission of information. 
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Engineering application 28.10 


Redundancy of a binary data stream 


Consider the source of binary digits examined in Engineering application 28.6 and 
28.9. The maximum entropy for a binary stream is | bit per binary digit. Calculate 
the redundancy in each case. 


Solution 
For Engineering application 28.6 
1-1 
redundancy = aire 0 
For Engineering application 28.9 


1 — 0.881 
redundancy = =a i 0.119 


Engineering application 28.11 


Redundancy of a character data stream 


A stream of data consists of four characters A, B, C, D with probabilities 0.1, 0.3, 
0.2, 0.4, respectively. Calculate the redundancy. 


Solution 
It can be shown that the maximum entropy, H,,,,, corresponds to the situation in 
which the probability of each symbol is the same, that is 0.25. 
Fnax = 4 X (—0.25 log, (0.25)) = 2 bits 
The actual entropy, H,,,, is given by 


H,., = —(.1 log, (0.1) + 0.3 log, (0.3) + 0.2 log, (0.2) + 0.4 log, (0.4)) 


= 1.846 bits 


2 — 1.846 
redundancy = oe 0.0770 


In the examples we have examined so far we have used the bit as the unit of information 
because the most common form of digital signalling uses binary digits. When there are 
only two possible events it is possible to represent an event by a single binary digit. 
However, if there is a larger number of possible events then several binary digits are 
needed to represent a single event. When calculating values for information and entropy 
in these examples an assumption was made that each event was represented by binary 
sequences or codes of the same length. It is only possible to do this efficiently if the 
number of events is a power of 2, that is 2, 4, 8, 16, .... In practice this problem does not 
arise because it is more common to produce codes that have a small number of binary 
digits for likely events and a long number of binary digits for unlikely events. This allows 
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the redundancy of a data stream to be reduced. The design of such codes is known as 
coding theory. One complication is that most streams of data are not transmitted with 
100% accuracy as a result of the presence of noise within the communication channel. It 
is often necessary to build extra redundancy into a code in order to recover these errors. 


1 


1 
2 
3 


EXERCISES 28.5 


A source generates six characters, A, B, C, D, E, F, 
with respective probabilities 0.05, 0.1, 0.25, 0.3, 0.15, 
0.15. Calculate the average information per character 
and the redundancy. 


A visual display unit has a resolution of 600 rows by 
800 columns. Ten different grey levels are associated 
with each pixel and their probabilities are 0.05, 0.07, 
0.09, 0.10, 0.11, 0.13, 0.12, 0.12, 0.11, 0.10. Calculate 
the average information content in each picture frame. 


A source generates five characters A, B, C, D and E 
with respective probabilities of 0.1, 0.15, 0.2, 0.25 
and 0.3. 


(a) Calculate the information associated with the 
character B. 

(b) Calculate the entropy. 

(c) Calculate the redundancy. 

A data stream comprises the characters A, B, C, D 


and E with respective probabilities of 0.23, 0.16, 0.11, 
0.37 and 0.13. 


Solutions 


2.3905, 0.0752 
3.2790 
(a) 2.7370 


(b) 2.2282 (c) 0.0404 


(a) Calculate the information associated with the 
character B. 


(b) Calculate the information associated with the 
character D. 


(c) Calculate the entropy. 
(d) Calculate the redundancy. 
A data stream comprises the characters A, B, C, D 


and E with respective probabilities of 0.12, 0.21, 0.07, 
0.31 and 0.29. 


(a) Which character carries the greatest information 
content? 


(b) Which character carries the least information 
content? 


(c) Calculate the information associated with the 
letter D. 


(d) Calculate the entropy. 


(e) Calculate the redundancy. 


(a) 2.6439  (b) 1.4344 — (c)- 2.1743 
(d) 0.0636 

(a) C (b) D (c) 1.6897 
(d) 2.1501 — (e) 0.0740 


CONDITIONAL PROBABILITY: THE MULTIPLICATION LAW 


Suppose two machines, M and N, both manufacture components. Of the components 
made by machine M, 92% are of an acceptable standard and 8% are rejected. For ma- 
chine N, only 80% are of an acceptable standard and 20% are rejected. Consider now 
the event E: 


E:acomponent is of an acceptable standard 


If all the components are manufactured by machine M then P(E) = 0.92. However, 
if all the components are manufactured by machine N then P(E) = 0.8. If half the 
components are manufactured by machine M and half by machine N then P(E) = 0.86. 
To see why this is so consider 1000 components. Of the half made by machine M, 92% x 
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500 = 460 will be of an acceptable standard. Of the half made by machine N, 80% x 
500 = 400 will be acceptable. Hence 860 of the 1000 components will be acceptable 


and so P(E) = 000 ~ 0.86. Clearly, there are distinct probabilities of the same event; 


the probability changes as the conditions change. This is intuitive and leads to the idea 
of conditional probability. 
We introduce a notation for conditional probability. Define events A and B by 


A: the component is manufactured by machine M 


B: the component is manufactured by machine N 


Then the probability that a component is of an acceptable standard, given it is manufac- 
tured by machine M, is written as P(E|A). We read this as the conditional probability 
of E given A. Similarly P(E|B) is the probability of EF happening, given B has already 
happened. 


P(E|A) =0.92  P(E|B) = 0.8 


To be pedantic, all probabilities are conditional since the conditions surrounding any 
event can change. However, for many situations there is tacitly assumed a definite set 
of conditions which is always satisfied. The probability of an event calculated under 
only these conditions is known as the unconditional probability. If further well-defined 
conditions are attached, the probability is conditional. 


Engineering application 28.12 


Production line product fed by two machines 


Machines M and N manufacture a component. The probability that the component is 
of an acceptable standard is 0.95 when manufactured by machine M and 0.83 when 
manufactured by machine N. Machine M supplies 65% of components; machine N 
supplies 35%. A component is picked at random. 

(a) What is the probability that the component is of an acceptable standard? 


(b) What is the probability that a component is of an acceptable standard and is made 
by machine M? 


(c) What is the probability that the component is of an acceptable standard given it 
is made by machine M? 


(d) What is the probability that the component was made by machine M? 


(e) What is the probability that the component was made by machine M given it is 
of an acceptable standard? 


(f) The component is not of an acceptable standard. What is the probability that it 
was made by machine N? 


Solution 

We define the events: 
A: the component is manufactured by machine M 
B: the component is manufactured by machine N 
C: the component is of an acceptable standard 


(a) 


(b 


we 


(c 


wm 


(d 
( 


a 
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Consider 1000 components. Then 650 are manufactured by machine M, 350 by 
machine N. Of the 650 manufactured by machine M, 95% will be acceptable, that 


is 650 x = = 617.5. Of the 350 manufactured by machine N, 83% will be ac- 


ceptable, that is 350 x T00 = 290.5. So on average in 1000 components, 617.5+ 
290.5 = 908 will be acceptable, that is P(C) = 0.908. 


From part (a) we know that out of 1000 components 617.5 will be made by 


617.5 
machine M and be of an acceptable standard. Hence P(A MC) = = 


tl 
0.6175. o 


We require P(C|A). The probability a component is acceptable given it is man- 
ufactured by machine M is 


P(C\A) = 0.95 


Machine M makes 65% of components, that is P(A) = 0.65. 


We require P(A|C). Consider again the 1000 components. On average 908 are 
acceptable. Of these 908 acceptable components, 617.5 are manufactured by ma- 
chine M and 290.5 by machine N. We are told the component is acceptable and 
so we must restrict attention to the 908 acceptable components. So, out of 908 ac- 
617.5 


ceptable components, 617.5 are made by machine M, that is P(A|C) = coe 


0.68. 


We require P(B|C). Consider 1000 components. Machine M manufactures 
650 components of which 617.5 are acceptable and hence 32.5 are unaccept- 
able. Machine N manufactures 350 components of which 290.5 are acceptable 
and 59.5 are unacceptable. There are 92 unacceptable components of which 59.5 
were made by machine N. The probability of the component being made by ma- 
chine N given it is unacceptable is 


P(BIC) = a = 0.647 


that is, almost 65% of unacceptable components are manufactured by 
machine N. 


As an alternative method of solution, we can represent the information via a tree 


diagram. This is shown in Figure 28.6. 


650 
ae 
1000 
—~ 
350 


617.5 (acceptable) 


32.5 (unacceptable) 


290.5 (acceptable) 


Figure 28.6 


Tree diagram for Engineering application 
59.5 (unacceptable) 28 12. 


or 
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(a) There are 617.5 + 290.5 = 908 acceptable components, so 
P(component is acceptable) = 0.908 
(b) There are 617.5 components which are acceptable and made by machine M. So 


P(component is acceptable and made by machine M) 
SICA) =O61S 


(c) Of the 650 components made by machine M, 617.5 are acceptable, and so 


P(component is acceptable given it is made by machine M) = P(C|A) 
(O3) 
= SS = 05 
650 


(d) Of the 1000 components, 650 are made by machine M, so 
P(component is made by machine M) = P(A) = 0.65 


(e) There are 908 acceptable components. Of these, 617.5 are made by machine M. 
Hence 


P(component is made by machine M given it is acceptable) = P(A|C) 
Ses 0.68 
ec0cae 
(f) There are 32.5 + 59.5 = 92 unacceptable components. Of these 92, 59.5 are 


made by machine N. Thus 


P(component is made by machine N given it is not acceptable) = P(BIC) 
59.5 


= SS} = O47 
OD 


28.6.1 The multiplication law 


Consider events A and B for which AN B 4 Gas shown in Figure 28.7. Suppose we 
know that event A has occurred and we seek the probability that B occurs, that is P(B|A). 
Knowing event A has occurred we can restrict our attention to the set A. Event B will 
occur if any outcome is in A M B. Hence, 
P(ANB) 

P(A) 


The multiplication law of probability states: 
Ce) P(AN B) = P(A)P(BIA) 


Since the compound event ‘A and B’ is identical to “B and A’ we may also say 


P(BIA) = 


P(ANB) = P(BNA) = P(B)P(A|B) 
Figure 28.7 
A and B are not Consider Engineering application 28.12(e). We require the probability that the compo- 
mutually exclusive. nent was manufactured by machine M, given it is acceptable. This is P(A|C). Now 
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P(ANC) = P(CNA) = P(C)P(A|C) 
P(ANC 
P(A|C) = PANG) 
P(C) 
Now P(ANC) is the probability that the component is manufactured by machine M and 
is acceptable. This is known to be 0.6175, using Engineering application 28.12(b). Also 
from Engineering application 28.12(a) we see P(C) = 0.908. Hence, 


0.6175 


PIC) = F008 


= 0.68 


Engineering application 28.13 


Reliability of manufactured components 


A manufacturer studies the reliability of a certain component so that suitable guar- 
antees can be given: 83% of components remain reliable for at least 5 years; 92% 
remain reliable for at least 3 years. What is the probability that a component which 
has remained reliable for 3 years will remain reliable for 5 years? 


Solution 

We define the events: 
A: a component remains reliable for at least 3 years 
B: a component remains reliable for at least 5 years 


Then P(A) = 0.92, P(B) = 0.83. Note that these are unconditional probabilities. We 
require P(B|A), a conditional probability. 


P(ANB) = P(A)P(BIA) 


AM Bis the compound event a component remains reliable for at least 3 years and it 
remains reliable for at least 5 years. Clearly this is the same as the event B. So 


P(ANB) = P(B) 


Hence 
P(B) = P(A)P(BIA) 
P(B) 0.83 
P(B\A) = P(A) =A 0.90 


that is, 90% of the components which remain reliable for 3 years will remain reliable 
for at least 5 years. 

Alternatively, a tree diagram is shown in Figure 28.8. Starting with 100 compo- 
nents, 92 remain reliable 3 years later. Of these 92, 2 years later, 83 are still reliable. 
So 


83 
P(component is reliable after 5 years, given it is reliable after 3 years) = 5) 
= 0.90 
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1 


3 years 


5 years 


92 (reliable) ae 83 (reliable) 
ee 9 (unreliable) 


100 


SS 8 (unreliable) 


Figure 28.8 


17 (unreliable) 


8 (unreliable) 


Tree diagram for Engineering application 28.13. 


EXERCISES 28.6 


A component is manufactured by machines 1 and 2. 
Machine 1 manufactures 72% of total production of 
the component. The percentage of components which 
are acceptable varies, depending upon which machine 
is used. For machine 1, 97% of components are 
acceptable and for machine 2, 92% are acceptable. A 
component is picked at random. 


(a) What is the probability it was manufactured by 
machine 1? 
(b) What is the probability it is not acceptable? 


(c) What is the probability that it is acceptable and 
made by machine 2? 


(d) If the component is acceptable what is the 
probability it was manufactured by machine 2? 


(e) If the component is not acceptable what is the 
probability it was manufactured by machine 1? 


The measured lifespans (L) of 1500 components are 
recorded in Table 28.4. 


Table 28.4 
Lifespans of 1500 components. 


Lifespan (hours) Number of components 


L > 1000 210 
900 < L < 1000 820 
800 < L < 900 240 
700 < L < 800 200 
L < 700 30 


(a) What is the probability that a component which is 
still working after 800 hours will last for at least 
900 hours? 

(b) What is the probability that a component which is 


still working after 900 hours will continue to last 
for at least 1000 hours? 


The lifespan, L, of 1000 components is measured and 
detailed in Table 28.5. 


Table 28.5 
Lifespans of 1000 components. 


Lifespan (hours) Number of components 


L > 1750 70 
1500 < L < 1750 110 
1250 < L < 1500 150 
1000 < L < 1250 200 
750 < L < 1000 317 
500 < L < 750 96 
250 < L < 500 42 

L < 250 15 


(a) Calculate the probability that the lifespan of a 
component is more than 1000 hours. 


(b) Calculate the probability that the lifespan of a 
component is less than 750 hours. 


(c) Calculate the probability that a component which 
is still working after 500 hours will continue to 
last for at least 1500 hours. 


Machines A, B and C manufacture components. 
Machine A makes 50% of the components, machine 
B makes 30% of the components and machine C 
makes the rest. The probability that a component is 
reliable is 0.93 when made by machine A, 0.90 when 
made by machine B and 0.95 when made by machine 
C. A component is picked at random. 


(a) Calculate the probability that it is reliable. 


(b) Calculate the probability that it is made by 
machine B given it is unreliable. 


5 Components are made by machines A, B, C and D. 
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(a) reliable 


Machine A makes 17% of the components, machine (b) not reliable 
B makes 21% of the components, machine C makes (c) reliable, given it is made by machine B 


20% of the components and machine D makes the 
remainder. For machine A, 96% of the components 


(d) not reliable, given it is made by machine D 


are reliable, for machine B, 89% are reliable, for (e) made by machine A given it is reliable 
machine C, 92% are reliable and for machine D, 97% (f) made by machine C given it is unreliable 
are reliable. A component is picked at random. 

Calculate the probability that it is 


Solutions 


1 (a) 0.72 (b) 0.044 ~— (c)- 0.2576 4 (a) 0.925 (b) 0.4 


(d) 0.2695 (e) 0.4909 


5 (a) 0.9415  (b) 0.0585 (c) 0.89 


2 (a) 0.8110 (b) 0.2039 (d) 0.03 (e) 0.1733 (f) 0.2735 
3 (a) 0.53 (b) 0.153 (c) 0.1909 
ya: =6INDEPENDENT EVENTS 


Two events are independent if the occurrence of either event does not influence the 
probability of the other event occurring. 


Engineering application 28.14 


Acceptability of manufactured electronic chips 


Machine | manufactures an electronic chip, A, of which 90% are acceptable. Ma- 
chine 2 manufactures an electronic chip, B, of which 83% are acceptable. Two chips 
are picked at random, one of each kind. Find the probability that they are both 
acceptable. 


Solution 
The events E, and E, are defined: 
E,: chip A is acceptable 
E,: chip B is acceptable 
P(E,) = 0.9 P(E,) = 0.83 


A single trial consists of choosing two chips at random. We require the probability 
that the compound event, E,  E,, is true. Using the multiplication law we have 


P(E, NE) = P(E, )P(E5|E,) = 0.9P(E,|E;) 
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P(E,|E,) is the probability of E, happening given E, has happened. However, ma- 
chine | and machine 2 are independent, so the probability of chip B being acceptable 
is in no way influenced by the acceptability of chip A. The events E, and E, are 
independent. 


P(E,|E,) = P(E,) = 0.83 
Therefore 


P(E, NE>) = P(E, P(E) = (0.9) (0.83) = 0.75 


For independent events E, and E,: 
(1) P(E, \|E,) = P,), P(E|E,) = P(E,) 
CQ) AWE, (M18) = AVE, WAU») 


The concept of independence may be applied to more than two events. Three or more 
events are independent if every pair of events is independent. If E,, E,,...,E, aren 
independent events then 


P(E||E;) = P(E;) for any i and j,i # j 
and 
P(E, E,) = P(E,)P(E;) ixj 


A compound event may comprise several independent events. The multiplication law is 
extended in an obvious way: 


P(E, NE, ME;) = P(E,)P(E,)P(E3) 
P(E, NE, NE; NE,) = P(E, )P(E,)P(E;)P(E,) 


and so on. 


Engineering application 28.15 


Probability of faulty components 


The probability that a component is faulty is 0.04. Two components are picked at 
random. Calculate the probability that 

(a) both components are faulty 

(b) both components are not faulty 

(c) one of the components is faulty 

(d) one of the components is not faulty 

(e) at least one of the components is faulty 

(f) at least one of the components is not faulty 


28.7 Independent events 


Solution 
We define events F, and F, to be 


F;: the first component is faulty 
F,: the second component is faulty 


Then events F, and F. are 


F ,: the first component is not faulty 
F,: the second component is not faulty 


Then 
PE )=PE,)=004 FF.) = FG) =1—004 = 0.96 
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(a) We require both components to be faulty: 
IAC, (N/A) SS IATA WAU) since events are independent 
= (0.04) 
= 0.0016 
(b) We require both components to be not faulty: 
AUF, VF) = IAG, WAUF a) 
= (0.96)? 
—O92NG 
(c) Consider the two components. Then either both components are faulty or one 


component is faulty or neither of the components is faulty. So, 


P(one component is faulty) = 1 — P(both components are faulty) 
—P(neither of the components is faulty) 
= 1 — 0.0016 — 0.9216 
= 0.0768 


An alternative approach is as follows. Either the first component is faulty and the 
second one is not faulty, or the first component is not faulty and the second one 


is faulty. These two cases are represented by F, M F, and F, M F,. So, 


P(one component is faulty) = P(F, N F,) + PF, NF) 
= P(F,)P(F,) + PF )P(A) 
= (0.04) (0.96) + (0.96) (0.04) 
= 0.0768 


(d) We require one component to be not faulty. Since there are two components then 
requiring one to be not faulty is equivalent to requiring one to be faulty. Hence 


the calculation is the same as that in (c): 


P(one component is not faulty) = 0.0768 


(e 


wm 


nents is faulty: 


At least one of the components is faulty means that one or both of the compo- 
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P(at least one component is faulty) = P(exactly one component is faulty) 
+ P(both components are faulty) 
= 0.0768 + 0.0016 
= 0.0784 
As an alternative we can note that the complement of ‘at least one of the compo- 
nents is faulty’ is ‘none of the components is faulty’. The probability that neither 
of the components is faulty is given in (b). Hence 
P(at least one component is faulty) 
= | — P(none of the components is faulty) 
= 1 — 0.9216 
= 0.0784 
At least one of the components is not faulty means that one or both of the com- 
ponents is not faulty. So 
P(at least one component is not faulty) 
= P(exactly one component is not faulty) 
+ P(both components are not faulty) 
= 0.0768 + 0.9216 
= 0.9984 
As an alternative we note that the complement of “at least one of the components 
is not faulty’ is ‘none of the components are not faulty’. The last statement is 
equivalent to “both the components are faulty’. Hence 
P(at least one component is not faulty) = 1—P(both components are faulty) 
= | — 0.0016 
= 0.9984 


Engineering application 28.16 


Probability of acceptable components 


Machines 1, 2 and 3 manufacture resistors A, B and C, respectively. The probabili- 
ties of their respective acceptabilities are 0.9, 0.93 and 0.81. One of each resistor is 
selected at random. 

(a) Find the probability that they are all acceptable. 

(b) Find the probability that at least one resistor is acceptable. 


Solution 
Define events E,, E, and E, by 


E,: resistor A is acceptable P(E) = 0:9 
E,: resistor B is acceptable IAUEm)) = 093 
E,: resistor C is acceptable P(E,) = 0.81 


1 
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E,, E, and E, are independent events. 
(a) P(E, OE, E;) = P(E, )P(E,)P(E;) = (0.9)(0.93) (0.81) = 0.68 
(b) Let E, and E, be the events 
E,: at least one resistor is acceptable 
E;: no resistor is acceptable 
E, and E, are complementary events and so 
IE.) ap PEs) = tl 
E, may be expressed as 
E;: resistor A is not acceptable and resistor B is not acceptable and 
resistor C is not acceptable 
thatis, i 1 By Ey 
P(E;) = P(E, NE, E;) = P(E, )P(E,) P(E) 
= (1—0.9)(1 — 0.93) (1 — 0.81) 
= (0.1) (0.07) (0.19) 


= 0.001 33 
IXGE) = Il = Js) = 1 = O00 33 = O22367/ 


EXERCISES 28.7 
A and B are two independent events with P(A) = 0.7 (a) all three are acceptable 
and P(B) = 0.4. The compound event A occurs, then (b) none are acceptable 
A occurs, then B occurs, is denoted AAB, and other (c) exactly two are acceptable 


compound events are denoted in a similar way. 
Calculate the probability of the following compound 


(d) at least two are acceptable 


events: 4 Components are made by machines A, B and C. 

(a) AAB Machine A makes 30% of the components, machine 
B makes 25% of the components and machine C 

(b) BAB makes the rest. Two components are picked at 

(c) AAAA random. Calculate the probability that 

The probability a component is faulty is 0.07. Two (a) they are both made by machine C 

components are picked at random. Calculate the (b) one is made by machine A and one is made by 

probability that aching 


(a) both are faulty 


(c) exactly one of the components is made by 


(b) both are not faulty machine B 
(c) the first one picked is faulty and the second one (d) at least one of the components is made by 

picked is not faulty machine B 
(d) at least one is not faulty (e) both components are made by the same machine 
The probability a component is acceptable is 0.92. 5 Capacitors are manufactured by four machines, 1, 2, 3 
Three components are picked at random. Calculate and 4. The probability a capacitor is manufactured 


the probability that 


acceptably varies according to the machine. The 
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probabilities are 0.94, 0.91, 0.97 and 0.94, 
respectively, for machines 1, 2, 3 and 4. 


(a) A capacitor is taken from each machine. What is 
the probability all four capacitors are acceptable? 


(b) Two capacitors are taken from machine | and two 
from machine 2. What is the probability all four 
capacitors are acceptable? 


(c) A capacitor is taken from each machine. 
Calculate the probability that at least three 
capacitors are acceptable. 


Solutions 
(a) 0.196 (b) 0.112 (c) 0.2401 
(a) 0.0049 (b) 0.8649 ~— (c):(0.0651 
(d) 0.9951 
(a) 0.7787. (b) 5.12 x 1074 
(c) 0.2031  (d) 0.9818 


(d) A capacitor is taken from each machine. From 
this sample of four capacitors, one is taken at 
random. 

(i) What is the probability it is acceptable and 
made by machine 1? 


(ii) What is the probability it is acceptable and 
made by machine 2? 


(e) A capacitor is taken from each machine. From 
this sample of four capacitors, one is taken at 
random. What is the probability it is acceptable? 
[Hint: use the results in (d).] 


(a) 0.2025 — (b) 0.15 (c) 0.375 
(d) 0.4375 — (e) (0.355 

(a) 0.7800 (b) 0.7317 (c):0.9808 
(d) (i) 0.235 (ii) 0.2275 = (e) (0.94 


REVIEW EXERCISES 28 
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Components are made by machines A, B and C. 
Machine A makes 30% of the components, machine 
B makes 50% of the components and machine C 
makes the rest. The probability that a component is 
acceptable when made by machine A is 0.96, when 
made by machine B the probability is 0.91 and when 
made by machine C the probability is 0.93. 


(a) A component is picked at random. Calculate the 
probability that it is made by machine C. 


(b) A component is picked at random. Calculate the 
probability that it is made by either machine A or 
machine C. 


(c) A component is picked at random. Calculate the 
probability that it is made by machine B and is 
acceptable. 


(d) A component is picked at random. Calculate the 
probability that it is not acceptable. 


(e) A component picked at random is not acceptable. 
Calculate the probability that it is made by 
machine A. 


(f) A component picked at random is acceptable. 
Calculate the probability that it is made by either 
machine A or machine B. 


(g) Two components are picked at random. Calculate 
the probability that they are both acceptable. 


(h) Two components are picked at random. Calculate 
the probability that one is acceptable and one is 
unacceptable. 


Three machines, A, B and C, manufacture a 
component. Machine A manufactures 35% of the 
components, machine B manufactures 40% of the 
components and machine C makes the rest. A 
component is either acceptable or not acceptable: 7% 
of components made by machine A are not 
acceptable, 12% of components made by machine B 
are not acceptable and 2% of those made by machine 
C are also not acceptable. 


(a) Find the probability that a component is made by 
either machine A or machine B. 


(b) Two components are picked at random. What is 
the probability that they are both made by 
machine B? 


(c) Three components are picked at random. What is 
the probability they are each made by a different 
machine? 


(d) A component is picked at random. What is the 
probability it is not acceptable? 


(e) A component is picked at random. It is not 
acceptable. What is the probability it was made 
by machine B? 


(f) A component is picked at random and is 
acceptable. What is the probability it was made 
by either machine A or machine B? 


Machines M and N manufacture components. The 
probability that a component is of an acceptable 
standard is 0.93 when manufactured by machine M 
and 0.86 when manufactured by machine N. Machine 
M supplies 70% of the components and machine N 
supplies the rest. 


(a) Calculate the probability that a component picked 
at random is of an acceptable standard. 


(b) A component is not of an acceptable standard. 
Calculate the probability that it is made by 
machine N. 


(c) Two components are picked at random. Calculate 
the probability that they are made by different 
machines. 


E, is the event the component is reliable. E, is the 
event the component is made by machine A. State 


(a) E, 
(b) E, 
(c) E, NE, 
(d) E,; UE, 
(e) EAN E, 


The lifespans, L, of 2500 components were measured 
and the results were recorded in Table 28.6. 


Table 28.6 

Lifespan (hours) Number of components 
0<L< 200 76 

200 < L < 300 293 

300 < L < 400 574 

400 < L < 500 1211 

500 < L < 600 346 


(a) Calculate the probability that a component has a 
lifespan of over 400 hours. 


(b) Calculate the probability that a component has a 
lifespan of less than 300 hours. 


(c) Calculate the probability that a component that 
lasts for 300 hours will continue to last for at 
least 500 hours. 
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Given the events E), E, and E; are 

E,: the circuit is functioning 

E,: the circuit is made by machine A 

E;: the circuit will last for at least 600 hours state 
(a) E, 

(b) E3 

(c) E, UE, 

(d) E, NE; 

(ce) E,NE; 

A data stream consists of the characters A, B, C, D 


and E, with respective probabilities of 0.16, 0.23, 
0.31, 0.12 and 0.18. 


(a) Calculate the information associated with the 
character B. 


(b) Calculate the entropy. 

(c) Calculate the redundancy. 

The probability a component is reliable is 0.93. Three 
components are picked at random. Calculate the 
probability that 

(a) all components are unreliable 

(b) exactly one component is reliable 

(c) at least one component is reliable 

(d) exactly two components are reliable 

A circuit is as shown in Figure 28.9. The circuit is 
operational if current can flow from P to Q along any 
route. The probability that resistor type A is faulty is 
0.09, and for resistor type B the probability of a fault 


is 0.14. Calculate the probability that the circuit is 
operational. 


Figure 28.9 


The probability a component is reliable is 0.96. Five 
components are picked at random. Calculate the 
probability that 


(a) all five components are reliable 
(b) none are reliable 


(c) at least one is reliable 
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Solutions 
1 (a) 0.2 (b) 0.5 (c) 0.455 6 (a) The circuit is not made by machine A. 
(d) 0.071 (e) 0.1690 (f) 0.7998 (b) The circuit will last for less than 600 hours. 
Cg) 0:8030 hy 041319 (c) The circuit is functioning or it is made by 
2 (a) 0.75 (b) 0.16 (c) 0.21 machine A. 
Gone ot Gy Oi (d) The circuit is functioning and will last for at least 
3 (a) 0.909 (b) 0.4615 (c) 0.42 600 hours. 
4& (a) The component is not reliable. (e) The circuit is made by machine A and will last 


(b) The component is not made by machine A. for at least 600 hours. 


(c) The component is reliable and made by 7 (a) 2.1203 (b) 2.2469 — (c) 0.0323 


machine A. 4 
: i to 8 (a) 3.43 x 10 (b) 0.0137 (c) 0.9997 
(d) The component is reliable or it is made by 


(d) 0.1816 
machine A. 
(e) The component is made by machine A and it is me 9.9948 
not reliable. 10 (a) 0.8154 (b) 1.024 x 1077 


5 (a) 0.6228 (b) 0.1476 (c) 0.1624 (c) 0.999 999 9 
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P29. INTRODUCTION 


Engineers often need to measure many different variables, for example the output voltage 
of a system, the strength of a beam or the cost of a project. Variables are classified into 
one of two types, discrete or continuous. These are discussed in Section 29.2. 

It is of interest to know the probability that a variable has of falling within a given 
range. There is a high probability that a given variable will fall within some ranges, 
and a small probability for other ranges of values. The way in which the probability is 
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distributed across various ranges of values gives rise to the idea of a probability distribu- 
tion. The study of probability distributions forms the major part of this chapter. The most 
important distributions, the binomial, the Poisson, the uniform, the exponential and the 
normal, are all included. The chapter concludes with a study of reliability engineering. 


RANDOM VARIABLES 


Engineering quantities whose variation contains an element of chance are called random 
variables. Some examples are listed below: 


(1) the diameter of a motor shaft of nominal size 0.2 m; 

(2) the weight of a steel box used to contain an electronic circuit board; 

(3) the number of components passing a point on a factory production line in 1 minute; 
(4) the nominal resistance value of resistors; 


(5) the length of time a machine works without failing. 


All of these quantities vary. In (1), (2) and (5) the quantities vary continuously; that 
is, they can assume any value in some range. For example, a motor shaft may have any 
diameter between 0.197 m and 0.203 m; a steel box could have any weight between, say, 
0.345 kg and 0.352 kg. These are examples of continuous variables. The value itself 
will only be recorded to a certain accuracy, which depends on the measuring device 
and the use to which the data will be put. For example, the shaft diameter may only 
be measured to the nearest tenth of a millimetre. Although such a measurement is only 
integer in multiples of a tenth of a millimetre, the variable being measured is continuous. 

In (3) and (4) the variables can assume only a limited number of values. The number of 
components passing a point in a minute will be a non-negative integer 0, 1,2,3.... The 
nominal resistance value of resistors has a limited number of values which is specified 
by manufacturers in their catalogues. Variables such as these, which can assume only a 
limited set of values, are called discrete variables. 


EXERCISES 29.2 

Is the length of time a machine works without failing 3 State whether the following variables are discrete or 

a continuous or a discrete variable? continuous: 

State whether the following variables are continuous (a) the force required to stretch a spring by a 

or discrete: specified length 

(a) the length of a bridge (b) the output voltage of a system 

(b) the number of electrical sockets in a house (c) the height of a column of liquid 

(c) the length of cable used to wire a house (d) the number of resistors in a circuit 

(d) the weight of solder used to build a circuit board (e) the number of bits of memory of a computer 
Solutions 

continuous 3 (a) continuous (b) continuous (c) continuous 

(d) discrete (e) discrete 
(a) continuous (b) discrete 


(c) continuous 


(d) continuous 


1 


1 
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PROBABILITY DISTRIBUTIONS — DISCRETE VARIABLE 


The range of values that a variable can take does not give sufficient information about 
the variable. We need to know which values are likely to occur often and which values 
will occur only infrequently. For example, suppose x is a discrete random variable which 
can take values 0, 1, 2, 3, 4, 5 and 6. We may ask questions such as ‘Which value is most 
likely to occur?’, ‘Is a 6 more likely to occur than a 5?’, and so on. We need information 
on the probability of each value occurring. Suppose that information is provided and is 
given in Table 29.1. If x is sampled 100 times then on average 0 will occur 10 times, 1 
will occur 10 times, 2 will occur 15 times and so on. Table 29.1 is called a probability 
distribution for the random variable x. Note that the probabilities sum to 1; the table 
tells us how the total probability is distributed among the various possible values of the 
random variable. Table 29.1 may be represented in graphical form (see Figure 29.1). 


P(x) 
0.3 
0.2 


Table 29.1 0.1 
The probability of a discrete 


value occurring. a oa a eS 
x 


x 0 1 2 3 4 5 6 


P(x) 0.1 01 O15 03 02 O01 0.05 Figure29.1 
SSS Plotted data of Table 29.1. 


EXERCISES 29.3 
The probability distribution for the random variable, (f) The variable, x, is sampled 50 000 times. How 
x, is many times would you expect x to have a value 
of 2.5? 
7 2 25 30 35 40 45 2 The probability distribution of the random variable, y, 
P(x) 0.07 0.36 0.21 0.19 0.10 0.07 is given as 
y =3 =2 =1 0 1 2 3 
(a) State P(x = 3.5) Py) 0.63 0.20 0.09 0.04 0.02 0.01 0.01 
(b) Calculate P(x > 3.0) 
Calculate 
(c) Calculate P(x < 4.0) 
(a) P(y 2 0) (b) P(y< 1) 
(d) Calculate P(x > 3.5) (c) P(ly| <1) (d) P(y > 3) 
(e) Calculate P(x < 3.9) (e) PO? < 6) 
Solutions 
(a) 0.19 (b) 0.57 (c) 0.83 (d) 0.17 2 (a) 0.08 (b) 0.98 (c) 0.15 (d) 0.85 


(e) 0.83 (f) 18000 (e) 0.36 
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PROBABILITY DENSITY FUNCTIONS — CONTINUOUS 


P(x) 
0.4 


0.3 
0.2 


0.1 


0 0.2 
Figure 29.2 


VARIABLE 


Suppose x is a continuous random variable which can take any value on [0, 1]. It is 
impossible to list all possible values because of the continuous nature of the variable. 
There are infinitely many values on [0, 1] so the probability of any one particular value 
occurring is zero. It is meaningful, however, to ask “What is the probability of x falling 
in a sub-interval, [a, b]?’ Dividing [0, 1] into sub-intervals and attaching probabilities to 
each sub-interval will result in a probability distribution. Table 29.2 gives an example. 
The probability that x will lie between 0.4 and 0.6 is 0.35, that is P(0.4 < x < 0.6) = 
0.35. Similarly, 


P(0.2 <x < 0.4) = 0.25 


Figure 29.2 shows the table in a graphical form. By making the sub-intervals smaller a 
more refined distribution is obtained. Table 29.3 and Figure 29.3 illustrate this. 

The probability that x lies in a particular interval is given by the sum of the heights 
of the rectangles on that interval. For example, the probability that x lies in [0.5, 0.8] is 
0.2+0.1+0.1 = 0.4; that is, there is a probability of 0.4 that x lies somewhere between 
0.5 and 0.8. Note that the sum of all the heights is 1, representing total probability. 

Consider the sub-interval [a, b]. We require the probability that x lies in this interval. 
The way this is answered is by means of a probability density function (p.d.f.), f(x). 
Such a p.d.f. is shown in Figure 29.4, where it is the area under the graph between a and 


Table 29.2 
Probability that x lies in a given sub-interval. 


x (0,0.2) [0.2,0.4) [0.4,0.6) [0.6,0.8) [0.8, 1.0] 
P(x) 0.1 0.25 0.35 0.2 0.1 
Table 29.3 

Refining the sub-intervals in Table 29.2. 


x [0, 0.1) [0.1,0.2) [0.2,0.3) [0.3,0.4)  [0.4, 0.5) 


P(x) 0.03 0.07 0.1 0.15 0.15 
x [0.5,0.6) [0.6,0.7) [0.7,0.8) [0.8,0.9) [0.9, 1.0] 
P(x) 0.2 0.1 0.1 0.07 0.03 
P(x) 
0.2 
0.1 
0.4 0.6 0.8 1,0° x 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 10 x 
Figure 29.3 


Plotted data of Table 29.2. Plotted data of Table 29.3. 
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Sx) 


Figure 29.4 
0 a b x Shaded area represents P(a < x <b). 


b that gives Pia <x <b). 


b 
P(a <x <b) = area above [a, b] = i f(x) dx 


The total area under a p.d.f. is always 1. 


Example 29.1 Suppose x is a continuous random variable taking any value on [1, 4]. Its p.d.f., f(x), is 
given by 
1 
= 1l<x<4 
fa=5 wF x 
(a) Check that f(x) is a suitable function for a p.d.f. 
(b) What is the probability that (i) x lies on [2, 3.5], (ii) x > 2, (iii) x < 3? 


Solution (a) x can have any value on [1, 4]. For f(x) to be a p.d-f., then the total area under it 
should equal 1, that is 


4 
[ tover=t 
1 


4 4 
= 1 = 4_ 
[ teoa= | 55a = Wat = 1 


Hence f(x) is a suitable function for a p.d_-f. 
3.5 


(b) @) P2<x<35)=]  f(x)dx=[Ve) = 0.457 
2 


4 
(ii) Px > 2) = i f(x) dx = [/x]j = 0.586 
2 


3 
(iii) P(x <3) = / f@) dx = L/x], = 0.732 
1 


Example 29.2 A random variable, z, has a p.d.f. f(z) where 
f@=e* 0<z<co 
Calculate the probability that 


(a) O<z<2 
(b) zis more than | 
(c) zis less than 0.5 
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Solution Note that {>° e~dz = 1 so that f(z) = e~ is suitable as a p.d.f. 
(a) PO<z<2)= fi e%dz = [-e“}} = 0.865 
6b) Pes N= etde= [-e 1 = 0.368 
(ec) P@ =< 05) =([=6")}° = 0.293 


EXERCISES 29.4 


Mm fa)= kx, k constant, -1 <x <1. f(x) isap.d. (a) If 200 measurements of z are made, how many, 
(a) Winkie th I fk on average, will be greater than 1? 
a at is the value of k? 


(by Calculate the pobabiliahnie 05 (b) If 50% of measurements are less than k, find k. 


(c) If P(x > c) = 0.6 then what is the value of c? 4 Ap.df., h(x), is defined by 

2 f(x) isap.d-f. for the random variable x, which can j _ 3 1 2 rer 
vary from 0 to 10. It is illustrated in Figure 29.5. What oe 4‘ 9 Sept 
is the probability that x lies in [2, 4]? Calculate 
SQ) (a) PO<x <0.5) 


< 0.5 
(b) P(—0.3 <x < 0.7) 
(c) P(|x| < 0.5) 
(d) P(x > 0.5) 


; 7 s (ce) P(x < 0.7) 


Figure 29.5 5 (a) Verify that 


Probability density function for Question 2. f(t) =Ae™! peo 


3 Ap.d-f. is given by is suitable as a p.d.f. 


f@M= 2e 7% 0<z<o@w (b) Calculate P(t > 2) if A = 3. 
Solutions 
1 (a) 1.5 (b) 0.4375 (c) —0.5848 4 (a) 0.3438 (b) 0.6575 (c) 0.6875 


(d) 0.1563 (e)_: 0.9393 
5 (b) 2.479 x 1073 


2 0.28 


3 (a) 27 (b) 0.3466 


If {x,, x), %3,...,X,} is a set of n numbers, then the mean value of these numbers, de- 
noted by X, is 


_  sumofthenumbers  )>x; 
x= — 
n n 


X is sometimes called the arithmetic mean. 
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Example 29.3 Find the mean of —2.3, 0, 1, 0.7. 


Solution 


-23+0+41+07  -0.6 


= 0,15 
4 4 


x= 


The mean value is a single number which characterizes the set of numbers. It is useful 
in helping to make comparisons. 


Engineering application 29.1 


Comparison of two production machines 


A component is made by two machines, A and B. The lifespans of six components 
made by each machine are recorded in Table 29.4. Which is the preferred machine? 


Table 29.4 
The lifespans of six components made by machines A and B. 
Lifespan (hours) 
Machine A 92 86 61 70 58 65 
Machine B 64 75 84 80 63 70 
Solution 


For components manufactured by machine A 


. 43 
average lifespan = Lear 2 


For components manufactured by machine B 
’ 43 
average lifespan = Tepe WaT 


Machine B produces components with a higher average lifespan and so is the pre- 
ferred machine. 


Example 29.4 A variable, x, can have values 2, 3, 4, 5 and 6. Many observations of x are made and 


Solution 


denoted by x;. They have corresponding frequencies, f;. The results are as follows: 


lon 


value (x;) 2 3 4 5 
frequency (f;) 6 9 3 7 4 


Calculate the mean of x. 


The sum of all the measurements must be found. The value 2 occurs six times, contribut- 
ing 12 to the total. Similarly, 3 occurs nine times contributing 27 to the total. Thus 


total = 2(6) + 3(9) + 4(3) + 5(7) + 6(4) = 110 
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The number of measurements made is 6+ 9+3-+7+4 = 29. So, 


Example 29.4 illustrates a general principle. 


If values x,,x,,...,x, occur with frequencies f,, f,,..., f, then 


EAE 
Dee, 


— 


EXERCISES 29.5 


1 Engineers in a design department are assessed by their 
leader. A ‘0’ is ‘Terrible’ and a ‘5’ is ‘Outstanding’. 
The 29 members of the department are evaluated and 
their scores recorded as follows: 


score 012 3 4 5 
number of staff 2 5 6 9 4 3 


What is the mean score for the whole department? 


2 Two samples of nails are taken. The first sample has 
12 nails with a mean length of 2.7 cm, the second 
sample has 20 nails with a mean length of 2.61 cm. 
What is the mean length of all 32 nails? 


3 The output, in volts, from a system is measured 40 
times. The results are recorded as follows: 


voltage (volts) 9.5 10.0 10.5 11.0 11.5 


Calculate the mean output voltage. 


The current, in amps, through a resistor is measured 
140 times. The results are: 


current (amps) 2.25 2.50 2.75 3.00 3.25 
number of 32 pay 39 22 20 
measurements 


Calculate the mean current through the resistor. 


In a communication network, packets of information 
travel along lines. The number of lines used by each 
packet varies according to the following table: 


number of lines used 1 2 3 4 °5 
number of packets 17 54 32 6 1 


number of 6 14 8 7 5 Calculate the mean number of lines used per packet. 
measurements 
Solutions 
1 2.586 4 2.70 
2 2.64 ae 2.27 


3 10.39 


Example 29.5 


Solution 
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STANDARD DEVIATION 


Although the mean indicates where the centre of a set of numbers lies, it gives no measure 
of the spread of the numbers. For example, —1, 0, 1 and —10, 0, 10 both have a mean of 
0 but clearly the numbers in the second set are much more widely dispersed than those 
in the first. A commonly used measure of dispersion is the standard deviation. 

Let x,,x,,...,x, be n measurements with a mean x. Then x, — x is the amount by 
which x, differs from the mean. The quantity x, — xX is called the deviation of x; from 
the mean. Some of these deviations will be positive, some negative. The mean of these 
deviations is always zero (see Question 3 in Exercises 29.6) and so this is not helpful 
in measuring the dispersion of the numbers. To avoid positive and negative deviations 
summing to zero the squared deviation is taken, (x, — x)”. The variance is the mean of 
the squared deviations: 


Le ae 


n 


variance = 


and 


standard deviation = ¥V variance 


Standard deviation has the same units as the x,. 


Calculate the standard deviation of 


(a) —1,0,1 
(b) —10, 0, 10 


(a) x, = —l,x, =0,x, = 1. Clearly x = 0. 


4-— FS 1 X,—-x=0 xX,—-xX=1 
. (-1)? +0? 4+ 1? 2 
variance = 3 =— 


2 
standard deviation = Je = 0.816 
(b) x, = —10, x, = 0, x, = 10. Again ¥ = 0 and so x, —x = x,, fori = 1, 2, 3. 


(=r -+10 206 
Variance = 3 — 3 


rae 200 
standard deviation = -s = 8.165 


As expected, the second set has a much higher standard deviation than the first. 
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Example 29.6 Find the standard deviation of —2, 7.2, 6.9, —10.4, 5.3. 


Solution x, = —2,x, =7.2,x, = 6.9, x, = —10.4,x, =5.3,¥= 1.4 


x, -xX=—3.4 
xX, -xX=5.8 
X= x= 5.5 
xX,—-x=-—11.8 
xX; —-xX=3.9 
. (—3.4)? + (5.8)? + (5.5)? + (—11.8)? + (3.9)? 229.9 
variance = = 
5 D) 
229.9 
standard deviation = ——— = 6.78 
Calculating x, —x,i = 1, 2,...,n, 1s tedious for large n and so a more tractable form of 


the standard deviation is sought. Firstly observe that )> x, = n¥ and > X* = nx’, since 
X° is a constant. Now 


> i@,- 3" — ae — So 2xx+ aes 
= ae — 2) x, +n¥ 


= De — 2% (nx) + nx 


= ) x? — nx 


Hence, 


and 
RE ey ues 
standard deviation = ,/ —————_ 
n 
Using these formulae it is not necessary to calculate x; — x. 


Example 29.7 Repeat Example 29.6 using the newly derived formulae. 


Solution °x7 = (—2)? + (7.2)* + (6.9)? + (—10.4)? + (5.3)? = 239.7 


1239.7 — 5(1.4)2 
standard deviation = aa = 6.78 


1 


1 
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EXERCISES 29.6 
Calculate the means and standard deviations of: 2 A set of measurements {x,, x7, X3,...,X,} has a mean 
(a) 1,2,3,4,5 of x and a standard deviation of s. What are the mean 
eee and standard deviation of the set 
(b) 2.1, 2.3, 2.7, 2.6 {kx ,, kxy, kx3,..., kx, } where k is a constant? 
(c) 37, 26, 19, 21, 19, 25, 17 —— 
3. The mean of the numbers {x,,x5,x3,...,X,} is X. 
(d) 6,6, 6, 6, 6, 6 ce sith : 
es ae Show that the sum of the deviations about the mean 
(e) —1,2, -3,4, —5, 6 is 0; that is, show )~_, (x; —X) = 0. 
Solutions 
(a) mean = 3; st. dev. = 1.414 (d) 6,0 
(e) 0.5, 3.862 


(b) 2.425, 0.238 
(c) 23.4, 6.321 


> | 


29.7.1 


2 mean = kx, st. dev. = ks 


EXPECTED VALUE OF A RANDOM VARIABLE 


In Sections 29.5 and 29.6 we showed how to calculate the mean and standard deviation 
of a given set of numbers. No reference was made to probability distributions or p.d.f.s. 
Suppose now that we have knowledge of the probability distribution of a discrete random 
variable or the p.d.f. of a continuous random variable. The mean value of the random 
variable can still be found. Under these circumstances the mean value is known as the 
expected value or expectation. 


Expected value of a discrete random variable 


Suppose for definiteness that x is a discrete random variable with probability distribution 
as given in Table 29.5. 


Table 29.5 
Probability distribution for a discrete random variable x. 
x 0 di 2 3 4 


P(x) 0.1 0.2 0.4 0.15 0.15 


In 100 trials x will have a value of zero 10 times on average, a value of one 20 times on 
average and so on. The mean value, that is the expected value, is therefore 


O(10) + 1(20) + 2(40) + 3(15) + 4(15) 
100 ~ 


We could have arranged the calculation as follows: 


expected value = 2.05 


expected value = O(z0) as ey + 2(im) + Sa) - 4(s60) 
= 0(0.1) + 1(0.2) + 2(0.4) + 3(0.15) + 4(0.15) 
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Each term is of the form (value) x (probability). Thus, 


i=5 
expected value = yar (x;) 


i=1 


The symbol, ju, is used to denote the expected value of a random variable. 


If a discrete random variable can take values 


pelo eeG lee 


with probabilities P(x,), P(x,),..., P(x,,), then 


expected value of x = uw = ye 2s) 
i=1 


Example 29.8 A random variable, y, has a known probability distribution given by 


Solution 


29.7.2 


y 2 4 6 8 10 
P(y) 0.17 0.23 0.2 0.3 0.1 


Find the expected value of y. 


We have 
j4 = expected value = 2(0.17) + 4(0.23) + 6(0.2) + 8(0.3) + 10(0.1) = 5.86 


Expected value of a continuous random variable 


Suppose a continuous random variable, x, has p.d.f. f(x), a < x < b. The probability 
that x lies in a very small interval, [x, x + dx], is 


x+8x 


f(t) dt 


x 


Since the interval is very small, f will vary only slightly across the interval. Hence the 
probability is approximately f(x)dx: see Figure 29.6. The contribution to the expected 
value as a result of this interval is 


(value) x (probability) 


that is, xf (x)dx. Summing all such terms yields 


b 
expected value = = il xf (x) dx 
a 


1 


fo) 


x x+6x 


Area = f(x).6x 


29.7 Expected value of a random variable 


Figure 29.6 


x in the small interval [x, x + 6x]. 


Example 29.9 A random variable has p.d.f. given by 


1 
IO) =F 


Calculate the expected value of x. 


Solution 


7 33/2 Cok 
ia ae Oe Foe 


1l<x<4 


945 


The shaded area represents the probability that x lies 


7 
So, if several values of x are measured, the mean of these values will be near to Se As 


. 7 
more and more values are measured the mean will get nearer and nearer to —. 


EXERCISES 29.7 


Calculate the expected value of the discrete random 
variable, h, whose probability distribution is 


h 1 Ss Ly 24 32 
P(h) 0.32 0.24 0.17 0.15 0.12 


Calculate the expected value of the random variable, 
x, whose probability distribution is 


x 2 25 30 35 40 45 
P(x) 0.07 0.36 0.21 0.19 0.10 0.07 


A random variable, z, has p.d.f. f(z) = e~“, 
0 < z < o. Calculate the expected value of z. 


4 A random variable, x, has p.d.f. f(x) given by 


5 
IO = Fa 


(a) Calculate the expected value of x. 


(b) Ten values of x are measured. They are 
1.9, 2.9, 2.8, 2.1, 3.2, 3.4, 2.7, 2.3, 2.8, 2.7 
Calculate the mean of the observations and 
comment on your findings. 


5 Ap.d.f. h(x) is defined by 
3 2 
h@)= 7d—-*) -l<x<l 


Calculate the expected value of x. 


6 Is the expected value of a discrete random variable 


necessarily one of its possible values? 
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1 
2 
3 


Solutions 


1.668 
3.05 


1 


29.8.1 


Example 29.10 
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4 (a) 2.012 (b) 2.68 
5 0 


6 no 


STANDARD DEVIATION OF A RANDOM VARIABLE 


Standard deviation of a discrete random variable 


Recall from Section 29.6 that the standard deviation of a set of numbers, {x,, x,,...,X,}, 
is given by 
cone » (x; = x)? 
standard deviation = ,/ =—~——— 
n 

Now suppose that x is a discrete random variable which can have values x,, X,,X3,...,X,, 
with respective probabilities of p,, p,, P3,..-, P,; that is, we have 
xX xX 1 X5 X3 ite xX, 
P(x) P P2 P3 tee Pn 


Let the expected value of x be jz. Then the square of the deviation from the expected 
value has an identical probability distribution: 


value (x1 — mb)? (x) — pw)? ee = mb)? 
probability P| Po xe Pn 


The expected value of the mean squared deviation is the variance. The symbol ? is used 
to denote the variance of a random variable: 


variance = 0? = Se — py 
1 


As before the standard deviation is the square root of the variance: 
standard deviation = o = ,/ Oe Die Ly 


A discrete random variable has probability distribution 


x 1 2 3 4 > 
P(x) 0.12 0.15 0.23 0.3 0.2 


Solution 


29.8.2 


Example 29.11 


Solution 


29.8 Standard deviation of a random variable 


Calculate 


(a) the expected value 
(b) the standard deviation 


(a) w= Yo x,p; = 1(0.12) + 2(0.15) + 3(0.23) + 4(0.3) + 5(0.2) = 3.31 


(b) = page? = pL) 


=0.12(1 — 3.31)? + 0.15(2 — 3.31)? + 0.23(3 — 3.31)? 


+0.3(4 — 3.31)? + 0.2(5 — 3.31)? 
= 1.6339 


standard deviation = 0 = 1.6339 = 1.278 


Standard deviation of a continuous random variable 


947 


We simply state the formula for the standard deviation of a continuous random variable. 
It is analogous to the formula for the standard deviation of a discrete variable. Let x be 


a continuous random variable with p.d.f. f(x), a <x < b. Then 


b 
ve if G = FO) dr 


A random variable, x, has p.d.f. f(x) given by 
f@a=l1 O0<x<l 


Calculate the standard deviation of x. 


The expected value, jz, is found: 


1 x2 1 1 


The variance can now be found: 


1 1 2 
variance = 0? = f ( -;) ldx 
0 2 


} 1 
= f P-x4+ ja 
0 4 


1 
=,/—= 20.99 
o = V 12 


The standard deviation of x is 0.29. 


948 Chapter 29 Statistics and probability distributions 


EXERCISES 29.8 


1 Ap.d.f. h(x) of the random variable x is defined by Calculate the standard deviation of w. 


3 4 A discrete random variable, y, has a probability 
ha= 70-2) -1<x<1 distribution 


(a) Calculate the expected value of x. 


(b) Calculate the standard deviation of x. y 6 7 8 1 10 11 
Py) 0.13 0.26 0.14 0.09 0.11 0.27 


2 A random variable, t, has p.d.f. H(t) given by 
H(it)=3e* 130 


(a) Calculate the expected value of r. Calculate the standard deviation of y. 


(b) Calculate the standard deviation of t. 5 Acontinuous random variable has p.d.f. 


3 A discrete random variable, w, has a known 


probability distribution f®= | xt+1 -1<x<0 


—x+1 O0<x<l 


w -1 -05 0 OS. <1 


(a) Calculate the expected value of x. 
P(w) 0.1 0.17 04 0.21 0.12 


(b) Calculate the standard deviation of x. 


Solutions 

1 (a) 0 = (b) 0.4472 4 1.84 

Zai 0) 4 5 (a)0 = (b) 0.4082 
3 0.5598 


| 29.9 | PERMUTATIONS AND COMBINATIONS 


Both permutations and combinations are used extensively in the calculation of probabil- 
ities. 


29.9.1 Permutations 


The following problem introduces permutations. 


Engineering application 29.2 


Linking process control computers 


A primary and a secondary route must be chosen from available routes A, B and C, 
for the linking of two process control computers in order to provide redundancy in 
case one fails. In how many ways can the choices be made? 


Example 29.12 


Solution 


Solution 
The various possibilities are listed. 
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Primary route Secondary route 
A B 
B A 
A Cc 
Cc A 
B Cc 
Cc B 


There are six ways in which the choices can be made. Alternatively we could argue 
as follows. Suppose the primary route is chosen first. There are three choices: any 
one of A, B or C. The secondary route is then chosen from the two remaining routes, 
giving two possible choices. Together there are 3 x 2 = 6 ways of choosing a primary 
route and a secondary route. 


In Engineering application 29.2 the choice AB is distinct from the choice BA, that is the 
order is important. Choosing two routes from three and arranging them in order is an 
example of a permutation. More generally: 


A permutation of n distinct objects taken r at a time is an arrangement of r of the n 
objects. 


In forming permutations, the order of the objects is important. If three letters are chosen 
from the alphabet the permutation XYZ is distinct from the permutation ZX Y. We pose 
the question ‘How many permutations are there of n objects taken r at a time?’ The 
following example will help to establish a formula for the number of permutations. 


Calculate the number of permutations there are of 


(a) 
(b) 
(c) 


(a) 


(b) 


four distinct objects taken two at a time 
five distinct objects taken three at a time 


seven distinct objects taken four at a time 


Listing all possible permutations is not feasible when the numbers involved are 
large. In choosing the first object, four choices are possible. In choosing the sec- 
ond object, three choices are possible. There are thus 4 x 3 = 12 permutations of 
four objects taken two at a time. Note that 12 may be written as 
4! 4! 

12=4x3=— = —_ 
2! (4-2)! 
There are five objects available for the first choice, four for the second choice and 
three for the third choice. Hence there are 5 x 4 x 3 = 60 permutations. Again note 
that 

5! 5! 

60 = — = ——_ 

2! (5-3)! 
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(c) There are seven objects available for the first choice, six for the second, five for the 
third and four for the fourth. The number of permutations is 7 x 6 x 5 x 4 = 840. 
! 


7! 
This may be written as ————. 
(7 — 4)! 


The example illustrates the following general rule. 


The number of permutations of n distinct objects taken r at a time, written P(n, r), is 
n! 


= Gea 


Example 29.13 Find the number of permutations of 
(a) 10 distinct objects taken six at a time 
(b) 15 distinct objects taken two at a time 


(c) six distinct objects taken six at a time 


10! 10! 
Solution () (10,6) = Go—By = Gy = 151200 
15! 15! 
(b) P15, 2) = ee, = ed = 210 
(5—2)! 13! 
! 
(©) P66) = = on = % = 720 (Note that 0! = 1) 


P(6, 6) is simply the number of ways of arranging all six of the objects. 


Note that 
P(n,n) =n! 


This is the number of ways of arranging n given objects. 


29.9.2. Combinations 


Closely related to, but nevertheless distinct from, permutations are combinations. 


A combination is a selection of r distinct objects from n objects. 


In making a selection the order is unimportant. For example, given the letters A, B and C, 
AB and BA are the same combination but different permutations. As with permutations 
we develop an expression for the number of combinations of n objects taken r at a time. 
Examples 29.14 and 29.15 help with this development. 
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Example 29.14 There are three routes, A, B and C, joining two computers. In how many ways can two 


Solution 


Example 29.15 


routes be chosen from A, B and C? 


The possible combinations (selections) can be listed as 
AB, BC, AC 


that is, there are three possible ways of making the selection. We can also use our knowl- 
edge of permutations to calculate the number of combinations. There are P(3, 2) ways 
of arranging the three routes taken two at a time. 


C2) = a= 


Each combination of two routes can be arranged in P(2, 2) ways; that is, each combina- 
tion gives rise to two permutations. For example, the combination AB could be arranged 
as AB or BA, giving two permutations. Thus, the number of combinations is half the 
number of permutations. There are 6/2 = 3 combinations. 


Calculate the number of combinations of 


Solution 


(a) six distinct objects taken four at a time 


(b) 10 distinct objects taken six at a time 


(a) Consider one combination of four objects. These four objects can be arranged in 
4! ways; that is, each combination gives rise to 4! permutations. The number of 
permutations of six objects taken four at a time is 


6! 
P(6,4) = ry 
Hence the number of combinations is 
P(6,4 ! 
(64) _ 6! _ a, 
4! 214! 


There are 15 combinations of six objects taken four at a time. 


(b) Each combination, comprising six objects, gives rise to 6! permutations. The number 
of permutations of 10 objects taken six at a time is 


10! 
P(10, 6) = a 
P10, 6 10! 
Hence the number of combinations = ao = Tie! = 210. 


We write ( 


ae) 


) to denote the number of combinations of n objects taken r at a time. A 


formula for (") is now developed. 
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Example 29.16 


Solution 


Example 29.17 


Solution 


Each combination of r objects gives rise to r! permutations, but 


n! 


P(n,r) = Gani =a 


The number of combinations of n distinct objects taken r at a time is 


ay P(n, r) * n! 
(") Sie 277 


Calculate the number of combinations of 


(a) six distinct objects taken five at a time 
(b) nine distinct objects taken nine at a time 


(c) 25 distinct objects taken five at a time 


6! 9! 2 25! 
(a) ($)-7=6 (b) ()=an=! (c) (2) = Baa 3310 


We can generalize the result of Example 29.16(b) and state that & =, 


There are k identical objects and n compartments (n > k). Each compartment can 
hold only one object. In how many different ways can the k objects be placed in the n 
compartments? 


The order in which the objects are placed is unimportant since all the objects are iden- 
tical. Placing the k objects is identical to selecting k of the n compartments (see Fig- 


ure 29.7). But the number of ways of selecting k compartments from n is precisely (a) 


n 


Hence the k objects can be placed in the n compartments in ( k 


) different ways. 


n compartments 


: Figure 29.7 
k objects Placing k objects in n compartments. 
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EXERCISES 29.9 
1. Evaluate Expand 4 6 5 
(a) P(8, 6) (b) P(11,7) (a) (a+b)" (b) (1+x)” (©) (pt+q) 
12 15 15 4 Primary and secondary routes connecting two 
(c) (d) (e) ‘ 
9 12 3 computers need to be chosen. Two primary routes are 
2 . _ needed from eight which are suitable and three 
Write out explicitly secondary routes must be chosen from four available. 
(a) (\) (b) (7) (c) @) In how many ways can the routes be chosen? 
; ; ae 5 Acombination lock can be opened by dialling three 
3 The expansion of @ +b)" where n is a POStiive mteeet correct letters followed by three correct digits. How 
may be written with the help of combination notation. many different possibilities are there for arranging the 
(a+b)" =a" + (7) ab letters and digits? Is this more secure than a lock 
1 which has seven digits? Is the word ‘combination’ 
rs (3) ap... being used correctly? 
6 A nuclear power station is to be built on one of 20 
ms nm \ apr! +(")p possible sites. A team of engineers is commissioned to 
=1 n examine the sites and rank the three most favourable 
ee fs in order. In how many ways can this be done? 
= ( ) qk pk 
k 
k=0 
Solutions 
(c) p> + Sp*q + 10p3q? + 10p*q? + Spq* + @° 
1 (a) 20160 (b) 1663200 (c) 220 C)P Pq P| iP Pa + 
(d) 455 (e) 455 BR 112 
2 n(n — 1) 
(a) 1 (b) n (c) 5) 5 17576000. Yes, it is more secure. Combination is not 
Fi 3 242 3 4 being used correctly: permutation lock would be 
3 (aja"+4a°b+ 6aeb* + 4ab’? +b b 
etter. 
(b) 1 + 6x + 15x? + 20x3 + 15x4 + 6x5 + x® 6 6840 


| 29.10 | THE BINOMIAL DISTRIBUTION 


In a single trial or experiment a particular result may or may not be obtained. For ex- 
ample, in an examination, a student may pass or fail; when testing a component it may 
work or not work. The important point is that the two outcomes are complementary. As- 
suming that the probability of an outcome is fixed, such a trial is called a Bernoulli trial 
in honour of the mathematician, J. Bernoulli. 

We address the following problem. In a single trial the outcome is either A or B, that is 
A. We refer to A as asuccess and B as a failure. If we know P(A) = p then P(B) = 1—p. 
If n independent trials are observed, what is the probability that A occurs k times, and B 
occurs n — k times? The number of successful trials in n such experiments is a discrete 
random variable; such a variable is said to have a binomial distribution. Let us consider 
a particular problem. 
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29.10.1 


Engineering application 29.3 


Sampling components made by a machine 
A machine makes components. The probability that a component is acceptable is 0.9. 


(a) If three components are sampled find the probability that the first is acceptable, 
the second is acceptable and the third is not acceptable. 


(b) If three components are sampled what is the probability that exactly two are 
acceptable? 


Solution 
Let the events A and B be defined thus: 


A: the component is acceptable, P(A) = 0.9 
B: the component is not acceptable, P(B) = 0.1 


(a 


= 


We denote by AAB the compound event that the first is acceptable, the second 
is acceptable, the third is not acceptable. Since the three events are independent 
the multiplication law in Section 28.7 gives 


P(AAB) = P(A)P(A)P(B) = (0.9) (0.9) (0.1) = (0.9)?(0.1) = 0.081 


(b) Weare interested in the compound event in which two components are acceptable 
and one is not acceptable. We denote by AAB the compound event that the first 
is acceptable, the second is acceptable, the third is not acceptable. Compound 
events ABA and BAA have obvious interpretations. 

If exactly two components are acceptable then either AAB or ABA or BAA 
occurs. These compound events are mutually exclusive and we can therefore use 
the addition law (see Section 28.3). Hence 


P(exactly two acceptable components) = P(AAB) + P(ABA) + P(BAA) 


From (a) P(AAB) = 0.081 and by similar reasoning P(ABA) = P(BAA) = 
0.081. Hence, 


P(exactly two acceptable components) = 3(0.081) = 0.243 


Probability of k successes from 7 trials 


Let us now return to the general problem posed earlier. We define the compound 
event C: 


C: A occurs k times and B occurs n — k times 
n 


k 
ways (see Example 29.17). The probability of a particular distribution of k occurrences 


The k occurrences of event A can be distributed amongst the v trials in ( ) different 


n 


4 distinct distributions 


of A and n—k occurrences of B is p*(1—p)"~*. Since there are ( 


possible then 


Mo ({) Aa =p k=0,1,2,.5..7 
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Example 29.18 The probability that a component is acceptable is 0.93. Ten components are picked at 
random. What is the probability that 


(a) at least nine are acceptable 


(b) at most three are acceptable? 


Solution (a 


wm 


P(exactly 9 components are acceptable) = (s) (0.93)? (0.07) = 0.364 


P(exactly 10 components are acceptable) = (Ga) (0.93)!° = 0.484 


Hence, 
P(at least 9 components are acceptable) = 0.364 + 0.484 = 0.848 
(b 


~ 


We require the probability that none are acceptable, one is acceptable, two are ac- 
ceptable and three are acceptable: 


P(0 are acceptable) = 3, (0.93)°(0.07)!° = 2.825 x 107 


P(1 is acceptable) = ic (0.93)!(0.07)? = 3.753 x 107!° 


2 


— 


0 


3 


P(2 are acceptable) = .) (0.93)?(0.07)® = 2.244 x 10°8 
) (0.93)? (0.07)’ = 7.949 x 1077 


P(3 are acceptable) = ( 


Hence, 


P(at most 3 are acceptable) = 2.825 x 107! + 3.753 x 10°-° 
+ 2.244 x 1078 + 7.949 x 1077 
= 8.18 x 107” 


that is, the probability that at most three components are acceptable is almost zero; 
it is virtually impossible. 


29.10.2 Mean and standard deviation of a binomial distribution 


Let the probability of success in a single trial be p and let the number of trials be n. 
The number of successes in 7 trials is a discrete random variable, x, with a binomial 
distribution. Then x can have any value from {0, 1, 2,3,...,m}, although clearly some 
values are more likely to occur than others. The expected value of x can be shown to be 
np. Thus, if many values of x are recorded, the mean of these will approach np. 


Expected value of the binomial distribution = np 


The standard deviation of the binomial distribution can also be found. This is given by 


standard deviation of the binomial distribution = ./np(1 — p) 
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29.10.3 


Example 29.19 


operate correctly in order to be acceptable. The 


Most likely number of successes 


When conducting a series of trials it is sometimes desirable to know the most likely 
outcome. For example, what is the most likely number of acceptable components in a 
sample of five tested? 


The probability a component is acceptable is 0.8. Five components are picked at random. 
What is the most likely number of acceptable components? 


Solution P(no acceptable components) = (3) (0.8)°(0.2)° = 3.2 x 107+ 
P(1 acceptable component) = (7) (0.8)!(0.2)* = 6.4 x 1073 
P(2 acceptable components) = (3) (0.8)?(0.2)? = 0.0512 
P(3 acceptable components) = (3) (0.8)3 (0.2)? = 0.2048 
P(4 acceptable components) = (3) (0.8)*(0.2)! = 0.4096 
P(5 acceptable components) = (3) (0.8)°(0.2)° = 0.3277 
The most likely number of acceptable components is four. 
Example 29.19 illustrates an important general result. Suppose we conduct n Bernoulli 
trials and wish to find the most likely number of successes. If p = probability of success 
on a single trial, and i = most likely number of successes in v trials, then 
pa+1)-1<i<pm+l1) 
In Example 29.19, p = 0.8, n = 5 and so 
(0.8)(6) — 1 <i < (0.8)(6) 
3.8<i1<4.8 
Since i is an integer, then i = 4. 
EXERCISES 29.10 
The probability a component is acceptable is 0.8. Four (a) What is the probability the machine is 
components are sampled. What is the probability that acceptable? 
(a) exactly one is acceptable (b) What is the probability that six of the seven chips 
(b) exactly two are acceptable? are operating correctly? 
A machine requires all seven of its micro-chips to (c) The machine is redesigned so that the original 


seven chips are replaced by four new chips. The 


probability a micro-chip is operating correctly is 0.99. probability a new chip operates correctly is 0.98. 
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Is the new design more or less reliable than the the most likely number of valves to remain reliable 
original? for more than 10 years? 
3 The probability a machine has a lifespan of more than 5 The probability a chip is manufactured to me 
5 years is 0.8. Ten machines are chosen at random. acceptable standard is 0.87. A sample of six chips is 
What is the probability that picked at random from a large batch. 


(a) eight machines have a lifespan of more than 5 


years 


(b) all machines have a lifespan of more than 5 years 


(a) Calculate the probability all six chips are 
acceptable. 


(b) Calculate the probability none of the chips is 


acceptable. 
(c) at least eight machines have a lifespan of more (c) Calculate the probability that fewer than five 
than 5 years chips in the sample are acceptable. 
(d) no more than two machines have a lifespan of (d) Calculate the most likely number of acceptable 


less than 5 years? 


chips in the sample. 


4 The probability a valve remains reliable for more than (e) Calculate the probability that more than two 
10 years is 0.75. Eight valves are sampled. What is chips are unacceptable. 
Solutions 
1 (a) 0.0256 (b) 0.1536 4 6 
2 (a) 0.9321 (b) 0.0659 (c) 0.9224. 5 (a) 0.4336 (b) 4.826 x 10-® (c) 0.1776 (d) 6 
New design is less reliable (e) 0.0324 


3 (a) 0.3020 (b) 0.1074 (c) 0.678 (d) 0.678 


THE POISSON DISTRIBUTION 


The Poisson distribution models the number of occurrences of an event in a given inter- 
val. Consider the number of emergency calls received by a service engineer in one day. 
We may know from experience that the number of calls is usually three or four per day, 
but occasionally it will be only one or two, or even none, and on some days it may be six 
or seven, or even more. This example suggests a need for assigning a probability to the 
number of occurrences of an event during a given time period. The Poisson distribution 
serves this purpose. 

The number of occurrences of an event, F, in a given time period is a discrete random 
variable which we denote by X. We wish to find the probability that ¥ = 0, X = 1, 
X = 2, X = 3, and so on. Suppose the occurrence of F in any time interval is not 
affected by its occurrence in any preceding time interval. For example, a car is not more, 
or less, likely to pass a given spot in the next 10 seconds because a car passed (or did 
not pass) the spot in the previous 10 seconds, that is the occurrences are independent. 

Let A be the expected (mean) value of X, the number of occurrences during the time 
period. If X is measured for many time periods the average value of X will be A. Under 
the given conditions X follows a Poisson distribution. The probability that X has a value 
ris given by 


OX =7) = 
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The expected value and variance of the Poisson distribution are both equal to i. 


Engineering application 29.4 


Emergency calls to a service engineer 


Records show that on average three emergency calls per day are received by a service 
engineer. What is the probability that on a particular day 
(a) three (b) two (c) four 


calls will be received? 


Solution 
The number of calls received follows a Poisson distribution. The average number of 
calls is three per day, that is A = 3. 

333 e332 


(@) POS = 3)= 31 = 0.224 (0) POS = 2) = an os 0.224 
—324 
@) POX e4)= — = 0.168 


The engineer will receive three calls on approximately 22 days in 100, two calls on 
approximately 22 days in 100 and four calls on approximately 17 days in 100. 


Engineering application 29.5 


Machine breakdowns in a workshop 


A workshop has several machines. During a typical month two machines will break 
down. What are the probabilities that in a month 


(a) none (b) one (c) more than two 


will break down? 


Solution 


4. = average number of machines that break down = 2 
X = number of machines broken down 


ene 

(@) Ox =@Q) = 01 = O35 
e297! 

(ld) 20K = I) = 7 = O27 


CICS a= C=) 9406S ae 
=]—e2—2e? —2e? = 0.323 
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Table 29.6 
The probabilities for binomial and Poisson distributions. 
Binomial (n, p) Poisson (A) 
P(X =r); n= 15, p= 0.05 P(X =r); 4 =0.75 
r=0 0.463 39 0.472 37 
r=1 0.365 76 0.354 27 
r=2 0.134 75 0.132 85 
r= 0.030 73 0.033 21 
r= 0.004 85 0.006 23 
r= 0.000 56 0.000 90 
r=6 0.000 05 0.000 07 
r=7 0.000 00 0.000 01 


Poisson approximation to the binomial 


The Poisson and binomial distributions are related. Consider a binomial distribution, in 
which n trials take place and the probability of success is p. If n increases and p decreases 
such that np is constant, the resulting binomial distribution can be approximated by a 
Poisson distribution with A = np. Recall that np is the expected value of the binomial 
distribution, and A is the expected value of the Poisson distribution. 

To illustrate the above point, Table 29.6 lists the probabilities for binomial and Pois- 
son distributions with n = 15, p = 0.05 and hence A = 15(0.05) = 0.75. The remaining 
probabilities are all almost 0. 

As n increases and p decreases with np remaining constant, agreement between the 
two distributions becomes closer. 


Engineering application 29.6 


Workforce absentees 

A workforce comprises 250 people. The probability a person is absent on any one 
day is 0.02. Find the probability that on a day 

(a) three (b) seven 


people are absent. 


Solution 
This problem may be treated either as a sequence of Bernoulli trials or as a Poisson 
process. 
Bernoulli trials 
The probabilities follow a binomial distribution. 
E: a person is absent 
n = number of trials = 250 


p = probability that E occurs in a single trial = 0.02 
X = number of occurrences of event E 
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250 


(a) P(X =3)= ( 3 ) (0.02)3(0.98)*47 = 0.140 


250 


(oy) OCS 7) = ( 7 ) (0.02)7(0.98)2” = 0.105 


Poisson process 


Since n is large and p is small the Poisson distribution will be a good approximation 


to the binomial distribution 
A = 9 = 3 
e>(5)3 


(@) AQ = 3) = 


= 0.140 


e>(5)? 


(D) PCS 7) = = 0.104 


EXERCISES 29.11 


1 


A computer network has several hundred computers. 
During an 8 hour period, there are on average seven 
computers not functioning. Find the probability that 
during an 8 hour period 

(a) nine (b) five 

do not function. 


A workforce has on average two people absent 
through illness on any given day. Find the probability 
that on a typical day 


(a) two 
(b) at least three 


(c) less than four 


people are absent. 


A machine manufactures 300 micro-chips per hour. 
The probability an individual chip is faulty is 0.01. 
Calculate the probability that 

(a) two 

(b) four 

(c) more than three 

faulty chips are manufactured in a particular hour. Use 


both the binomial and Poisson approximations and 
compare the resulting probabilities. 


Solutions 


1 
7 


(a) 0.1014 — (b) 0.1277 


(a) 0.2707 
(c) 0.8571 


(b) 0.3233 


4 (a) 0.1435 


The probability of a disk drive failure in any week is 
0.007. A computer service company maintains 900 
disk drives. Use the Poisson distribution to calculate 
the probability of 

(a) seven (b) more than seven 

disk drive failures in a week. 


The probability an employee fails to come to work is 
0.017. A large engineering firm employs 650 people. 
What is the probability that on a particular day 

(a) nine (b) 10 

people are away from work? 


A machine manufactures electrical components for 
the car industry at the rate of 750 per hour. The 
probability a component is faulty is 0.013. Use both 
the binomial distribution and the corresponding 
Poisson approximation to find the probability that in a 
sample of 200 components 

(a) none are faulty 

(b) one is faulty 

(c) two are faulty 


(d) three are faulty 


(e) more than three are faulty 


(a) Binomial 0.2244; Poisson 0.2240 
(b) 0.1689, 0.1680 
(c) 0.353, 0.353 


(b) 0.2983 
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5 (a) 0.1075 —(b) 0.1188 ey Oset, Senn 
(d) 0.2191, 0.2176 


6 (a) Binomial 0.0730; Poisson 0.0743 (c) 0.2634, 0.2640 


(b) 0.1923, 0.1931 


| 29.12 | THE UNIFORM DISTRIBUTION 


We now consider a continuous distribution — the uniform distribution. Suppose the prob- 
ability of an event occurring remains constant across a given time interval. The p.d_-f., 
f(t), of such a distribution takes the form shown in Figure 29.8. 


fo 


Ale 


Area = 1 


0 = Figure 29.8 
T t The uniform p.d_f. 


The area under f(t) must equal | and so if the interval is of length T, the height of 
1 


the rectangle is —. 


The p.d.f. for the uniform distribution is given by 


al ORS ali 
fO=4,T 


0 otherwise 


The probability an event occurs in an interval [a, b] is (a : f@d.ifO<a<cb<T 


b— 
this probability is simply ied We shall make use of this distribution in Section 29.15 
when we deal with reliability engineering. 


EXERCISES 29.12 
1. A random variable, x, has a uniform p.d.f. with 2 A random variable ¢ has a uniform p.d.f. with 
T = 10. Calculate the probability that T = 1.5. Calculate the probability that 
(a) L<&x<3 (b) 16< x < 9.3 (a) 0.7 <t< 1.3 (b) 1<t<2 
(c) x2 2.9 (d) x < 7.2 (c) |t| < 0.5 (d) |t| > 1 


(e) -l<x<2 (f) 9.1 <x < 12.3 


Solutions 


1 (a) 0.2 (b) 0.77. (c) 0.71 (d) 0.72 2 (a) 04 (b) 0.3333 (c) 0.3333 
(e) 0.2 (f) 0.09 (d) 0.3333 
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29.13 | THE EXPONENTIAL DISTRIBUTION 


Suppose a random variable has a Poisson distribution; for example, the random variable 
could be the number of customers arriving at a service point, the number of telephone 
calls received at a switchboard, the number of machines breaking down in a week. Then 
the time between events happening is a random variable which follows an exponential 
distribution. Note that whereas the number of events is a discrete variable, the time 
between events is a continuous variable. 

Let t be the time between events happening. 


The exponential p.d.f. f(t) is given by 
0) oe 7 S50) 
~ 10 otherwise 


where a > 0. 


The probability of an event occurring in a time interval, T, is given by i, f(t) dt. 
Figure 29.9 shows f(t) for various values of a. The expected value of the distribution is 
given, by definition, as 

ae 1 
expected value = = / tae“ dt = — 
0 a 


For example, if 


f(t) =3e* t in seconds t>0 


: 2 ll . 
then the mean time between events is 3 s; that is, on average there are three events 
happening per second. 


fo) 


a increasing 


Figure 29.9 
t The exponential p.d-f. for various values of a. 


Engineering application 29.7 


Time between breakdowns of a machine 


The time between breakdowns of a particular machine follows an exponential distri- 
bution, with a mean of 17 days. Calculate the probability that a machine breaks down 
in a 15 day period. 
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Solution 
1 1 

The mean time between breakdowns = 17 = —, soa = 7 Thus the p.d.f., f(f), is 
a 

given by 


1 
ft)= a eo 750 


We require the probability that the machine breaks down in a 15 day period: 


15 
AO Si < IS) = f(t) dt 
0 


15 ] Aq 
= = ah 
[ i 


15 
= |---| 
0 


2 Sree) 
= Oster 


There is a 58.62% chance that the machine will break down in a 15 day period. 


EXERCISES 29.13 


the time between breakdowns for a particular 


1. A service engineer receives on average an emergency ane 
machine is 


call every 3 hours. If the time between calls follows 
an exponential distribution, calculate the probability (a) greater than 450 hours 
that the time from one emergency call to the next is (hie leas than 350 hours 


(a) greater than 3 hours 


by less than 45h 3 The mean time taken by an engineer to repair an 
ess than 4.5 hours 


electrical fault in a system is 2.7 hours. Calculate the 


2 The mean time between breakdowns for a certain type probability that the engineer will repair a fault in less 
of machine is 400 hours. Calculate the probability that than the mean time. 
Solutions 
1 (a) 0.3679 (b) 0.7769 3 0.6321 


2 (a) 0.3247. (b): 0.5831 


">a §=6THE NORMAL DISTRIBUTION 


The normal probability density function, commonly called the normal distribution, is 
one of the most important and widely used. It is used to calculate the probable values 
of continuous variables, for example weight, length, density, error measurement. Prob- 
abilities calculated using the normal distribution have been shown to reflect accurately 
those which would be found using actual data. 
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N(x) N(x) N(x) (a) (b) 

7 . 1 i) > 
(a) i (b) - : My be x 
Figure 29.10 Figure 29.11 
Two typical normal curves. Two normal curves with the same 


29.14.1 


standard deviation but different means. 


Let x be a continuous random variable with a normal distribution, N(x). Then 


1 
NG) = —= eH) Per —-0O <x<0O 


ov 27 


where jz = expected (mean) value of x, 0 = standard deviation of x. Figure 29.10 shows 
two typical normal curves. All normal distributions are bell shaped and symmetrical 
about ju. 

In Figure 29.10(a) the values of x are grouped very closely to the mean. Such a dis- 
tribution has a low standard deviation. Conversely, in Figure 29.10(b) the values of the 
variable are spread widely about the mean and so the distribution has a high standard 
deviation. 

Figure 29.11 shows two normal distributions. They have the same standard deviation 
but different means. The mean of the distribution in Figure 29.11(a) is 2, while the mean 
of that in Figure 29.11(b) is 4. Note that the domain of N(x) is (—00, oo); that is, the 
domain is all real numbers. As for all distribution curves the total area under the curve 
is 1. 


The standard normal 


A normal distribution is determined uniquely by specifying the mean and standard de- 
viation. The probability that x lies in the interval [a, b] is 


b 
Pa<x<b)= | N(x) dx 


The mathematical form of the normal distribution makes analytic integration impossible, 
so all probabilities must be computed numerically. As these numerical values would 
change every time the value of jz or o was altered some standardization is required. To 
this end we introduce the standard normal. The standard normal has a mean of 0 and 
a standard deviation of 1. 

Consider the probability that the random variable, x, has a value less than z. For con- 
venience we call this A(z). 


A(z) =P <2) = / N(x) dx 
Figure 29.12 illustrates A(z). Values of A(z) have been computed numerically and tabu- 


lated. They are given in Table 29.7. Using the table and the symmetrical property of the 
distribution, probabilities can be calculated. 
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N(x) 


A(z) 


Figure 29.12 
ues X A@ =P@ <2 = f%,,N@) de. 


Example 29.20 Thecontinuous random variable x has a standard normal distribution. Calculate the prob- 


Solution 


ability that 
(a) x < 1.2 (b) x > 1.2 (c) x > -1.2 (d) x < —1.2 
(a) From Table 29.7 

P(x < 1.2) = 0.8849 


This is depicted in Figure 29.13. 


(b) P(x > 1.2) = 1 — 0.8849 = 0.1151 
This is shown in Figure 29.14. 


(c) By symmetry P(x > —1.2) is identical to P(x < 1.2) (see Figure 29.15). So, 
P(x > —1.2) = 0.8849 

(d) Using part (c) we find 
P(x < -1.2) =1-—P(> —-1.2)=0.1151 


(see Figure 29.16). 


N(x) N(x) 
0 1.2 x O 1.2 x 
Figure 29.13 Figure 29.14 
P(x < 1.2) = 0.8849. P(x > 1.2) =0.1151. 
N(x) N(x) 
-1.2 0 x -1.2 0 x 
Figure 29.15 Figure 29.16 


P(x > —1.2) = 0.8849. P(x < —1.2) = 0.1151. 
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Example 29.21  Thecontinuous random variable v has a standard normal distribution. Calculate the prob- 
ability that 


(a) O<v<l (b) -l<v<l (c) -O0.5<u<2 


Solution (a) Figure 29.17 shows the area (probability) required. 


P(v < 1) = 0.8413 using Table 29.7 
P(v <0) =0.5 using symmetry 
PO <v < 1) = 0.8413 — 0.5 = 0.3413 

(b) Figure 29.18 shows the area (probability) required. 


P(-1l <v<1)=2xP(O<v <1) using symmetry 
= 2 x 0.3413 = 0.6826 
This tells us that 68.3% of the values of v are within one standard deviation of the 
mean. 
(c) Figure 29.19 shows the area (probability) required. 
P(v < 2) = 0.9772 
P(v < —0.5) = P(v > 0.5) = 1 — P(v < 0.5) 
= | — 0.6915 = 0.3085 
P(—0.5 < v < 2) = 0.9772 — 0.3085 = 0.6687 


Note that whether or not inequalities defining v are strict is of no consequence in 
calculating the probabilities. 


NV) N(v) Nv) 
0 1 v -1 0 1 v -0.5 0 2 v 
Figure 29.17 Figure 29.18 Figure 29.19 
P(O <v < 1) = 0.3413. P(-1 <v < 1) = 0.6826. P(—0.5 < v < 2) = 0.6687. 


29.14.2 Non-standard normal 


Table 29.7 allows us to calculate probabilities for a random variable with a standard 
normal distribution. This section show us how to use the same table when the variable 
has a non-standard distribution. A non-standard normal has a mean value other than 0 
and/or a standard deviation other than 1. The non-standard normal is changed into a 
standard normal by application of a simple rule. Suppose the non-standard distribution 
has a mean w and a standard deviation o. Then all non-standard values are transformed 
to standard values using 


non-standard — standard 


on 
x > . 
o 
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Table 29.7 
Cumulative normal probabilities. 


Zz A(z) Zz A(z) Zz A(z) Zz A(z) z A(z) 


N 


A(z) 


0.00 0.5000000 0.40 0.6554217 0.80 0.7881446 1.20 0.8849303 1.60 0.9452007 2.00 0.9772499 
0.01 0.5039894 0.41 0.6590970 0.81 0.7910299 1.21 0.8868606 1.61 0.9463011 2.01 0.9777844 
0.02 0.5079783 0.42 0.6627573 0.82 0.7938919 1.22 0.8887676 1.62 0.9473839 2.02 0.978 3083 
0.03 0.5119665 0.43 0.6664022 0.83 0.7967306 1.23 0.8906514 1.63 0.9484493 2.03 0.978 8217 
0.04 0.5159534 0.44 0.6700314 0.84 0.7995458 1.24 0.8925123 1.64 0.9494974 2.04 0.9793248 


0.05 0.5199388 0.45 0.6736448 0.85 0.8023375 1.25 0.8943502 1.65 0.9505285 2.05 0.9798178 
0.06 0.5239222 0.46 0.6772419 0.86 0.8051055 1.26 0.8961653 1.66 0.9515428 2.06 0.9803007 
0.07 0.5279032 0.47 0.6808225 0.87 0.8078498 1.27 0.8979577 1.67 0.9525403 2.07 0.9807738 
0.08 0.5318814 0.48 0.6843863 0.88 0.8105703 1.28 0.8997274 1.68 0.9535213 2.08 0.981 2372 
0.09 0.5358564 0.49 0.6879331 0.89 0.8132671 1.29 0.9014747 1.69 0.9544860 2.09 0.981691 1 


0.10 0.5398278 0.50 0.6914625 0.90 0.8159399 1.30 0.9031995 1.70 0.9554345 2.10 0.982 1356 
0.11 0.5437953 0.51 0.6949743 0.91 0.8185887 1.31 0.9049021 1.71 0.9563671 2.11 0.9825708 
0.12 0.5477584 0.52 0.6984682 0.92 0.8212136 1.32 0.9065825 1.72 0.9572838 2.12 0.9829970 
0.13 0.5517168 0.53 0.7019440 0.93 0.8238145 1.33 0.9082409 1.73 0.9581849 2.13 0.983 4142 
0.14 0.5556700 0.54 0.7054015 0.94 0.8263912 1.34 0.9098773 1.74 0.9590705 2.14 0.983 8226 


0.15 0.5596177 0.55 0.7088403 0.95 0.8289439 1.35 0.9114920 1.75 0.9599408 2.15 0.9842224 
0.16 0.5635595 0.56 0.7122603 0.96 0.8314724 1.36 0.9130850 1.76 0.9607961 2.16 0.9846137 
0.17 0.5674949 0.57 0.7156612 0.97 0.8339768 1.37 0.9146565 1.77 0.9616364 2.17 0.9849966 
0.18 0.5714237 0.58 0.7190427 0.98 0.8364569 1.38 0.9162067 1.78 0.9624620 2.18 0.9853713 
0.19 0.5753454 0.59 0.7224047 0.99 0.8389129 1.39 0.9177356 1.79 0.9632730 2.19 0.985 7379 


0.20 0.5792597 0.60 0.7257469 1.00 0.8413447 1.40 0.9192433 1.80 0.9640697 2.20 0.9860966 
0.21 0.5831662 0.61 0.7290691 1.01 0.8437524 1.41 0.9207302 1.81 0.9648521 2.21 0.9864474 
0.22 0.5870604 0.62 0.7323711 1.02 0.8461358 1.42 0.9221962 1.82 0.9656205 2.22 0.9867906 
0.23 0.5909541 0.63 0.7356527 1.03 0.8484950 1.43 0.9236415 1.83 0.9663750 2.23 0.987 1263 
0.24 0.5948349 0.64 0.7389137 1.04 0.8508300 1.44 0.9250663 1.84 0.9671159 2.24 0.9874545 


0.25 0.5987063 0.65 0.7421539 1.05 0.8531409 1.45 0.9264707 1.85 0.9678432 2.25 0.9877755 
0.26 0.6025681 0.66 0.7453731 1.06 0.8554277 1.46 0.9278550 1.86 0.9685572 2.26 0.988 0894 
0.27 0.6064199 0.67 0.7485711 1.07 0.8576903 1.47 0.9292191 1.87 0.9692581 2.27 0.988 3962 
0.28 0.6102612 0.68 0.7517478 1.08 0.8599289 1.48 0.9305634 1.88 0.9699460 2.28 0.988 6962 
0.29 0.6140919 0.69 0.7549029 1.09 0.8621434 1.49 0.9318879 1.89 0.9706210 2.29 0.988 9893 


0.30 0.6179114 0.70 0.7580363 1.10 0.8643339 1.50 0.9331928 1.90 0.9712834 2.30 0.9892759 
0.31 0.6217195 0.71 0.7611479 1.11 0.8665005 1.51 0.9344783 1.91 0.9719334 2.31 0.9895559 
0.32 0.6255158 0.72 0.7642375 1.12 0.8686431 1.52 0.9357445 1.92 0.9725711 2.32 0.9898296 
0.33 0.6293000 0.73 0.7673049 1.13 0.8707619 1.53 0.9369916 1.93 0.9731966 2.33 0.9900969 
0.34 0.6330717 0.74 0.7703500 1.14 0.8728568 1.54 0.9382198 1.94 0.9738102 2.34 0.990358 1 


0.35 0.6368307 0.75 0.7733726 1.15 0.8749281 1.55 0.9394292 1.95 0.9744119 2.35 0.9906133 
0.36 0.6405764 0.76 0.7763727 1.16 0.8769756 1.56 0.9406201 1.96 0.9750021 2.36 0.990 8625 
0.37 0.6443088 0.77 0.7793501 1.17 0.8789995 1.57 0.9417924 1.97 0.9755808 2.37 0.991 1060 
0.38 0.6480273 0.78 0.7823046 1.18 0.8809999 1.58 0.9429466 1.98 0.9761482 2.38 0.991 3437 
0.39 0.6517317 0.79 0.7852361 1.19 0.8829768 1.59 0.9440826 1.99 0.9767045 2.39 0.9915758 


(continued) 
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Table 29.7 


Cumulative normal probabilities (continued). 


Zz A(z) F4 


A(z) Zz A(z) Z A(z) Zz A(z) Zz A(z) 


2.40 0.9918025 2.45 
2.41 0.9920237 2.46 
2.42 0.9922397 2.47 
2.43 0.9924506 2.48 
2.44 0.9926564 2.49 


0.992 8572 2.50 0.9937903 2.55 0.9946139 2.60 0.9953383 3.20 0.9993129 
0.9930531 2.51 0.9939634 2.56 0.9947664 2.70 0.9965330 3.40 0.999663 1 
0.993 2443 2.52 0.9941323 2.57 0.9949151 2.80 0.9974449 3.60 0.999 8409 
0.993 4309 2.53 0.9942966 2.58 0.9950600 2.90 0.9981342 3.80 0.9999277 
0.993 6128 2.54 0.9944574 2.59 0.9952012 3.00 0.9986501 4.00 0.999968 3 


4.50 0.9999966 
5.00 0.999 999 7 
5.50 0.999999 9 


This table is condensed from Table 1 of the Biometrika Tables for Statisticians, Vol. 1 (1st ed.), edited by E. S. Pearson 
and H. O. Hartley. Reproduced with the kind permission of E. S. Pearson and the trustees of Biometrika. 
Source: Statistics Vol. 1 Probability Inference and Decision by Hays, W. L. and Winkler, R. L. (Holt Rinehart & Winston, 


New York, 1970). 


Example 29.22 


Solution 


A random variable, h, has a normal distribution with mean 7 and standard deviation 2. 
Calculate the probability that 


(a) h>9 (b) h<6 (c) 5<h<8 


(a) Applying the transformation gives 
o=] 


9> ——_e=]1 


So h > 9 has the same probability as x > 1, where x is a random variable with a 
standard normal distribution: 


P(h > 9) = P(x > 1) =1—P(x < 1) = 1— 0.8413 = 0.1587 


(b) Applying the transformation gives 


6-7 
63 2 vee 
aa) 


So h < 6 has the same probability as x < —0.5: 
P(x < —0.5) = P(x > 0.5) = 1 — P(x < 0.5) = 0.3085 


(c) Applying the transformation to 5 and 8 gives 


5-7 8-7 
——_ = -1 —— = 0. 
55> 5 8 > 5 0.5 


and so we require P(—1 < x < 0.5). Therefore 
P(x < 0.5) = 0.6915 P(x < —1) = 0.1587 


and then 


P(-1 <x < 0.5) = 0.6915 — 0.1587 = 0.5328 


EXERCISES 29.14 


A random variable, x, has a standard normal 
distribution. Calculate the probability that x lies in the 
following intervals: 

(a) (0.25, 0.75) 

(b) (—0.3, 0.1) 

(c) within 1.5 standard deviations of the mean 

(d) more than two standard deviations from the mean 
(e) (—1.7, —0.2) 

A random variable, x, has a normal distribution with 
mean 4 and standard deviation 0.8. Calculate the 
probability that 

(a) 3.0<x<44 

(b) 2.5 <x<3.9 

(c) x > 4.6 

(d) x < 4.2 

(e) xis within 0.6 of the mean 


A random variable, t, has a normal distribution with 
mean | and standard deviation 2.5. Calculate the 


probability that 
(a) -l<r<g2 (be) r>0 
(c) |t] < 0.9 (d) |t| > 1.6 


The scores from IQ tests have a mean of 100 and a 
standard deviation of 15. What should a person score 
in order to be described as in the top 10% of the 
population? 


A machine produces car pistons. The diameter of the 
pistons follows a normal distribution, mean 6.04 cm 
with a standard deviation of 0.02 cm. The piston is 
acceptable if its diameter is in the range 6.010 cm to 
6.055 cm. What percentage of pistons is 


acceptable? 

Solutions 
(a) 0.1747 (b) 0.1577 (c) 0.8664 
(d) 0.0455 (e) 0.3762 
(a) 0.5858 (b) 0.4199 (c) 0.2266 
(d) 0.5987 (e) 0.5467 
(a) 0.4436 (b) 0.6554 (c) 0.2604 


(d) 0.5544 


6 
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The random variable, x, has a normal distribution. 
How many standard deviations above the mean must 
the point P be placed if the tail-end is to represent 


(a) 10% (b) 5% (c) 1% 
of the total area? (See Figure 29.20.) 


N(x) 


Figure 29.20 
Graph for Question 6. 


Consider Figure 29.21. The two tail-ends have equal 
area. How many standard deviations from the mean 
must A and B be placed if the tail-ends are 


(a) 10% (b) 5% (c) 1% 
of the total area? 
N(x) 
> 
A B # 
Figure 29.21 
Graph for Question 7. 
119 
71% 
(a) 1.28 (b) 1.645 (c) 2.33 
(a) 1.64 (b) 1.96 (c) 2.57 
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RELIABILITY ENGINEERING 


Reliability engineering is an important area of study. Unreliable products lead to hu- 
man frustration, financial loss and in the case of life-critical systems can lead to death. 
As the complexity of engineering systems has increased, mathematical methods of as- 
sessing reliability have grown in importance. Probability theory forms a central part of 
the design of highly reliable systems. For most items the failure rate changes with time. 
A common pattern is exhibited by the appropriately named ‘bath tub’ curve, illustrated 
in Figure 29.22. 

Car part failures are quite well modelled by this distribution. For example, a crankshaft 
may fail quite quickly as a result of a manufacturing defect. If it does not, then there is 
usually a long period during which the likelihood of failure is low. After many years the 
probability of failure increases. 

Consider the probability of an item failing over a total time period T. This period, T, 
is the time during which the item is functioning. The time taken to repair the item is not 
considered in this calculation but is considered a little later on. Suppose the probability of 
failure is evenly distributed over this period; that is, the probability of failure is modelled 
by the uniform distribution (see Section 29.12). It is important to note that this is a fairly 
simplistic assumption. If N = number of failures of the item over a time period T, it is 
possible to define a mean failure rate, k, by 


k= = 
T 


For example, if an item fails 10 times in a period of 5 years we define the mean failure 
rate to be k = 10/5 = 2, that is two failures per year. Because of the uniform distribution 
of the failures across the time period, the quantity & is constant. To illustrate this consider 
the previous example with a period of 10 years. During this time the item will fail 20 
times and so 


20 : 
k= To = 2 failures per year 


as found earlier. 
Another useful term is the mean time between failures (M7BF), which is given by 


1 
MTBF = — 
k 


The term MTBF is only used for items that are repairable. 

Let the interval T be divided into n small sub-intervals each of length 67, that is 
néT = T. Suppose each sub-interval is so small that only one failure can occur during 
it. Note that since repair time has been neglected it is always possible to have a failure 
in a sub-interval. So the N failures which occur during time T occur in N distinct sub- 


Failure rate 


Figure 29.22 
Time The ‘bath tub’ curve. 
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intervals. The probability of failure occurring in a particular sub-interval, P;, is then 
given by 


number of sub-intervals in which failure occurs 


total number of sub-intervals 


In each sub-interval, 57, the probability of an item not failing, P_,, is 


N n-N 
| ee ey ae 
n n 


As the item progresses through each of the successive sub-intervals 6T it can be thought 
of as undergoing a series of trials, the result of which is failure or non-failure. Therefore, 
the probability of the item not failing as it passes through all of the n sub-intervals can 
be obtained by multiplying each of the sub-interval probabilities of not failing, that is 


= N n 
probability of not failing in time T = (" ) 
n 


As the sub-interval 5T becomes small, that is 67 —> O, the number of sub-intervals, n, 
becomes large, that is 1 —> oo. Hence we need to consider the quantity 


lim 
noo n 


This limit can be shown to be e~” (see Appendix IV). Hence 


probability of not failing in time T =e 


Finally, the probability of an item failing one or more times — items can be repaired and 
fail again — in the time interval T is given by 


P(T)=1-e%=1-e"" 


P(T) is the probability of at least one failure in the time period T. As a consequence 
of the constancy of k, this formula can be used to calculate the probability of an item 
failing one or more times in an arbitrary time period, ft, in which case 


P(t) =1—e™ (29.1) 


It is important to stress that this formula only applies if the probability of failure is evenly 
distributed. 


Engineering application 29.8 


Breakdowns on a factory process line 


A factory process line makes use of 12 controllers to maintain process variables at 
their correct values; all 12 controllers need to be working in order for the process 
line to be operational. Records show that each controller fails, on average, once 
every six months. Calculate the probability of the process line being stopped as a 
result of a controller failure within a time period of one month. 
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Solution 

The mean failure rate of each controller is 1/6 breakdowns per month. Given that 
there are 12 controllers the overall failure rate for controllers is 12/6 = 2 breakdowns 
per month. Using Equation (29.1) with k = 2 and t = | gives the probability, P(t), 
of the process line being stopped because of a controller failure within a period of 
one month as 


AQ) = il=e* = il=e 7 aOR 


So far we have ignored the ‘downtime’ associated with an item waiting to be repaired 
after it has failed. This can be a significant factor with many engineering systems. The 
simplest possibility is that an item is out of action for a fixed period of time 7, while it is 
being repaired. If there are N failures during a period T then the total downtime is NT,. 
The time the item is available, 7), is given by 


2» “a? 
T, =T —NT. 


A useful quantity is the fractional dead time, D, which is the ratio of the mean time the 
item is in the dead state to the total time. In this case 


pao" (29.2) 
== 


Another useful quantity is the availability, A, which is the ratio of the mean time in the 
working state to the total time. For the present model 


=——t=1-D 
T 
The failure rate model developed previously was based on the time the item was working 
rather than the total time. When the repair time is included k becomes 


N 
k= 
L. 


a 


and so 


N =kT, =k(T —NT) 


N + KNT, = kT 
and therefore 
N= kT 
~ L+KT, 
So using Equation (29.2) 
_ APT, ORG 
~ L+kE.T 14K. 
Also 
kT. 1 
A=1-D=1 : (29.3) 


ree ee 


For more complicated repair characteristics the equations for D and A are correspond- 
ingly more complicated. 


29.15.1 
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Engineering application 29.9 


Availability of an electrical supply to a large factory 


The electrical supply to a large factory has a mean time between failures of 350 hours. 
When the supply fails it takes 3 hours to repair the failure and restore the supply. 
Calculate the average availability of the electrical supply to the factory. 


Solution 


Failure rate of the supply is k where k = 1/350 failures per hour. Using Equa- 
tion (29.3) with T. = 3 hours gives 


A=———__ = 0,992 _that is, 99.2% 
1+ 3/350 cas 


The supply is up and running for 99.2% of the time. 


So far we have only examined systems in which failure was caused by one or more 
components each with the same failure rate or MTBF. A more common situation is one in 
which the different components of a system have different degrees of reliability. It is still 
useful to be able to calculate the overall reliability of the system although the analysis 
is more complicated. In order to do so it is necessary to define the term reliability. From 
Equation (29.1) we know that P(t) defines the probability of one or more failures during 
atime period, t. Therefore the probability of no failures is given by | — P(t). The quantity 
1— P(t) is called the reliability of the system during a time period f, and is denoted R(f), 
that is 


R(t) =1—P(t) =e" (29.4) 


R(t) can be interpreted as the probability a component works properly during a period 
t. We now examine the reliability of two simple system configurations. 


Series system 


A series system is one in which all the components of a system must operate satis- 
factorily if the system is to function correctly. Consider a system consisting of three 
components, shown in Figure 29.23. The reliability of the system is the product of the 
reliabilities of the individual components, that is 


R=R,RpRo (29.5) 


This formula is a direct consequence of the fact that the failure of any one of the com- 
ponents is an independent event. So the probability of the system not failing, that is its 
reliability, is the product of the probabilities of each of the components not failing. (See 
Section 28.7 for independent events.) 


Input a | Output Figure 29.23 
Series system. 
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Engineering application 29.10 


Reliability of a radio system 


A radio system consists of a power supply, a transmitter/amplifier and an antenna. 
During a 1000 hour period the reliability of the various components is as follows: 


Ry, = 0.95 
Ry = 0.93 
R, = 0.99 


Calculate the overall reliability of the radio system. 


Solution 
R=R_ RR. =0.95 x 0.93 x 0.99 = 0.87 


ps ta’ ‘a 


that is, there is an 87% chance that the radio will not fail during a 1000 hour period. 


Using Equations (29.4) and (29.5) it is possible to generate a formula for the reliability 
of a system consisting of n series components: 


R= eit ew kt wae ent = en ki tht +k, ie 


where k,,k,,...,k,, are the mean failure rates of the n components. 


Engineering application 29.11 


Reliability of a satellite link 


A satellite link is being used to transmit television pictures from America to England. 
The components of this system are the television studio in New York, the transmitter 
ground station, the satellite and the receiver ground station in London. The MTBF of 
each of the components is as follows: 

MTBF,, = 1000 hours 

MTBF’, = 2000 hours 

MTBF, = 500000 hours 

MTBF,,, = 5000 hours 


Calculate the overall reliability of the system during a 28 day period and a yearly 


period of 365 days. 
Solution 
First we calculate the mean failure rate of each of the components: 
1 
k,, = =x = 1 x 10° failures per hour 
1000 
1 is 
Kiog = sanx = 5 x 10 “failures per hour 
ss 2000 
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1 


—— = 2x 10~° failures per hour 
500 000 
1 
kegs = ieee 2 x 10~ failures per hour 


So the overall reliability of the system is 


Rie ee cites tetas) ae (1x 107345 x 10-42 x 1076+2 1074 )r 


= e7!-702x iO 


For t = 28 x 24 hours, R = 0.319. 
For t = 365 x 24 hours, R = 3.35 x 107’. Clearly during a yearly period the system 
is almost certain to fail at least once. 


29.15.2 Parallel system 


A parallel system is one in which several components are in parallel and all of them 
must fail for the system to fail. The case of three components is shown in Figure 29.24. 
The probability of all three components failing in a time period, f, is the product of the 
individual probabilities of each component failing, that is (1 — R,)(1 — Rg)( — Re). 
So the overall system reliability is 


R=1-(1—R,)(1— Rg) — Ro) 


This formula can be generalized to the case of n components in parallel quite easily. 


Output 


Figure 29.24 
Parallel system. 


Engineering application 29.12 


Reliability of process control computer power supplies 


A process control computer is supplied by two identical power supplies. If one fails 
then the other takes over. The MTBF of the power supplies is 1000 hours. Calculate 
the reliability of the power supply to the computer during a 28 day period. Compare 
this with the reliability if there is no standby power supply. 


Solution 
1 
k 1000 0.001 failures per hour 


ie a e fiat @ 0-001 x24x28 = 0.511 
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Therefore the overall reliability of the power supply to the computer is 
R=1-—(1—0.511)d — 0.511) = 0.761 


Clearly the existence of a standby power supply does improve the reliability of the 
power supply to the computer. In practice, the figure would be much higher than 
this because once a power supply failed the standby would take over and main- 
tenance engineers would quickly repair the failed power supply. Therefore the pe- 
riod of time in which the computer was relying on one power supply would be very 


small. 


We have only examined the simplest possible reliability models. In practice, reliability 
engineering can often be very complicated. Some models can cater for maintenance 
strategies of the sort we touched on in the previous example. The effect of non-uniform 
failure rates can also be catered for. Also, many systems consist of a mixture of series and 
parallel subsystems of the type we have discussed as well as more complicated failure 


modes than we have examined. 


EXERCISES 29.15 


A computer contains 20 circuit boards each with an 
MTBF of 3 months. Calculate the probability that the 
computer will suffer a circuit board failure during a 
monthly period. 


Three pumps are required to feed water to a boiler in a 
power station in order for it to be fully operational. 
The MTBF of each of the pumps is 10 days. Calculate 
the probability that the boiler will not be fully 
operational during a monthly period if there are only 
three pumps available. 


A process control computer has a power supply with 
an MTBF of 600 hours. If the power supply fails then 
it takes 2 hours to replace it. Calculate the average 
availability of the computer power supply. 


A radar station has an MTBF of 1000 hours. The 
average repair time is 10 hours. Calculate the average 
availability of the radar station. 


A process line to manufacture bread consists of four 
main stages: mixing of ingredients, cooking, 
separation and finishing, packaging. Each of the 
stages has an MTBF of 

MTBF,,, = 30 hours 

MTBF, = 10 hours 

MTBF = 20 hours 

MTBF, = 15 hours 


Calculate the probability that the process line will be 
stopped during an 8 hour shift. 


A remote water pumping station has two pumps. Only 
one pump is needed in normal operation; the other 
acts as a standby. Access to this station is difficult and 
so if a pump fails it is not easy to repair it 
immediately. Given that the MTBF of the pumps is 
3000 hours, calculate the overall reliability of the 
pumping station during a 28 day period in which the 
maintenance engineers are unable to repair a pump 
which fails. Calculate the improvement in reliability 
that would be obtained if a second standby pump was 
installed in the pumping station. 


Repeat the calculation carried out in Question 6 with 
the following information. The pumps, which are 
different from those in Question 6, can be considered 
to consist of two components: a motor and the pump 
itself. The MTBF of the motors is 4000 hours and that 
of the pumps is 6000 hours. 


Solutions 


0.999 
0.9998 
0.9967 


0.9901 
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6 
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0.865 
0.960, 0.992 


0.940, 0.985 


REVIEW EXERCISES 29 


1 


Find the mean and standard deviation of 
8, 6.9, 7.2, 8.4, 9.6, 10.3, 7.4, 9.0 


A discrete random variable, v, has a probability 
distribution given by 


v 2 1:5 1 0.5 0 0.5 1.0 
P(v) 0.23 0.17 0.06 0.03 0.17 0.31 0.03 


Calculate 
(a) P(v = —-1) (b) P(v = —2 orv = 0) 
(c) Pv = 0) (d) P(v < —0.5) 


A continuous random variable, x, has a p.d.f., f(x), 
given by 


fx) =3x% O<x<l 
(a) Find P(O <x < 0.5). 
(b) Find P(x > 0.3). 
(c) Find P(x < 0.6). 
(d) Find the expected value of x. 
(e) Find the standard deviation of x. 


A discrete random variable, y, has a probability 
distribution given by 


y —0.25. 0.25 0.75 1.25 1.75 
Py) 0.25 0.20 0.10 0.15 0.30 


(a) Find the expected value of y. 
(b) Find the standard deviation of y. 


The random variable, t, has a p.d.f., H(t), given by 
H(t)=ae“ +350 


(a) Calculate the expected value of rt. 


(b) Calculate the standard deviation of t. 


6 


8 


Calculate the number of permutations of 
(a) five distinct objects, taken two at a time 


(b) eight distinct objects, taken four at a time 


Calculate the number of combinations of 

(a) seven distinct objects, taken four at a time 

(b) 200 distinct objects, taken 198 at a time 

The probability a component is manufactured to an 
acceptable standard is 0.92. Twelve components are 
picked at random. Calculate the probability that 

(a) six are acceptable 

(b) 10 are acceptable 

(c) more than 10 are acceptable 

(d) at least two are not acceptable 

The number of breakdowns of a computer system in a 
month is a random variable with a Poisson 
distribution. The mean number of breakdowns per 
month is two. Calculate the probability that the 
number of breakdowns of the system in a month is 
(a) two 

(b) three 

(c) one 

(d) more than two 

The probability of a computer failing in a system in a 


given week is 0.03. In a system there are 150 
computers. 


(a) Use the binomial distribution to calculate the 
probability that in a given week five computers 
fail. 

(b) Use the binomial distribution to calculate the 
probability that in a given week more than two 
computers fail. 

(c) Use the Poisson approximation to calculate the 
probability that in a given week four computers 
fail. 
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(d) Use the Poisson approximation to calculate the 
probability that in a given week more than three 
computers fail. 


A random variable, t, has a uniform distribution, f(t), 
given by 
1 
fO= | 0 
Calculate the probability that 
(a) l<t<3 (b) 0.2 <t <2.7 
(c) |t| > 1 (d) |t| < 1.5 


0<t<4 
otherwise 


The time, t, between failures for a particular type of 
electrical component follows an exponential 
distribution with a mean value of 24 weeks. Calculate 
the probability that 


(a) t > 24 (b) t < 22 

The resistance of resistors is a random variable with a 
normal distribution with a mean of 3 ohms and a 
standard deviation of 0.1 ohm. Calculate the 
probability that a resistor has a resistance of 

(a) between 2.9 and 3.05 ohms 

(b) more than 2.95 ohms 

(c) less than 2.83 ohms 

The capacitance of capacitors is a random variable 


with a normal distribution, with a mean of 12 farads 
and a standard deviation of 0.6 farads. In a batch of 


Solutions 


mean = 8.35; st. dev. = 1.1314 


(a) 0.06 (b) 0.4 (c) 0.51 (d) 0.46 


(b) 0.973 
(e) 0.194 


(c) 0.216 


(b) 0.799 
b 1 
(b+) 5 
(b) 1680 
(a) 35 (b) 19900 


(a) 1.4687 x 1074 
(c) 0.7513 


(b) 0.1835 
(d) 0.2487 


(a) 0.2707 
(d) 0.3233 


(b) 0.1804 — (c) 0.2707 


16 


18 


10 


11 


12 
13 
14 
15 
16 
17 
18 


300 capacitors how many would you expect to have 
with capacitance 


(a) between 11 and 12.3 farads 
(b) more than 11.6 farads 
(c) less than 12.8 farads? 


A computer fails, on average, once every 4 months. 
An engineering system uses nine computers, all of 
which need to be working for the system to operate. 
Calculate the probability that the system will fail 
sometime during 


(a) a1 month period 


(b) a2 week period (i.e. half a month) 


The MTBF for a motor is 270 hours. When the motor 
fails it takes 7 hours to repair. Calculate the 
availability of the motor. 


A system comprises four components in series. The 
MTBF for each component is 170 hours, 200 hours, 
250 hours and 210 hours. Calculate the reliability of 
the system over a 100 hour period. 


A system comprises two components in parallel. The 
components have an MTBF of 150 hours and 

170 hours. Calculate the probability the system will 
fail in a 200 hour period. 


(a) 0.1736 (b) 0.8307 (c) 0.1898 
(d) 0.6577 

(a) 0.5 (b) 0.625 (c) 0.75 
(d) 0.375 

(a) 0.3679 (b) 0.6002 

(a) 0.5328 (b) 0.6915 (c) 0.0446 
(a) 193 (b) 225 (c) 272 
(a) 0.8946 (b) 0.6753 

0.9747 

0.1402 

0.5093 
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REPRESENTING A CONTINUOUS FUNCTION AND A 
SEQUENCE AS A SUM OF WEIGHTED IMPULSES 


In this appendix we show how a continuous function, f(t), or a discrete sequence, f[k], 
obtained by sampling this function, can be represented as a sum of weighted impulses. 
Such a representation is important in the study of the z transform and the discrete Fourier 
transform. 


The delta function 


The delta function is introduced in Chapter 2. For ease of reference we restate its devel- 
opment here. The delta function 6(t), or unit impulse function, is the limit of a rectangle 
function bounding an area of | unit, and located at the origin, as its width approaches 
zero, and its height increases accordingly to ensure that the area remains |. The enclosed 
area is known as the strength, or the weight, of the impulse. This is illustrated in Fig- 
ure AI.1. Note that the impulse is represented by an arrow and the height of the arrow 
gives the strength of the impulse. If the impulse occurs at the point t = d, then this is 
written 6(t—d). Further, by multiplying the delta function by a number A, to give Ad(f), 
we obtain the limit of a rectangle function bounding an area of A. This is an impulse of 
strength A. 


Sampling a continuous function 


Now consider a function f(t) defined for t > 0 as shown in Figure AI.2. Suppose this 
function is sampled at times t = 0,7, 27,3T,...,kT,..., to generate a sequence of 
values f(0), f(T), f2T), fT), ..., f(kT), ..., which we shall write as f[0], f[1], 
F(21, f[3],.-.., SLA), ..-- 

Note from the discussion above that the quantity f[0]5(r) is the limit of a rectangle 
function located at the origin and bounding an area equal to f[O]. 
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Figure Al.1 
The limit of the rectangle function is the delta function. 


f(t) 


0 t 0 T 27 37... kT t 


Figure Al.2 
Sampling a continuous function f(t) gives the sequence f[k]. 


Similarly, f[1]6(¢ — T) is the limit of a rectangle function bounding an area equal to 
f{1] and located att = T. 

In general f[k]5(t — kT) is the limit of a rectangle function bounding an area equal 
to f[k] and located at t = kT. 

The quantity 


Y= flk18(t — kT) 


k=0 


is a series of delta functions. The area bounded by these is given by 


area = f[0] + f+ fl2]+---+flkl+---, — thatis 0 f{k] (AL.1) 
k=0 


Approximating the area under a curve 


Now consider approximating the area under the continuous function f(t) by a series of 
rectangular areas as shown in Figure AI.3. The first rectangle has width T and height 
Jf (O) and so its area is T f(0). Using our sampling notation this may be written T f[0]. 
Similarly, the second rectangle has width T and height f(T) and so its area is T f(T). 
Using the sampling notation this is written T f[1]. The third rectangle has area T f[2] 
and so on. The total area is thus given by 


area = Tf[O] + TfUI+Tf21+---+Tflt+:-=TY fle (AI.2) 
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Figure Al.3 
The area under f(t) can be approximated by a series of rectangular areas. 


Now compare Equations (AI.1) and (AI.2). If we multiply the series of delta functions 
by T we see that 


TS f[k]5(t — kT) 
k=0 
will bound the same area as our approximation to the area under the graph of f(t). In 
this sense we can regard 


TS f{kI8(t — kT) 
k=0 x 
as an approximation to the function f(t). We will denote this approximation by f(r). 
In summary, a continuous function f(f) can be represented as a sum of weighted 
impulses each of strength f[A] occurring at t = kT: 


FO) =T YT FURS — kT) 


k=0 


This representation can also be thought of as a way of expressing a discrete sequence of 


values, f[k], as a continuous function f(t). This is useful when studying the z transform 
and the discrete Fourier transform. Sometimes it will be convenient to work without the 
factor T, in which case we define 


f° O = >> fst — kT) 


k=0 
It should be remembered that when using this form, f* needs to be multiplied by the 
factor T in order to approximate the function f(t). 


THE GREEK ALPHABET 


A a alpha I l iota P p tho 

B B beta K K kappa x o sigma 
Tr y gamma A Xr lambda T T tau 

A 5 delta M a mu Y v upsilon 
E € epsilon N v nu ® p phi 

Z c zeta 1S é xi xX x chi 

H n eta O oO omicron wv w psi 

iS) 6 theta Il re pi Q oO omega 
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SI UNITS AND PREFIXES 


Throughout this book SI units have been used. Below is a list of these units together with 
their symbols. 


Quantity SI unit Symbol Prefix Symbol 


length metre m 10!5 peta P 
mass kilogram kg 10!2 tera T 
time second s 10° giga G 
frequency hertz Hz 10° mega M 
electric current ampere A 10° kilo k 
temperature kelvin K 102 hecto h 
energy joule J 10! deca da 
force newton N 107! deci d 
power watt WwW 10-2 centi c 
electric charge coulomb Cc 10-3 milli m 
potential difference volt Vv 10-6 micro we 
resistance ohm Q 10-9 Wan fi 
capacitance farad F 19712 pico p 
inductance henry H 10-15 femto f 


THE BINOMIAL EXPANSION OF (2-%)" 


n 
n 
n—N 


n 


Consider the binomial expansion of 
n— N n _ i N n 
n 7 n 
N -1)( NY 
es m= 1) (_ 
n 2! n 


mln - oe a) (-*) . 


n 


Suppose now we let n > oo. We find 


. n—N\" NN? N? 
lim =1-N+—-—-—+-:: 


noo n 


But this is the power series expansion of e~” (see Section 6.5). We conclude that 


n 
: n—N _ 
im ( ) =e 
noo n 


In particular, note that if VN = —1 


; (" a :): 
lim =e 
noo n 


absolute quantity 88 
absorption laws 179, 185, 186 
acceleration 690, 868 
addition: 
algebraic fraction 30-1 
complex numbers 329 
law of probability 909-13, 954 
matrix 259-60 
vectors 225-6, 336-7 
adjoint matrix 281-2 
algebra see Boolean algebra; matrix algebra 
algebraic equations 627, 649 
algebraic fractions 26-33 
addition 30-1 
division 29-30 
equivalent fractions 27-8 
expressing fraction in simplest form 28-9 
multiplication 29-30 
proper and improper fractions 27 
resistors in parallel 32 
subtraction 30-1 
algebraic techniques 1-53 
algebraic fractions see algebraic fractions 
indices, laws of 2-11 
dividing expressions 4-5 
first law 2 
fractional indices 8-11 
multiple 7 
multiplying expressions 2-4 
negative indices 5, 6 
positive indices 6 
power density of signal transmitted by radio antenna 6 
power dissipation in resistor 3-4 
radar scattering 7-8 
second law 4 
skin depth, in radial conductor 9-10 
third law, 8 7 
inequalities, solution of 33-8 
number bases see base, number 
partial fractions see partial fractions 
polynomial equations 20-6 
current used by electric vehicle 21-3 
of higher degree 24-5 
quadratic equations 20-4 
summation notation 46-50 


Kirchhoff’s current law 47-8 
Kirchhoff’s voltage law 48-9 
alphabetic character stream, information content of 916 
alternating current: 
circuits 324 
waveforms 134 
Ampére’s circuital law 900 
amplitude 581, 724, 744, 751-3 
Fourier transform 766 
modulation 767-8 
trigonometric functions 133, 134, 135, 136 
of wave 131 
analog-to-digital converter 690 
analogue simulation 603-5 
analytical integration 457 
analytical solutions 534, 618-19, 621-2 
AND gate 184, 187, 188 
angular acceleration 613 
angular frequency 133, 134, 135, 136, 340 
Fourier series 723-4, 753 
fundamental 723-4 
of wave 131 
antennae 169-70 
approximate solution 615 
arbitrary constant 538, 589, 608 
arbitrary function 876 
arbitrary scaling constant 304 
Argand diagram 332, 334, 337, 345-6, 351 
argument: 
of angle 333 
of function 57-8 
arithmetic mean (mean value) 938-40 
arithmetic progressions 204 
arithmetic series, finite 209-10 
armature current/voltage 612-13 
arrays | 
arrows (signals) 661 
associative laws 179, 186 
associative matrix addition 260 
associative matrix multiplication 264 
asymptote 76-7, 81 
Bode plot of transfer function with real poles and zeros 
671-5 
of graph 75 
oblique 76 


983 


984 Index 


attenuation 87, 88, 578 

in step-index optical fibre 89-90 
augmented matrix 309 
autocorrelation 814, 815-16, 817 
automated vehicles, routing 226-7 
auxiliary equation 562-4, 572 

particular integral 574 
availability, in reliability 972 
average information per event 916 
average value: 

of function 471-5 

of periodic waveform 477-8 


back substitution 287-90 
backward wave 578-80 
base, number 2, 11-20, 80 
binary coded decimal 11, 17-20 
binary system 11-14 
decimal system 11 
hexadecimal system 11, 14-17 
seven-segment displays 18-19 
“bath tub’ curve 970 
Bernoulli, J. 953 
Bernoulli trials 953, 956, 959 
Bessel functions 584, 586, 587-601 
first kind of order one 591 
first kind of order zero 589 
in frequency modulation 598-601 
and generating function 595-7 
of higher order 592-3 
Jacobi-Anger identities 597-8 
modes of cylindrical microwave cavity 593-5 
second kind of order one 591 
second kind of order zero 589 
tabulated values of 601 
Bessel’s equations 584, 587-601 
of higher order 592-3 
of order one 589-91 
of order zero 587-9 
power series solution of 587-8, 590-1 
biased circuit 379 
binary coded decimal 11, 17-20 
binary data stream: 
entropy of 917 
information content of 916 
redundancy of 918 
binary digits (bits) 11, 17 
binary full-adder circuit 190-2 
binary system 11-14 
binary to decimal conversion 12 
decimal to binary conversion 12-14 
binary words 190-1 
binomial distribution 953-7 
mean and standard deviation 955 
most likely number of successes 956 
probability of k successes from n trials 954-5 
binomial expansion 982 


binomial theorem 214—18, 700, 717-18 
Biot-Savart law 250 
bit stream 915 
bits (binary digits) 11, 17 
block 627-8, 661 

basic 661 

delay 688-9 

diagram 661, 663-4, 665, 688-94 
Bode plot of linear circuit 96-7 


Bode plot of transfer function with real poles and zeros 


671-5 
Boolean algebra 175, 183, 185-97 
expressions 185, 187-8 
laws of 185-7 
logical equivalence 187-93 
variables 185 
bounded sequence 205 
branch current 307-10 
breakpoints 673, 674 
byte 11, 17 


capacitance 575, 637 
of coaxial cable 448-50 
mutual 102 
between two parallel wires 102-3 
capacitive reactance 435 
capacitors 577, 637 
with capacitance 435 
current through 368-9 
discharge of 82, 655 
energy stored in 487-8 
phasors 341-3 
in series 484-5 
voltage across 435 
cardinal sine function 123-4 
carrier 599 
carrier signal 767 
carry-out value 190 
Cartesian components 232-40 
line and multiple integrals 871 


Cartesian coordinate system 160-1, 162, 166-7, 171 


complex numbers 333-4 
functions of more than one variable 823-5 
three dimensions 157-8 
two dimensions 154-7 
vectors 240 
Cartesian equations 169 
Cartesian form 334 
complex numbers 342 
discrete Fourier transform 793 
cathode ray tube, electrical potential inside 824 
chain rule 389-91, 395, 463 
character data stream, redundancy of 918 
characteristic equation 299, 300-2 
characteristic impedance 348 
of coaxial cable 450-1 


charged particle, movement in electric field 244-5 


chord 357-8 
circuital law (Ampére) 900 
circuits: 

biased 379 

binary full-adder 190-2 

digital electronic 175 

integrated 428 

inverting 437 

LC 572-3 

LCR 560-1 

linear 97 

RC 535-6, 554-7 

RL 544-5, 553-4, 651-2 

RLC 109, 573-5 

thyristor firing 473-4 

truth table for 187-8 
circular (periodic) convolution 803-7 
circular convolution theorem 807-12 
circular correlation theorem 816-20 
circular cross-correlation 815-16, 818-20 
closed form (binomial theorem) 216 
closed loop system with negative feedback 665 


CMOS (complementary metal-oxide semiconductor) logic 


192-3 
co-domain 180-1 
coaxial cable 576-8 
capacitance of 448-50 
characteristic impedance of 450-1 
codes (communication theory) 918 
coding theory 919 
coefficient: 
damping 610 
difference equations and the z transform 685-6 
differential equations, linear constant 649-59 
equating 40-1, 585, 586 
Fourier 734, 739, 741-2, 748-50, 766 
matrix 285, 289 
polynomial 20 
reflection 581-2 
cofactor of element 279, 282 
column vector 233, 258-9, 261-2 
n-component 612 
p-component 612 
r-component 612 
combinations 950-2 
common derivatives 372-4 
common difference 204 
common ratio 204-5, 211, 213 
communication engineering 903 
communication theory 915-19 
commutative laws 179, 186, 191 
commutative numbers 259-60 
complement: 
laws 179, 186, 189-90, 191 
in probability 904 
in set theory 178-9 
complementary events 913-15 
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complementary function 558, 561-9, 572 
inhomogeneous term appears in 575-83 
particular integral 574-5 


complementary metal-oxide semiconductor (CMOS) logic 


192-3 

completeness and periodic functions 733 
completing the square 23, 37-8 
complex frequency: 

function 751 

variable 636 
complex notation and Fourier series 749-51 
complex numbers 97, 324-55 

addition 329 

complex conjugate pairs 327 

De Moivre’s theorem 344-51 

division 330 

in polar form 335 

exponential form 337-8 

Fourier series 751 

graphical representation 332-3 

imaginary part 325-6, 328-30, 334 

and inverse Laplace transform 643-6 

loci and regions of complex plane 351-3 

modulus 333 

multiplication 329-30 

in polar form 335 

operations with 328-30 

phasors 340-4 

polar form 333-6 

Poynting vector 343-4 

real part 326, 328-30, 334 

subtraction 329 

vectors and 336-7 
complex plane 332 
complex poles 668, 669 
complex roots 24, 564, 567 
complex translation theorem 714 
component laws, differential equations 572, 577 
composition, of functions 61-2 
compound events 905-8, 954 
compression, image and audio 795 
computer software packages 73 

Fourier transform 816-17 

graph plotting 723 

Laplace transform 675 

see also MATLAB® 
computer solutions for matrix algebra 319-21 
concave down function 402-3, 415-17 
concave up function 402-3, 415-17 
conditional probability 919-25 
conduction current 369 
conjunction (AND gate) 184 
conservative fields 875-80 
constant 689 

arbitrary 538, 589, 608 

arbitrary scaling 304 


coefficient differential equations, linear 627, 649-59 
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constant (Continued) 
coefficient equations 560-84 
exponential 81 
of integration 429 
phase 581 
time 82, 545 
constant polynomial 71 
continuous function 64—5, 361 
and sequence representation as sum of weighted impulses 
979-81 
approximating area under a curve 980-1 
delta function 979 
sampling a continuous function 979-80 
continuous integrand 491 
continuous random variable 965-6 
expected value 944-5 
standard deviation 947 
continuous signal sampling 702-4 
continuous spectra 766 
continuous variable 962, 963 
control engineering 319 
control systems 628 
controller signal 693 
converged series 211 
convergence, radius of 218, 698 
convergence criteria 212 
converges sequence 205 
convolution 647 
circular (periodic) 803-7 
linear 801-3, 809-11 
theorem 647-9, 676, 772-8 
circular 807-12 
see also discrete convolution and correlation 
coordinate frame 268 
coordinate system 154-74 
Cartesian see Cartesian coordinate system 
electrode 156 
polar see polar coordinates 
polar curves 163-6 
corner (derivatives) 371 
correlation: 
theorem 780-2 
circular 816-20 
see also discrete convolution and correlation 
cosecant (cosec) 120, 434 
hyperbolic 100 
cosine (cos): 
complex notation 749 
complex numbers 333-4, 338-9 
De Moivre’s theorem 344-50 
definite and indefinite integrals 445 
differential equations 539 
differentiation 391, 394, 420 
discrete Fourier transform 795-7 
first-order linear differential equations 547-9, 555 
Fourier series 734—43 
odd and even functions 740-3 


functions 120-3 
half-range series 746-8 
hyperbolic (cosh) 100-1, 102-3 
improper integrals 486, 488 
integration, elementary 429-33, 438-9 
integration by parts 458, 460 
integration by substitution 464-5 
Jacobi-Anger identities 598 
Maclaurin series 525-6 
modelling waves using 131-44 
odd and even functions 726-7, 730-1 
orthogonal functions 481-3 
orthogonality relations 732-1 
partial differential equations 825, 827 
polar coordinates 160, 171 
polar curves 164-5 
second-order linear differential equations 541, 559-60, 
564, 566-7, 571-3 
Taylor polynomials 522 
Taylor series 526-7 
trigonometric equations 144, 146, 147, 148 
trigonometric identities 125-30 
trigonometric ratios 116-20 
vectors 244 
waves 734, 757 
cosine (cos) periodic waveforms 723-4 
cotangent (cot) 120, 434 
hyperbolic 100 
coupled difference equations 694 
coupled equations 606-7 
coupled first-order equations 607-8 
coupled-tanks system 614-15 
Cramer’s rule 280, 658 
critical resistance 412 
critically damped response 667 
cross-correlation 780 
circular 815-16, 818-20 
linear 812-15 
cubic equation 543 
cubic polynomial 71 
cumulative normal probabilities 967-8 
current 577-8 
alternating 134, 324 
branch 307-10 
conduction 369 
density 892 
direct 510 
displacement 369 
eddy 827 
law see Kirchhoff’s current law 
leakage 576-7 
mesh 254, 307, 308-10 
node 47-8, 310-12, 317-19 
sinusoidal signals 136-7 
through capacitor 368-9 
used by electric vehicle 21-3 
cusp (derivatives) 371 


cycle of sint 131 
cycles of logarithmic scale 94-5 
cylindrical microwave cavities, modes of 593-5 
cylindrical polar coordinates 166-70 
decaying exponential 670 
fluid flow along a pipe 167 
helical antennae 169-70 


damped sinusoidal signal 387-8 


damping coefficient 610 
damping ratio 411-12 
DCT-II 795 

DCT-II 795 


De Moivre’s theorem 344-51 
De Morgan’s laws 179, 186, 191 
decaying exponential 578 
decibels (dB) 87 
decimal system 11 
deconvolution 810 
definite integrals 442-51 
degree 116, 117 
of equation 20 
of polynomial 71 
delta function 111-12, 979, 980, 981 
Dirac 480 
discrete Fourier transform 787 
Fourier transform 770-1 
integral properties 489-91 
Laplace transform 675-6 
sifting property of 490 
demodulation 817-18 
demodulator 598 
denominator 27-8, 29-30, 39, 75, 465 
dependent variable 58, 59, 366 
difference equations 682, 683, 684-5, 686-7 
differential equations 535-7, 543, 550-2 
derivable from a potential 875 
derivatives 365, 370-2 
common 372-4 
higher see higher order derivatives 
and Laplace transform 635-8 
partial see partial derivatives 
second see second-derivative 
table 356 
determinants 275-6, 278-81 
to evaluate vector products 248-9 
and vector products 279-80 
deviation: 
from mean 941 
see also standard deviation 
diagonal matrix 271 
diagonally dominant, matrix of coefficients 317 
dielectric 82 
difference, common 204 
difference equations 175, 203, 681-98 
block diagram representation 688-93 
coefficient 685-6 
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coupled 694 

dependent and independent variables 682 
discrete-time controller design 693-5 
discrete-time filter 689-90 

homogeneous and inhomogeneous equations 684-5 
linear and non-linear equations 683 

low-pass filter 696-7 

non-recursive 684 

numerical solution 695-8 

order 684 

recursive 684 

rewriting 686-8 

second-order 719 

signal processing using a microprocessor 684-5 
solution of difference equation 683 

and z transform 718-20 


differential calculus 357 
differential equations 534-626, 638 


analogue simulation 603-5 

condition 538 

coupled-tanks system 614-15 

Euler’s method 616-19 

first-order see first-order equations 

first-order linear see first-order linear differential 
equations 

higher order equations 606-9, 625 

Laplace transform 635, 659-61, 663 

LC circuit with sinusoidal input 572-3 

linear 536-7 

linear constant coefficient 627, 649-59 

numerical methods 615-16 

order 536 

oscillating mass-spring system 565-6 

parallel RLC circuit 573-5 

RC charging circuit 535-6 

RC circuit: zero-input response and zero-state response 
554-8 

RL circuit with ramp input 553-4 

RL circuit with step input 544-5 

Runge-Kutta method of fourth-order 623-5 

second-order 538-9, 605, 610 

second-order linear see second-order linear ordinary 
differential equation 

series solution of 584-6 

solution of 537-9 

standing waves on transmission lines 579-82 

state-space models 609-15 

voltage reflection coefficient 578-80 


differentiation 356-85, 429, 457-8 


applications see differentiation applications 
common derivatives 372-4 

current through capacitor 368-9 

defined 366 

derivatives, existence of 370-2 

dynamic resistance of semiconductor diode 378-9 
explicit 394 

from first principles 366, 372 
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differentiation (Continued) 
fluid flow into tank 376-7 
graphical approach to 357-8 
implicit 394-7 
limits and continuity 358-61 
linear model for simple fluid system 380-3 
as linear operator 375-83 
linearity 430 
logarithmic 397-8 
parametric 393-4 
rate of change 
at general point 364-9 
at specific point 362-4 
techniques see differentiation techniques 
voltage across an inductor 368 
differentiation applications 406-27 
inflexion, points of 415-18 
maximum points and minimum points see 
maximum/minimum points 
maximum power transfer 413-14 
Newton-Raphson method 406, 418-23 
risetime for second-order electrical system 411-13 
series diode-resistor circuit 421-3 
vectors 423-6 
differentiation techniques 386-405 
chain rule 389-91, 395 
damped sinusoidal signal 387-8 
higher derivatives 400-4 
logarithmic differentiation 397-8 
parametric differentiation 393-4 
product rule 386-8, 397-8 
quotient rule 388-9, 401 
differentiator 691 
digital computers 615 
digital controller 694 
digital electronic circuits 175 
digital filtering 692 
digital signal processing 200, 692 
diode equation 83-4 
Dirac delta function 480 
direct current electrical network 510 
direct transmission matrix 612 
Dirichlet conditions 735, 740 
discontinuous function 64—5, 361 
discrete, usage of term 175 
discrete convolution and correlation 801-20 
circular (periodic) convolution 803-7 
circular convolution theorem 807-12 
circular correlation theorem 816-20 
circular cross-correlation 815-16, 818-20 
convolution reverb 810-12 
linear convolution 801-3, 809-11 
linear cross-correlation 812-15 
radar, use of correlation in 817-20 
discrete cosine transform 795-801 
definition and its inverse 795-801 
truncation of set of samples 798-9 


two-dimensional and image compression 799-801 
discrete Fourier transform 783-7, 979, 981 
definition 784-6 
derivation 787-90 
to estimate Fourier transform 790-1 
inverse 786-7 
matrix representation 792-3 
properties 793-5 
linearity 793 
Parseval’s theorem 794 
periodicity 793 
Rayleigh’s theorem 794 
discrete independent variable 202 
discrete mathematics see Boolean algebra; set theory 
discrete random variable 953, 957 
expected value 943-4 
standard deviation 946-7 
discrete spectra 766 
discrete-time: 
controller design 693-5 
data 688 
filter 689-90, 696 
interval 702 
discrete variable 962 
disjoint sets 178 
disjunction connective 183 
disjunctive normal form 188 
displacement: 
current 369 
vector 226, 227, 423 
dissipation, power, in resistor 3-4 
function to model 59 
distance travelled by a particle 434 
distance travelled by rocket 498 
distributive laws 179, 186, 189, 191 
divergence theorem 893, 895 
divergent sequence 205 
diverging integral 486-7 
division: 
algebraic fractions 29-30 
complex numbers 330 
indices 4-5 
in polar form 335 
domain (sets) 180-1 
domain, function 58 
doping 83 
dot product 241 
double integrals 880-5, 886-7, 891, 895 
double the input rule 56-7, 61 
dual modular redundancy 906 
dummy variable, and integration 451 
duty cycle 743-4 
dynamic resistance, of semiconductor diode 378-9 
dynamic systems 535 


echelon form 290-1 
eddy current losses 827 


eigenvalues and eigenvectors 294-307 
eigenvalues 297-303, 305 
eigenvectors 303-6 
linear homogeneous equations 294-7 
electric charge density 900 
electric displacement, electric field strength and 
240-1 
electric field: 
of electrostatic dipole 529-31 
movement of charged particle in 244-5 
strength, and electric displacement 240-1 
work done by 869 
electric flux 857-8, 893, 900 
electrical components: 
multiple, testing 914-15 
reliability 911-12 
electrical networks, analysis of 307-12 
electrical potential inside a cathode ray tube 824 
electrode coordinates 156 
electromagnetism and vector calculus 864-6 
electronic chips, manufactured, acceptability of 
925-6 
electronic circuits, digital 175 
electronic thermometer measuring oven temperature 
656-7 
electrostatic dipole, electric field of 529-31 
electrostatic potential 241, 854-5, 863 
electrostatics theory 900 
elements 176, 259 
of area 882 
cofactor of 279, 282 
minor of 278-9 
of system 661 
empty set 178, 910 
energy 798 
stored in capacitor 487-8 
used by electric motor 447 
engineering functions 54-114 
argument of function 57-8 
composition of 61-2 
continuous function 64-5 
discontinuous function 64-5 
exponential functions 80-4 
graph of function 58-9 
hyperbolic functions see hyperbolic functions 
inverse of function 62—4 
logarithm functions see logarithm functions 
many-to-one functions 60-1 
numbers and intervals 55-6 
one-to-many function 60 
one-to-one functions 60-1 
parametric definition 61 
periodic function 65-6 
piecewise continuous functions 64-5 
polynomial functions 70-4 
power dissipated in resistor 59 
rational functions 75-9 


Index 


saw-tooth waveform 66-7 
square waveform 67 
triangular waveform 66 
entropy 916-19 
of binary data stream 917 
of signal consisting of three characters 917 
equal sets 177 
equating coefficients 40-1, 585, 586 
equations: 
algebraic 627, 649 
auxiliary 562-4, 572, 574 
Bessel’s see Bessel’s equations 
Cartesian 169 
characteristic 299, 300-2 
circle, centre on the origin 163, 164 
coupled 606-7 
coupled difference 694 
coupled first-order 607-8 
cubic 543 
degree of 20 
difference see difference equations 
differential see differential equations 
diode 83-4 
exact 547-9 
family of 587 
first-order see first-order equations 
higher order 606-9, 625 
homogeneous 558-60, 684-6 
inconsistent 288 
inhomogeneous 558-60, 572-3 
Laplace 833-4, 862, 863 
linear 163, 164, 220, 536-7, 683, 685 
non-linear 219-22, 536-7, 683, 685 
non-recursive 691 
output 614 
partial differential 832-5, 862 
Poisson’s 863 
quadratic 20-4, 103, 325-6 
second-order 684, 685, 696 
separable 541 
simple 540-1 
simultaneous see simultaneous equations 
state 609, 613 
state-space 615 
third-order 684, 685 
transmission 834 
trigonometric 144-50 
wave 833 
zero order 684 
equipotential surface 245 
equivalent fractions 27-8 
equivalent resistance 77-8 
error-correcting code 291 
error signal 693 
error term 521 
Euler’s method 616-19, 623 
improved 620-2 


989 


990‘ Index 


Euler’s relations 339, 749 first-order equations 540-5, 573, 606-8, 685 
differential equations 564, 566 coupled 607-8 
discrete Fourier transform 787 difference equations 696 
Fourier transform 760, 771-2 Laplace transform 656 
even functions 726-32 separation of variables 541-5 
exact equations 547-9 simplest 540-1 
exact solution 616, 617-18, 620-1, 624 first-order linear differential equations 547-57 
exclusive OR gate 189 exact equations 547-9 
expansion along the first row 278 integrating factor method 550-7 
expected value 943-5, 946-7, 955, 959, 962 separation of variables 549 
experiment and binomial distribution 953 first-order Taylor polynomial 509-13 
explicit differentiation 394 direct current electrical network 510 
exponent see power gravity feed water supply 510-11 
exponent, defined 80 linearity 509-13 
exponential constant 81 power dissipation in resistor 511-12 
exponential decay 81 first-order Taylor polynomials, in two variables 835-7 
exponential distribution 962-3 first shift theorem 632-3, 719, 768 
exponential expressions 80 Fourier transform 762-4 
exponential form, of complex number 337-8 z transform 711-12 
exponential functions 80-4, 728 fluid flow: 
diode equation 83-4 across a surface 890-2 
discharge of capacitor 82 along a pipe 167 
see also logarithm functions coupled-tanks system 614-15 
exponential growth 81 into tank 376-7 
exponential term 669, 670, 671 flux 891 
folding (signal processing) 775 
factorial notation 55 forward wave 578-80 
factorization 20-1, 28-9, 39 Fourier analysis 482, 745 
factory production line, quality control on 907-8 Fourier coefficients 734, 739, 741-2, 748-50, 766 
failure in a single trial 953 Fourier components 752, 754-5 
Faraday, M. 240 Fourier series 722-56, 723-4, 728, 731-42, 746, 749-50 
Faraday’s law 368, 865 complex notation 749-51 
feedback: frequency response of linear system 751-5 
loop, negative, elimination of 662-8 half-range series 745-8 
negative 436 low-pass filter 752-5 
path 689 odd and even functions 726-32, 740-4 
Fibonacci sequence 203 Parseval’s theorem 748-9 
field: periodic waveforms 723-6 
irrotational 860 spectrum of pulse width: 
line and multiple integrals 876 modulation controlled solar charger 743-4 
solenoidal 857 Fourier synthesis 734 
filters 722 Fourier transform 749, 757-822 
digital 692 amplitude modulation 767-8 
discrete-time 689-90, 696 convolution and convolution theorem 774-82 
high pass 689 definitions 758-61 
low pass 689, 696-7 of delta function 770-1 
final value theorem 634, 657 discrete cosine transform 795-81 
finite sequences 801, 809, 813 fast 785 
finite series 209 integration 480 
arithmetic 209-10 inverse 758-9, 774 
geometric 211 and Laplace transform 772-4 
first condition (differential equations) 539 pair 758 
first-derivative 417 periodic functions 771-2 
Laplace transform 635 properties 761-6 
Taylor polynomials 508-9 first shift theorem 762-4 
test 407, 411 linearity 761-2 


first law of indices 3 second shift theorem 764—5 


spectra 766-8 
t-w duality principle 768-70, 771 


see also discrete convolution and correlation; discrete 


Fourier transform 
fourth-order Taylor polynomials 518 
fractional dead time 972 
fractional indices 8-11 
fractions: 
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back substitution 287-90 
row operations 287-90 


Gauss’s theorem 448, 857-8, 864, 893, 900 
general solution 537 


complementary function 558-9 

difference equations 683 

differential equation 584-5 

first-order linear differential equations 547 


algebraic see algebraic fractions 
equivalent 27-8 

expressing, in simplest form 28-9 
improper 27, 43-4 

negative 35-6 

partial see partial fractions 
positive 35-7 

proper 27 


free variables 288-9 
free vectors 225 
frequency 132, 724 


angular see angular frequency 
component, zero 736 

domain 722, 766 

Fourier series 723 

natural 411 

resonant 343, 411, 594 
response of a system 636-7 
response of linear system 751-5 
sinusoidal steady-state 636-7 
spectrum 744 

variable, generalized 636 


frequency modulation (FM), Bessel functions in 598-601 
full-wave rectifier 106-7 
function: 


average value of 471-5 
orthogonal 480-3 
root mean square value of 475-9 


function handle 61 
functions of several variables 823-48 


dependent variable 823-4 

eddy current losses 827 

electrical potential inside a cathode ray tube 824 

first-order Taylor polynomial in two variables 835-7 

functions of more than one variable 823-5 

higher order derivatives 829-32 

independent variables 823-4, 825 

maximum and minimum points of a function of two 
variables 841-6 

partial derivatives 825-9 

partial differential equations 832-5 

power dissipated in a variable resistor 824 


second-order Taylor polynomial in two variables 837-40 


Taylor series in two variables 840 


higher order equations 607-8 
particular integral 578 


second-order differential equation 538, 561-4, 566-8, 


569-70 

generalized frequency variable 636 
generating function 509 

Bessel functions and 595-7 
geometric progressions 204-5 
geometric series: 

finite 211 

infinite 212-13 
gradient 620 

of chord 357-8 

of solution 616 

of tangent 366-7, 508 
graph, of function 58-9 
graphics calculators 73, 723 
gravitational field, work done by 868-9 
gravity feed water supply 510-11 
Greek alphabet 981 
Green’s theorem 895 


half-range cosine series 746-8 
half-range Fourier series 745-8 
half-range sine series 746 
half-wave dipole, radiation patterns of 172-3 
half-wave rectifier 107 
Hall effect, in semiconductor 251-2 
harmonics 723 
components 724 
even 744 
fundamental or first 723 
nth 744 
odd 736, 744 
second 723 
waves 725, 734 
helical antennae 169-70 
hexadecimal system 11, 14-17 
binary to hexadecimal conversion 16-17 
conversion to decimal 14-15 
decimal to hexadecimal conversion 15-16 
high pass filter 689 
higher order derivatives 400-4, 829-32 
fifth 402 
first 400-2 


gain/attenuation 87, 88, 578 

Gauss-Seidel method 314-17 

Gaussian elimination 286—94, 306, 307, 309 
augmented matrix 287-91, 292 


fourth 402, 503-4, 831 
mixed 831 

second 400-2, 831 
third 402, 831 
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higher order equations 606-9, 625 
homogeneous equations 558-60, 572, 684-6 
linear 294-7 
horizontal axis turbines 73 
horizontal shift 138-44 
horizontal strips 881-4 
hyperbolic functions 100-4 
capacitance between two parallel wires 102-3 
cosecant (cosech) 100 
cosine (cosh) 100-1 
cotangent (coth) 100 
identities 101 
inverse 102 
sine (sinh) 100-1 


i and j components of r 233 
identity 
hyperbolic function 101 
Jacobi-Anger 597-8 
laws 179, 186, 189-90, 191 
matrix 271-2, 274, 284, 292-3 
trigonometric 125-30 
imaginary axis 670, 709 
imaginary axis (y axis) 332 
imaginary parts 578, 669 
complex numbers 325-6, 334, 348 
implicit differentiation 394-7 
impossible event 910 
improper fraction 27, 43-4 
improper integrals 480, 483-9 
impulse response 112, 676, 810-12 
impulse train 111-12 
inclusive OR gate 189 
inconsistent equations 288 
increment (rate of change at a specific point) 363 
indefinite integrals 442-51 
independent events 925-30 
independent solution 568 
independent variable 58, 59, 366, 374 
difference equations 682, 683, 684-5, 686, 691 
differential equations 535-7, 540-1, 543, 550-2 
discrete 202, 682 
integration 429, 431-2 
indices, laws of see under algebraic techniques 
inductor 577 
phasors 341 
inequalities, solution of 33-8 
inertia, moment of 612, 613 
infinite integrand 483, 486 
infinite limits of integration 483 
infinite series 209, 211-12 
geometric, sum of 212-13 
inflexion, points of 415-18 
information content: 
of alphabetic character stream 916 
of binary data stream 916 
information per event 915 


inhomogeneous equations 294, 558-60, 569-70, 572-3 


difference equations 684-6 
inhomogeneous term, in complementary function 
575-83 
initial conditions 539, 542, 556-7, 566 
analogue simulation 604-5 
difference equations 686, 696 
differential equations 616 
Laplace transform 659 
particular integral 578 
zero 660, 663 
initial values 696 
input: 
matrix 612 
sequence 685, 689, 691, 697 
signal 661, 669 
to system 659-61 
transfer functions 663, 665 
vector 612 
voltage 676 
integers 55, 345 
non-negative 682 
positive 55, 214-16 
integrals 438, 460, 463, 480, 490 
differential equations 541-2 
diverging 486-7 
double 880-5, 886-7, 891, 895 
improper 480, 483-9 
indefinite/definite 442-51 
infinite limits 483 
inner 881-3, 886 
Laplace transform 628, 635-8 
limits 443, 446, 488 
lower limits 443-5 
odd and even functions 730-1 
outer 881-3 
periodic functions 677-8 
scalar 493 
second-order linear ordinary differential equation 
569-75 
Simpson’s rule 502 
surface 889-95, 896, 898 
trapezium rule 499-500 
triple 885-6, 889, 893 
upper limits 443-5 
volume 889-95 
see also line integrals and multiple integrals 
integrands 458, 465, 480 
continuous 491 
infinite 483, 488 
integrating factor 551, 552 
integrating factor method 550-7 
integration 428-56, 480-95 
analytical 457 
applications of see integration applications 
capacitance of coaxial cable 448-50 
capacitors in series 484-5 


characteristic impedance of coaxial cable 450-1 


constant of 429 


delta function, integral properties of 489-91 


differential equations 542 
distance travelled by a particle 434 
elementary 429-41 

electronic integrators 435-7 
energy stored in capacitor 487-8 
energy used by electric motor 447 


iteration 420 
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of non-linear equations, sequences arising from 219-22 


simple 220 


Jacobi-Anger identities 597-8 
Jacobi’s method 313-14, 315-16 


Kirchhoff’s current law 310 
algebraic techniques 47-8 


improprer integrals 483-9 


integrals, definite and indefinite 442-51 


limits 491 
as linear operator 432-4 
linearity 438 


numerical see numerical integration 


orthogonal functions 480-3 


of piecewise continuous functions 491-2 


region of 880 


techniques see integration techniques 
of trigonometric functions 437-9 


vectors 493 
voltage across a capacitor 435 
integration applications 471-9 


average value and r.m.s. value of periodic waveform 


477-8 
average value of function 471-5 
load resistor 476-7 


root mean square value of function 475-9 


saw-tooth waveform 472-3 
thyristor firing circuit 473-4 
integration techniques 457-70 
by parts 457-63 
by substitution 463-6 
using partial fractions 466-8 
integrator 604-5 
circuits 428 
intersection (set theory) 178 
intervals 56 
closed 56 
of convergence 218 
open 56 
semi-open 56 
inverse: 
of function 62-4 
of matrix 285 
using row operations 292-3 
of 3 x 3 matrix 281-2 
trigonometric functions 121-3 
of 2 x 2 matrix 274-7 
finding 275-6 
orthogonal matrix 276 
inverse square law 6 
inverters 188 
inverting circuit 437 
irrational number 55 
irrotational vector field 860 
isotropic antenna 6 


complex numbers 342 


Kirchhoff’s voltage law 72, 308 


algebraic techniques 48-9 
complex numbers 342 


differential equations 535, 544, 560, 572, 577 


differentiation 413 
Fourier series 752 
integration 435, 485 
Laplace transform 655 


Kraus, J. 169 
Kronecker delta sequence 202, 203, 698 


Laplace equation 862, 863 


three-dimensional 833-4 


Laplace transform 411, 627-80, 681-2, 706, 709, 759, 774 


asymptotic Bode plot of transfer function with real poles 


and zeros 671-5 


of common functions (look-up table) 629-30 


convolution theorem 647-9 
definition 628 

delta function 675-6 

of derivatives and integrals 635-8 
discharge of a capacitor 655 


electronic thermometer measuring oven temperature 


656-7 
and Fourier transform 772-4 


frequency response of a system 636-7 


integration 480 

inverse 627, 638-41 
using complex numbers 643-6 
using partial fractions 641-3 


linear constant coefficient differential equations 649-59 


linearity 631-2 
partial differential equations 833 
periodic functions 677-8 
poles, zeros and s plane 668-75 
rule 1 670 
rule 2 670 
rule 3 670-5 
position control system 665-8 
properties 631-5 
final value theorem 634 
first shift theorem 632-3 
linearity 631-2 
second shift theorem 633-4 
RL circuit with ramp input 651-2 


rule 1: combining two transfer functions in series 


661-2 
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Laplace transform (Continued) 


rule 2: eliminating negative feedback loop 662-8 


transfer functions 659-68, 752 
transport lag 664-6, 664-5 
voltage across a capacitor 636 
and z transform, relationship between 704-9 
LCR circuit 560-1 
leakage, current 576-7 
left-hand limits 360 
lifespan, electrical components 911 
limit sequence 205 
limit 5 sequence 206 
limits 463 
differentiation 358-61 
of integrals 443, 446, 486 
integration 491 
left-hand 360 
right-hand 360 
line integrals and multiple integrals 867-902 
conservative fields 875-80 
line integral around closed loop 877-9 
potential function 875-7 
divergence theorem 895 
double integrals 880-5, 886-7, 891 
electric charge enclosed in a region 890 
electric current density 892 
electric field, work done by 869 
electric flux and Gauss’s law 893 
evaluation of line integrals in three dimensions 
873-5 


evaluation of line integrals in two dimensions 871-3 


fluid flow across a surface 890 
gravitational field, work done by 868-9 
Green’s theorem 886-7 
line integrals 867-71 
mass of a solid object 889 
Maxwell’s equations in integral form 899-900 
simple volume and surface integrals 889-95 
Stokes’ theorem 896-9 
three-dimensional line integrals 873-5 
triple integrals 885-6, 889 
two-dimensional line integrals 871-3 
line spectra 766 
linear circuit, Bode plot of 96-7 
linear combination 237 
of sinusoids 723 
linear constant coefficient: 
differential equations 627, 649-59 
linear convolution 801-3, 809-11 
linear cross-correlation 812-15 
linear equations 163, 164, 220, 536-7, 683, 685 
linear factors 39-41 
equating coefficients 40-1 
evaluation using specific value of x 40 
Laplace transform 651, 668 
repeated 39, 41-2 
linear homogeneous equations 294-7 


linear models for simple fluid system 380-3 
linear operator: 
differentiation as 375-83 
integration as 432-4 
Laplace transform 638 
linear polynomial 71 
linear scale 94 
linear system: 
frequency response 751-5 
Taylor polynomials 511 
linear time-invariant system 560-1 
linearity: 
differential equations 536-7 
of differentiation 430 
discrete Fourier transform 793 
first-order Taylor polynomial 509-13 
Fourier transform 761-2 
integration 438 
Laplace transform 631-2 
z transform and difference equations 710, 718-19 
linearly dependent vectors 238 
linearly independent functions 558 
linearly independent vectors 238 
lines of force 240 
linking process control computers 948-9 
load: 
line 422 
resistor 476-7, 655 
voltage waveform 473-4 
log-linear graph 92 
log-linear scales 92-4 
log-log graph 94-6 
log-log plot 92-3 
log-log scales 92-4 
log scale 92 
logarithm functions 85-100, 187-93 
attenuation in a step-index optical fibre 89-90 
decibels in radio frequency engineering 88-9 
delta function 111-12 
exponential functions and 91-2 
logarithms 85-7 
modulus function 104-8 
ramp function 108 
reference levels see reference levels 
signal ratios and decibels 87-8 
unit impulse function 111-12 
unit step function 108-10 
logarithmic differentiation 397-8 
logarithmic frequency scale 672 
logarithmic scale 94 
logarithms 85-7 
laws 86, 398, 433, 672 
natural 85, 89 
logical equivalence: 
AND gate 184, 187, 188 
binary full-adder circuit design 190-2 
connective 183 


exclusive OR gate 189 
NAND gate 185, 192-3 
NOR gate 184-5 
NOT gate 184 
OR gate see OR gate 
realization of logical gates 198-9 
lossless line 578 
lossy compression 799 
low pass filter 689, 692 
analogue 752-5 
digital 696-7 
lower bound 56, 499, 504 
lowest common denominator (1.c.d.) 30 


m-files 61 
Maclaurin series 524-31 
magnetic field: 
due to moving charge 250 
intensity 249-50 
strength 249-50 
magnetic flux 900 
magnitude: 
of phasor 340 
of vector 225 
many-to-one functions 60-1, 122 
maps 180 
mark time 743 
mass 610-11 
mass-spring system, oscillating 565-6 
mathematical model 54 
MATLAB® 1-2, 58, 320-1, 615, 785, 790, 808-9 
toolboxes 67 
matrices | 
matrix algebra 257-328 
addition 259-60 
adjoint matrix 281-2 
augmented matrix 309 
computer solutions for 319-21 
definitions 258-9 
determinants 275-6, 278-81 
Cramer’s rule 280 
and vector products 279-80 
diagonal matrix 271 
direct transmission 612 


eigenvalues and eigenvectors see eigenvalues and 


eigenvectors 

electrical networks, analysis of 307-12 
Gaussian elimination see Gaussian elimination 
identity matrix 271-2, 292-3 
input 612 
inverse of 2 x 2 matrix 274-7 

finding 275-6 

orthogonal matrix 276 
inverse of 3 x 3 matrix 281-2 
multiplication 261-4, 285 
non-singular matrix 276 
output 612 


Index 995 


representation 792-3 
robot coordinate frames 268 
scalar multiplication 260-1 
simultaneous equations and 283-6 
simultaneous equations, iterative techniques for solution 
of 312-19 
singular matrix 276, 282 
skew symmetric matrix 272-3 
square matrix 271, 272-3, 275, 300 
state 612 
subtraction 259-60 
symmetric matrix 272 
translation and rotation of vectors 267-71 
transpose of matrix 272 
Vandermonde matrix 291-2 
maximum/minimum points 406-15, 824-5 
first-derivative test 407, 411 
of a function of two variables 841-6 
local maximum 406-7 
local minimum 406-7 
second-derivative test 410, 412 
turning points 407-9, 412, 414 
Maxwell’s equations 864-6 
in integral form 899-900 
mean 964 
deviation and binomial distribution 955 
time between failures (MTBF) 970, 973, 975 
value (arithmetic mean) 938-40 
mesh current 307, 308-10 
vector 254 
microwave cavities 593-5 
minimization laws 179, 185, 186 
minimum points see maximum/minimum points 
minor of element 278-9 
mixed partial derivatives 862 
modelling systems 566 
modulation index 599 
modulation signal 767 
modulus: 
of complex number 333 
function 104-8 
of vector 225 
moment of inertia 612, 613 
moving average filter 692 
multipath-induced fading 142 
multiple integrals see line integrals and multiple integrals 
multiplication: 
algebraic fractions 29-30 
complex numbers 329-30 
indices 2-4 
law 919-25, 926, 954 
matrix 261-4 
in polar form 335 
scalar 260-1 
mutual capacitance 102 
mutually exclusive events: addition law of probability 
909-13 
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n-component column vector 612 
n-type metal-oxide semiconductor (NMOS) transistors 
192-3 
n-type semiconductor 83, 251 
NAND gate 185, 192-3 
natural frequency 411 
natural logarithms 85, 89 
natural numbers 55 
negative feedback 436 
negative number 325 
neper (Np) 89 
Newton-Raphson method 221, 406, 418-23 
Newton’s Second Law 610, 613 
nibble 17 
NMOS (n-type metal-oxide semiconductor) transistors 
192-3 
node 47-8 
reference 310 
voltages/currents 47-8, 310-12, 317-19 
non-linear equations 219-22, 536-7, 683, 685 
non-linear system, in Taylor polynomials 511 
non-negative number 333 
non-periodic function 757-8 
non-rectangular region 880, 884 
non-recursive equations 691 
non-singular matrix 276 
non-standard normal 966-9 
non-trivial solution 294—6, 298-300 
non-uniform failure rates 976 
non-zero initial value of state vector 660 
NOR gate 184-5 
normal distribution 963-9 
non-standard normal 966-9 
standard normal 964-6 
normalized sinc function 123, 124-5 
norms 254 
NOT gate 184 
nth-order Taylor polynomial 517-21 
numbers 55-6 
commutative 259-60 
complex see complex numbers 
numerator 27-8, 29-30, 75, 465 
numerical integration 457, 496-506 
distance travelled by rocket 498 
energy dissipation in resistor 502-3 
Simpson’s rule 500-5 
trapezium rule 496-500, 502 
numerical methods, and differential equations 615-16 
numerical solution 220, 534, 617-18, 695-8 


oblique asymptote 76 

odd and even functions 726-32, 740-4 
integral properties 730-1 

Ohm’s law 3, 55 
differential equations 535, 544 
differentiation 413 
fluid equivalent 614 


Fourier series 752 

integration 436, 487 

Laplace transform 655 

phasor form 341 

polynomial functions 71-2 

Taylor polynomials 510 
one-to-many function 60 
one-to-one functions 60-1, 122 
operating point 379-80 
OR gate 183, 184-5, 187, 188, 192 

exclusive 189 

inclusive 189 

truth table 183 
order 258, 263, 587 

difference equations and the z transform 684 

differential equations 536 

of objects 949 

of system 609 
ordinary differential equations see differential equations 
origin (coordinate systems) 154 
orthogonal functions 480-3 
orthogonal matrix 276 
orthogonal vectors 232, 242 
oscillate sequence 205 
oscillating mass-spring system 565-6 
oscillatory term 669 
output 663, 665 

equation 614 

matrix 612 

reduction 436 

sequence 685, 689, 691, 697 

signal 661, 669 

to system 659-61 

variables 614 

vector 612 

voltage 676 
overdamped response 666-7 
overshooting 666 


p-component column vector 612 
p-n junction 83 
p-type metal-oxide semiconductor (PMOS) transistor 
192-3 
p-type semiconductor 83, 251 
padding signals with zeros 809-11 
parallel system 975-7 
parallel vectors 242 
parallelogram 336-7 
parameters 393, 587 
functions 61 
parametric differentiation 393-4 
Parseval’s theorem 748-9, 794 
partial derivatives 825-9, 835 
first-order 825, 829, 833, 835-7, 842-5 
fourth-order 840 
line and multiple integrals 886 
mixed 862 


second-order 829-30, 833, 838-40, 842-5 
third-order 840 
vector calculus 854 
partial differential equations 832-5, 862 
partial fractions 39-45 
improper fractions 43-4 
and integration 466-8 
linear factors 39-41 
quadratic factors 42-3 
repeated linear factors 39, 41-2 
partial fractions: 
inverse Laplace transform 641-3, 644-5, 651, 653-4, 
656 
Laplace transform 666 
z transform 716-17, 720 
partial sums sequence 212 
particular integral function 558 
particular solution 538, 556, 563-4, 575 
parts, integration by 457-63 
Pascal’s triangle 214-15 
period 131 
of function 65 
periodic convolution see circular (periodic) convolution 
periodic extension 746-7 
periodic functions 65-6, 131-2, 725, 733 
Fourier transform 766, 771-2 
Laplace transform 677-8 
periodic waveforms 477-8, 723-6, 752 
periodicity and discrete Fourier 
transform 793 
permeability, of conductor 10 
permittivity of free space 102, 858 
permutations 948-50, 951-2 
phase 136, 724, 751-3 
angle 132, 133, 138-9, 723 
constant 581 
Fourier transform 766 
phasors 340-4 
capacitor 341-3 
inductor 341 
magnitude of 340 
reference 340 
resistor 341 
pick and place robot 159-60 
piecewise continuous functions 64-5, 648, 780 
integration of 491-2 
place sign 279 


PMOS (p-type metal-oxide semiconductor) transistor 192-3 


Poisson distribution 957-61, 962 
Poisson process 959 
Poisson’s equation 863 
polar coordinates 159-63, 333-4 
cylindrical see cylindrical polar coordinates 
pick and place robot 159-60 
spherical 170-3 
polar curves 163-6 
equation of circle, centre on the origin 163, 164 
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equation of line 163, 164 
and radiation patterns from antennas 164-5 
polar form 333-6, 338, 345-6 
poles 706-9 
complex 668, 669 
dominant 670 
of function 76 
Laplace transform 668-75 
at the origin 673 
positions 667 
real 668, 669 
stable complex 671 
system 668 
polynomial: 
coefficients 20 
constant 71 
cubic 71 
of degree 2 300 
of degree n—d 43-4 
degree of 71 
equations see under algebraic techniques 
expression 70 
functions 70-4 
non-ideal voltage source 72-3 
Ohm’s law 71-2 
wind power turbines 73 
linear 71 
quadratic 71 
quartic 71 
Taylor see Taylor polynomials 
position control system (servo-system) 108, 665-8 
position vector 423-4 
positive integers 55 
positive number 325 
postmultiplying 263 
potential function 875-7 
potential theory 833 
potentiometer 604-5 
power: 
dissipation in resistor 3-4, 511-12, 824 
function to model 59 


gain 87 
series 218-19, 337-8 
supply to a computer 906 


power density 6 
power series solution, of Bessel’s equation 587-8, 
590-1 
Poynting vector 343-4 
premultiplying 263, 272 
primary route (permutations) 948-9 
probability 903-32 
acceptability of manufactured electronic chips 925-6 
of acceptable components 928-9 
addition law 909-13, 954 
alphabetic character stream, information content 
of 916 
binary data stream, information content of 916 
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probability (Continued) 
breakdowns on factory process line 971-2 
character data stream, redundancy of 918 
communication theory 915-19 
complementary events 913-15 
component, calculating 912 
compound event 905-8 
conditional 919-25 
cumulative normal 967-8 
electrical component reliability 911-12 
electrical supply to a large factory, availability of 973 
emergency calls to service engineer 958 
entropy of binary data stream 917 
entropy of signal consisting of three characters 917 
events 904 
of faulty components 926-8 
independent events 925-30 
machine breakdowns in workshop 958 
mutually exclusive events 909-13 
power supply to a computer 906 
production line product fed by two machines 920-2 
quality control on factory production line 907-8 
redundancy of binary data stream 918 
reliability of manufactured components 923-4 
testing multiple electronic components 914-15 
trial 904 
unconditional 920, 923 
workforce absentees 959-60 
see also statistics and probability 
product rule 543 
differential equation 548, 551 
differentiation 386-8, 397-8, 412 
partial differential equations 827-8, 833 
production engineering 903 
projection (cylindrical polar coordinates) 166 
proper fraction 27 
proportional/integral/derivative (p.i.d.) controller 694 
Pythagoras’s theorem 106, 141, 161, 233, 235-6, 333 


quadratic approximation to diode characteristic 
515-16 

quadratic equations 20-4, 103, 325-6 

quadratic expression 37 

quadratic factors 39, 42-3 

quadratic polynomial 71 

quality control 903, 907-8 

quantity, absolute 88 

quartic polynomial 71 

quotient rule 388-9, 401, 414 


r-component column vector 612 
radar: 
and correlation 817-20 
defined 8 
scattering 7-8 
radar cross-section (RCS) 8 
radians 116, 117 


radiation patterns: 
of half-wave dipole 172-3 
polar curves and 164-5 
radio antenna, power density of signal transmitted by 6 
radio-frequency (RF) engineering 579 
decibels in 88-9 
radius of convergence 218, 698 
ramp function 108 
random variables 934, 962, 968 
continuous 944-5, 947, 965-6 
discrete see discrete random variable 
range, sets and functions 180 
rate of change 357-8, 362-4 
at general point 364-9 
negative 365 
positive 365 
at specific point 362-4 
rational functions 75-9 
rational numbers 55, 176, 345 
Rayleigh’s theorem 794 
RC charging circuit 535-6 
real axis (x axis) 332 
real line 55-6 
real numbers 55, 176, 181 
real parts 578 
complex numbers 326, 334, 348 
negative 669, 670 
real roots 24 
real system 670 
rectangle function 979-80 
rectangular region 880-3 
rectangular wave 743 
rectifier: 
defined 106 
full-wave 106-7 
half-wave 107 
recurrence relations 203, 220 
recursive difference equations 684 
recursive filter 689 
recursive sequence 203 
reduction formula 460-1 
redundancy 917-19 
Reed-Solomon codes 291 
reference levels 88, 90-7 
Bode plot of linear circuit 96-7 


exponential and logarithm functions, connection between 


91-2 
log-linear and log-log graph paper 94-6 
log-log and log-linear scales 92-4 
logarithm functions 90-1 
reference node 310 
reference phasor 340 
relation (sets and functions) 180-1 
reliability engineering 903, 970-7 
parallel system 975-7 
series system 973-5 
remainder of order n 522 


remainder term, Taylor’s formula and 521-4 
repeated linear factors 39, 41-2 
resistors 577 
load 476-7 
in parallel 32 
phasors 341 
power dissipation in 3-4, 511-12 
function to model 59 
resolving force into two perpendicular directions 
227-8 
resonant frequencies 343, 411, 594 
resultant of two forces acting on a body 227 
reverberation 810-12 
reverse saturated diode 83 
right-hand limits 360 
right-handed screw rule 896 
risetime 411-13 
RL circuit with ramp input 651-2 
RL circuit with step input 544-5 
RLC circuit 109 
parallel 573-5 
robot coordinate frames 268 
robot positions 237 
root mean square (r.m.s.) 340, 471 
value of function 475-9 
value of periodic waveform 477-8 
roots 20, 219-21, 563-4, 568, 668 
complex 24, 564, 567 
formula 21 
real 24 
routing, of automated vehicle 226-7 
row 258-9, 261-2 
operations 287-90, 292-3 
vector 233 
Runge-Kutta method of fourth-order 623-5 


s domain 661, 665 
s plane 668-75 
mapping of to the z plane 706-9 
saddle points 825, 842-5 
sample space 904 
sampled signal 690 
sampling 202, 688 
saw-tooth waveform 66-7, 472-3, 733-4, 742-3 
scalar 224 
field 240, 851-5, 856, 875, 889 
integrals 493 
multiple 230-1, 260-1 
product 241-5, 870-1, 891 
secant (sech) 120, 434 
hyperbolic 100 
second-derivative 416-17 
difference equations 691 
differential equations 536 
Laplace transform 635 
numerical integration 499 
test 410, 412 
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second law of indices 4 
second-order difference equations 719 


second-order differential equations 538-9, 604, 610 
second-order electrical systems, risetime for 411-13 


second-order equations 573, 606-8, 684, 685, 696 
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second-order linear ordinary differential equation 558-84 


complementary function 561-9 
constant coefficient equations 560-84 
integrals 569-75 
property 1 558 
property 2 558-60 
transmission lines 576-8 

second-order system 610 

second-order Taylor polynomials 513-17 


second shift theorem 633-4, 665, 712-14, 716, 764-5 


secondary route (permutations) 948-9 
semiconductors 192-3 
diode: 
dynamic resistance of 378-9 
ideal, operating point for 379-80 
Hall effect in 251-2 
n-type 83, 251 
p-type 83, 251 
separable equations 541 
separation of variables 541-5, 549 
sequences 200-9, 683, 688-9, 976 
arithmetic progressions 204 
binomial theorem 214-18 
bounded sequence 205 
converges sequence 205 
divergent sequence 205 
Fibonacci sequence 203 
finite sequence 801, 809, 813 
geometric progressions 204-5 
input sequence 685, 689, 691, 697 


from iterative solution of non-linear equations 219-22 


Kronecker delta sequence 202, 203 
limit sequence 205 
limit 5 sequence 206 
oscillate sequence 205 
output sequence 685, 689, 691, 697 
partial sums sequence 212 
recursive sequence 203 
unbounded sequence 205 
unit ramp sequence 699-700 
unit step sequence 202, 699 
see also series 
series 200, 209-14 
binomial theorem 214-18 
converged 211 
diode-resistor circuit 421-3 
finite 209 
arithmetic, sum of 209-10 
geometric, sum of 211 
inductance series 576 
infinite 209, 211-12 
geometric, sum of 212-13 
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series (Continued) 
power 218-19 
sequence of partial sums 212 
system series 973-5 
see also sequences 
servo-systems 665 
set theory 55, 175-83 
absorption laws 179, 185, 186 
associative laws 179, 186 
commutative laws 179, 186, 191 
complement 178-9, 186, 189-90 
De Morgan’s laws 179, 186, 191 
disjoint sets 178 
distributive laws 179, 186, 189, 191 
empty set 178 
empty sets 910 
equal sets 177 
identity laws 179, 186, 189-90, 191 
intersection 178 
logic 183-5 
minimization laws 179, 185, 186 
sets and functions 179-81 
subsets 178 
union 178 
universal set 177 
Venn diagrams 177, 178-9 
seven-segment displays 18-19 
shrinking interval method 363, 364, 372 
SI units and prefixes 982 
side bands 601 
sifting property, of delta function 490, 787-8 
sigma notation 209 
signals 133, 688 
damped sinusoidal 387-8 
input 661, 669, 697 
output 661, 669 
processing 200, 201 
digital 200, 692 
using a microprocessor 684-5 
ratios, and decibels 87-8 
sampling 690, 702-4 
sequence 697 
sinusoidal 751-2 
sinusoidal current 136-7 
sinusoidal modulating 599 
sinusoidal voltage 134-5 
of system 661 
telescope drive 108 
simple iteration 220 
Simpson’s rule 500-5 
simultaneous equations: 
algebraic 658 
direct methods 312 
and iterative techniques 312-19 
Gauss-Seidel method 314-17 
Jacobi’s method 313-14, 315-16 
and matrix algebra 283-6 


sinc function 123-5, 744, 760 
normalized 123, 124-5 
zero crossing points 124-5 

sine (sin) 115 
complex numbers 333-4, 338-9 
De Moivre’s theorem 344-50 
definite and indefinite integrals 444-5 
differential equations 534, 537, 539 
differentiation 387-9, 391, 394, 412, 420 
elementary integration 432-3, 434, 438-9 
first-order linear differential equations 547-8 
Fourier transform 763, 766 
functions 120-3 
hyperbolic 100-1 
improper integrals 486, 488 
integration, elementary 429-33 
integration by parts 458 
integration by substitution 464 
inverse function 122 
Jacobi-Anger identities 598 
Maclaurin series 525-6 
modelling waves using 131-44 
orthogonal functions 481-3 
partial differential equations 826, 827 
polar coordinates 160, 171 
polar curves 164-5 
root mean square value of function 475-6 


second-order linear differential equations 541, 559-60, 


564, 566-7, 571-3 
series, half-range 746 
Taylor polynomials 522-3 
Taylor series 526-7 
trigonometric equations 144, 145, 148, 149 
trigonometric identities 125-30 
trigonometric ratios 116-20 
vectors 246-7, 857 
waves 723, 734, 757 
single loop industrial control system 693 
single trial 953, 956 
singular matrix 276, 282 
sinusoidal functions 121 
sinusoidal modulating signal 599 
sinusoidal signals 751-2 
current 136-7 
damped 387-8 
voltage 134-5 
sinusoidal steady-state frequency 636-7 
sinusoidal term 670-1 
sinusoidal waveform 340 
sinusoids 723 
linear combination 723 
skew symmetric matrix 272-3 
skin depth, in radial conductor 9-10 
slope (differentiation) 357 
slotted line apparatus 582 
small-angle approximations 219 
smoothing the input signal 696 


solenoidal vector field 857 
solution: 
analytical 534, 618-19, 621-2 
differential equations 537-9 
exact 616, 617-18, 620-1, 624 
general see general solution 
independent 568 
non-trivial 294-6, 298-300 
numerical 220, 534, 617-18, 695-8 
particular 538, 556, 563-4, 575 
specific 683 
trivial 294-6, 298 
true 615 
solve, defined 219 
sonar 817, 820 
specific solution 683 
spectra: 
continuous 766 
discrete 766 
Fourier transform 766-8 
line 766 


of pulse width modulation controlled solar charger 743-4 


spherical polar coordinates 170-3 
square matrix 271, 272-3, 275, 300 
square the input rule 58-9, 61, 64 
square wave input 753-4 
square waveform 67 
standard deviation 941-3, 964 
binomial distribution 955 
of a continuous random variable 947 
of a discrete random variable 946-7 


standard forms and Laplace transform 639-40 


standard normal 964-6 

standing waves on transmission lines 580-2 

state equations 609, 613 

state matrix 612 

state-space equations 615 

state-space models 603, 609-15, 660 
for coupled-tanks system 614-15 
for spring-mass-damper system 610-11 

state variables 609-11, 613-14 

state vector 254, 612 

stationary points 842-5 

stationary values 407 

statistics and probability 
binomial distribution 953-7 
combinations 950-2 
continuous random variable 944-5 
continuous variable 933, 934, 936-8 
discrete random variable 943-4 
discrete variable 933, 934, 935 
distributions 933-53 
exponential distribution 962-3 
linking process control computers 948-9 
mean value (arithmetic mean) 938-40 
normal distribution 963-9 
permutations 948-50, 951-2 


Poisson distribution 957-61 
probability density functions 936-8 
random variables 934 
reliability engineering 970-7 
standard deviation 941-3 
of a continuous random variable 947 
of a discrete random variable 946-7 
uniform distribution 961 
see also probability 
steady-state error 657 
steady-state response 556 
step-index optical fibre, attenuation in 89-90 
step input 411 
step size 615, 616-18 
Stokes’ theorem 896-9 
straight line approximation 620 
sub-intervals 936, 971 
subscripts 46 
subsets 178 
substitution: 
back 287-90 
integration by 463-6 
subsystems, parallel 976 
subtraction: 
algebraic fraction 30-1 
complex numbers 329 
matrix 259-60 
vectors 228-30, 337 
success in a single trial 953 
sum to infinity 213 
summation notation 46-50 
Kirchhoff’s current law 47-8 
Kirchhoff’s voltage law 48-9 
summer 604—5, 689 
summing 890 
point 661, 662 
superposition principle 509 
suppressed carrier amplitude modulation 768 
surface integrals 889-95, 896, 898 
symmetric matrix 272 


t-w duality principle 768-70, 771 
take-off point 661 


‘take plus or minus the square root of the input’ 60, 64 
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‘take the positive square root of the input’ rule 60 


tangent (tan) 115, 358 
complex numbers 333-4 
differential equations 543 
functions 120-3 
gradient of 366-7 
hyperbolic 100 
integration 430, 434, 464 


line approximation see straight line approximation 


polar coordinates 161-2 
trigonometric equations 144, 147, 148 
trigonometric identities 125-8 
trigonometric ratios 116-20 
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Taylor polynomials: 


first-order see first-order Taylor polynomial 


fourth-order 518, 840 
non-linear system 511 
of nth order 517-21 
second-order 513-17, 837-40 
third-order 518-19, 840 

Taylor series 524-31 

Taylor series expansion 617, 620, 623, 751 
truncation 620 

Taylor series in two variables 840 


Taylor’s formula, and remainder term 521-4 


telescope drive signal 108 
third law of indices 7, 8 
third-order equation 684, 685 
third-order Taylor polynomials 518-19 
thyristor firing circuit 473-4 
time: 
constant 82, 545, 753 
delay see transport lag 
dependency 340 
displacement 132, 133, 723 
domain 340, 628, 661, 665 
interval 689 
linear time-invariant system 560-1 
see also discrete-time 
toolboxes 67 
‘top-hat’ function 777 
torque 613 


transfer functions 609, 627-8, 637, 659-68, 669 


Laplace transform 674-6 


rule 1: combining two transfer functions in series 661-2 
rule 2: eliminating a negative feedback loop 662-8 


system 663 
transform, discrete 795-801 
transients 556, 669-71 
response 659 
transistors 192-3 
transmission equation 834 
transmission lines 576-8 
transport lag 664-5 
transverse wave 343 
trapezium rule 496-500, 502 
tree diagram 907, 912, 921, 923-4 
triangle law 225-6, 228-30, 233, 336, 351 
triangular waveform 66 
trigonometric functions 115-53, 349 
cosine function 120-3 
degrees 116 
equations, trigonometric 144-50 
integration 431 
integration of 437-9 
inverse 121-3 
radians 116 
ratios, trigonometric 116-20 
sinc function 123-5 
sine function 120-3 


tangent function 120-3 
see also trigonometric identity 
trigonometric identity 345, 348 
Fourier series 724, 732 
integration 481-3 
trivial solution 294-6, 298 
true solution 615 
truth table 186-7 
exclusive OR gate 189 
full-adder circuit 191 
AND gate 184 
NAND gate 185 
NOR gate 184 
NOT gate 184 
OR gate 183 
turbines, wind 73 
turning points 407-9, 412, 414 
two ray propagation model 140-2 


unbounded sequence 205 
unconditional probability 920, 923 
underdamped response 667 
underdamped system 411 
uniform distribution 961 
union (set theory) 178 
unit impulse function see delta function 
unit ramp sequence 699-700 
unit step: 

function 108-10 

functions 712 

input 669 

sequence 202, 699 
unit vectors 231 
universal set 177 
upper bound 56, 499-500, 503-4 
upper limits 484 

of integrals 443-5 


Vandermonde matrix 291-2 
variables 58 
Boolean 185 
complex frequency 636 
continuous 962, 963 
dependent see dependent variable 
discrete 962 
dummy 451 
free 288-9 
generalized frequency 636 
independent see independent variable 
Laplace transform 637 
output 614 
random see random variables 
separation of 541-5, 549 
state 613-14 
variance 941-2, 946-7, 958 
vectors 1, 224-55 
addition 225-6, 336-7 


calculus 849-66 
curl 859-61, 861-4 
divergence 856-9, 861-4 
electric flux and Gauss’s law 857-8 
and electromagnetism 864-6 
electrostatic potential 854-5 
gradient 861-4 
partial differentiation of vectors 849-51, 852-3 
Poisson’s equation 863 
scalar field 861-2 
scalar field gradient 851-5 
vector field 861 
Cartesian components 232—40 
column 233, 258-9 
and complex numbers 336-7 
differentiation 423-6 
displacement 226, 227, 423 


electric field strength and electric displacement 240-1 


electrostatic potential 241 
equal 225 
field 240, 244, 867 
line and multiple integrals 890, 893 
free 225 
Hall effect in semiconductor 251-2 
input 612, 615 
integration of 493 
linearly dependent 238 
linearly independent 238 
magnetic field due to moving charge 250 
magnitude 225 
mesh current 254 
modulus 225 
of n dimensions 253-4 
negative 225 
operator 852 
orthogonal 232, 242 
output 612 
parallel 242 
partial differentiation 849-51, 852-3 
position 423-4 
Poynting 343-4 
product 246-53 
applications of 249-53 
determinants and 248-9, 279-80 
magnetic flux density and magnetic field strength 
249-50 
resolving force into two perpendicular directions 
227-8 
resultant of two forces acting on a body 227 
robot positions 237 
routing of automated vehicle 226-7 
row 233 
scalar multiple 230-1 
scalar product 241-5 
state 254, 612 
subtraction 228-30, 337 
translation and rotation of 267-71 
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unit 231 

work done by force 244 

zero 236 
Venn diagram 904, 906, 910, 914 
Venn diagrams 177, 178-9 
vertical strips 881-2, 884-5 
very-large-scale integration (VLSI) 192-3 
VHDL design languages 193 
virtual earth 436 
visual artefacts 800-1 
voltage 577-8, 580-2 

across a capacitor 636, 655 

across an inductor 368 

across capacitor 435 

gain 88, 90 

input 676 

law see Kirchhoff’s voltage law 

node 47-8, 310-12, 317-19 

non-ideal source 72-3 

output 676 

reflection coefficient 578-80 

saw-tooth waveform 66-7 

sinusoidal signals 134-5 
voltage controlled oscillator (VCO) 599 


wave: 
amplitude of 131 
backward 578-80 
combining 134-8 
destructive interfering 142 
equation 833 
forward 578-80 
input, square 753-4 
modelling, using sine and cosine 131-47 
number 138-44 
rectangular 743 
transverse 343 
two-ray propagation model 140-2 
waveforms 133 
alternating current 134 
periodic 477-8, 677-8, 723-6, 752 
saw-tooth 66-7, 472-3 
sawtooth 733-4, 742-3 
sinusoidal 340 
square 67 
time dependency of 340 
triangular 66 
waveguides 593-5 
wavelength 138-44 
weighted impulses 112, 703-4, 979-81 
whole numbers 176 
wind power turbines 73 
work done by force 244 
world coordinate frame 268 


x axis 58, 157, 159, 168, 171 
sequences and series 220 


1003 


1004 ‘Index 


x coordinate 154-5 
x-y plane 166, 169, 170, 171, 233, 336, 351 


y axis 58, 157 
matrix algebra 270-1 
y coordinate 154-5 


zaxis 157, 167, 169, 171 
z transforms 175, 200, 979, 981 
continuous signal sampling 702-4 
definition 698-702 
and difference equations 718-20 
and Laplace transform, relationship between 704-9 
mapping the s plane to the z plane 706-9 
properties of 709-15 
complex translation theorem 714 


direct inversion 717-18 
first shift theorem 711-12 
inversion 715-18 
linearity 710 
second shift theorem 712-14 
zero frequency component 736 
zero initial conditions 660, 663 
zero-input response 554-7, 573, 575 
zero order equations 684 
zero real part 670 
zero-state response 554-7, 573, 575 
zero vector 236 
zeros 668, 706, 709 
Laplace transform 668-75 
padding signals with 809-11 


